Adaptive bridge model for compressed domain point cloud classification

Seleem, Abdelrahman; Guarda, André F. R.; Rodrigues, Nuno M. M.; Pereira, Fernando

Publication

2024-06-08 Journal article

dc.contributor.author	Seleem, Abdelrahman
dc.contributor.author	Guarda, André F. R.
dc.contributor.author	Rodrigues, Nuno M. M.
dc.contributor.author	Pereira, Fernando
dc.date.accessioned	2025-01-08T15:01:54Z
dc.date.available	2025-01-08T15:01:54Z
dc.date.issued	2024-06-08
dc.date.updated	2024-12-28T10:15:07Z
dc.description.abstract	The recent adoption of deep learning-based models for the processing and coding of multimedia signals has brought noticeable gains in performance, which have established deep learning-based solutions as the uncontested state-of-the-art both for computer vision tasks, targeting machine consumption, and, more recently, for coding applications, targeting human visualization. Traditionally, applications requiring both coding and computer vision processing require first decoding the bitstream and then applying the computer vision methods to the decompressed multimedia signals. However, the adoption of deep learning-based solutions enables the use of compressed domain computer vision processing, with gains in performance and computational complexity over the decompressed domain approach. For point clouds (PCs), these gains have been demonstrated in the single available compressed domain computer vision processing solution, named Compressed Domain PC Classifier, which processes JPEG Pleno PC coding (PCC) compressed streams using a PC classifier largely compatible with the state-of-the-art spatial domain PointGrid classifier. However, the available Compressed Domain PC Classifier presents strong limitations by imposing a single, specific input size which is associated with specific JPEG Pleno PCC configurations; this limits the compression performance as these configurations are not ideal for all PCs due to their different characteristics, notably density. To overcome these limitations, this paper proposes the first Adaptive Compressed Domain PC Classifier solution, which includes a novel adaptive bridge model that makes it possible to process JPEG Pleno PCC encoded bitstreams using different coding configurations, now maximizing the compression efficiency.
Experimental results show that the novel Adaptive Compressed Domain PC Classifier allows JPEG PCC to achieve better compression performance by not imposing a single, specific coding configuration for all PCs, regardless of their different characteristics. Moreover, the added adaptability power can achieve slightly better PC classification performance than the previous Compressed Domain PC Classifier and largely better PC classification performance (and lower number of weights) than the PointGrid PC classifier working in the decompressed domain.	pt_PT
dc.description.version	info:eu-repo/semantics/publishedVersion	pt_PT
dc.identifier.citation	Seleem, A., Guarda, A.F.R., Rodrigues, N.M.M. et al. Adaptive bridge model for compressed domain point cloud classification. J Image Video Proc. 2024, 13 (2024). https://doi.org/10.1186/s13640-024-00631-6	pt_PT
dc.identifier.doi	https://doi.org/10.1186/s13640-024-00631-6	pt_PT
dc.identifier.eissn	1687-5281
dc.identifier.slug	cv-prod-4248122
dc.identifier.uri	http://hdl.handle.net/10400.8/10358
dc.language.iso	eng	pt_PT
dc.peerreviewed	yes	pt_PT
dc.publisher	SpringerOpen	pt_PT
dc.relation	Deep learning-based Point Cloud Representation
dc.relation.publisherversion	https://jivp-eurasipjournals.springeropen.com/articles/10.1186/s13640-024-00631-6#citeas	pt_PT
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	pt_PT
dc.subject	Point cloud	pt_PT
dc.subject	Classification	pt_PT
dc.subject	Coding	pt_PT
dc.subject	Compressed domain	pt_PT
dc.subject	Deep learning	pt_PT
dc.title	Adaptive bridge model for compressed domain point cloud classification	pt_PT
dc.type	journal article
dspace.entity.type	Publication
oaire.awardTitle	Deep learning-based Point Cloud Representation
oaire.awardURI	info:eu-repo/grantAgreement/FCT/3599-PPCDT/PTDC%2FEEI-COM%2F1125%2F2021/PT
oaire.citation.endPage	27	pt_PT
oaire.citation.issue	13	pt_PT
oaire.citation.startPage	1	pt_PT
oaire.citation.title	EURASIP Journal on Image and Video Processing	pt_PT
oaire.citation.volume	2024	pt_PT
oaire.fundingStream	3599-PPCDT
person.familyName	M. M. Rodrigues
person.givenName	Nuno
person.identifier.orcid	0000-0001-9536-1017
person.identifier.scopus-author-id	7006052345
project.funder.identifier	http://doi.org/10.13039/501100001871
project.funder.name	Fundação para a Ciência e a Tecnologia
rcaap.cv.cienciaid	6917-B121-4E34 | NUNO MIGUEL MORAIS RODRIGUES
rcaap.rights	openAccess	pt_PT
rcaap.type	article	pt_PT
relation.isAuthorOfPublication	b4ebe652-7f0e-4e67-adb0-d5ea29fc9e69
relation.isAuthorOfPublication.latestForDiscovery	b4ebe652-7f0e-4e67-adb0-d5ea29fc9e69
relation.isProjectOfPublication	a018adbc-131c-448f-acdc-fad90c525470
relation.isProjectOfPublication.latestForDiscovery	a018adbc-131c-448f-acdc-fad90c525470