Scalable Graph-Guided Transformer for Point Cloud Geometry Coding

Ghafari, Mohammadreza; Guarda, André F. R.; Rodrigues, Nuno M. M.; Pereira, Fernando

Publicação

Scalable Graph-Guided Transformer for Point Cloud Geometry Coding

2025Artigo de investigação

datacite.subject.fos	Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática
datacite.subject.sdg	09:Indústria, Inovação e Infraestruturas
dc.contributor.author	Ghafari, Mohammadreza
dc.contributor.author	Guarda, André F. R.
dc.contributor.author	Rodrigues, Nuno M. M.
dc.contributor.author	Pereira, Fernando
dc.date.accessioned	2025-12-12T16:07:45Z
dc.date.available	2025-12-12T16:07:45Z
dc.date.issued	2025	en_US
dc.date.updated	2025-12-12T13:24:48Z
dc.description
dc.description.abstract	Attention models, particularly Transformers, have significantly advanced deep learning in fields like natural language processing and computer vision by capturing contextual relationships in both sequential and spatial data. This ability is valuable for Point Clouds (PC), which are unstructured sets of points in 3D space. Transformers can effectively identify correlations between distant points, allowing them to focus on the most critical regions of the data. To demonstrate this capability, this paper proposes a novel, scalable Graph-Guided Transformer model, labeled 2GFormer, for static PC geometry. This model is built using a scalable architecture that leverages Graph Convolutions to enhance a Relational Neighborhood SelfAttention (RNSA) base layer model. Both models are integrated into the JPEG Pleno Learning-based Point Cloud Coding (JPEG PCC) standard, resulting in the creation of two attention-enabled codecs for static PC coding: JPEG RNSA and JPEG 2GFormer. While JPEG RNSA codec delivers significant compression improvements for solid and dense PCs compared to the baseline JPEG PCC standard, JPEG 2GFormer extends these gains to solid, dense, and sparse PCs with only a marginal increase in model parameters. Additionally, JPEG 2GFormer outperforms both conventional and learning-based state-of-the-art PC codecs. These results position JPEG 2GFormer as a highly efficient solution for versatile PC coding.	eng
dc.description.sponsorship	This work was funded by the Fundação para a Ciência e a Tecnologia (FCT, Portugal) through the research project PTDC/EEI-COM/1125/2021, entitled “Deep Learning-based Point Cloud Representation.”
dc.description.version	N/A
dc.identifier.citation	M. Ghafari, A. F. R. Guarda, N. M. M. Rodrigues and F. Pereira, "Scalable Graph-Guided Transformer for Point Cloud Geometry Coding," in IEEE Transactions on Multimedia, doi: 10.1109/TMM.2025.3598605.
dc.identifier.doi	10.1109/tmm.2025.3598605	en_US
dc.identifier.issn	1520-9210	en_US
dc.identifier.issn	1941-0077	en_US
dc.identifier.slug	cv-prod-4622938
dc.identifier.uri	http://hdl.handle.net/10400.8/15021
dc.language.iso	eng
dc.peerreviewed	yes
dc.publisher	IEEE
dc.relation	Deep learning-based Point Cloud Representation
dc.relation.hasversion	https://ieeexplore.ieee.org/document/11123804
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/
dc.subject	Graph Convolutions
dc.subject	JPEG Pleno
dc.subject	Point Cloud Coding
dc.subject	Scalable Transformer
dc.subject	Self-Attention
dc.title	Scalable Graph-Guided Transformer for Point Cloud Geometry Coding	eng
dc.type	research article	en_US
dspace.entity.type	Publication
oaire.awardNumber	PTDC/EEI-COM/1125/2021
oaire.awardTitle	Deep learning-based Point Cloud Representation
oaire.awardURI	info:eu-repo/grantAgreement/FCT/3599-PPCDT/PTDC%2FEEI-COM%2F1125%2F2021/PT
oaire.citation.endPage	14
oaire.citation.startPage	1
oaire.citation.title	IEEE Transactions on Multimedia	en_US
oaire.fundingStream	3599-PPCDT
oaire.version	http://purl.org/coar/version/c_ab4af688f83e57aa
person.familyName	Guarda
person.familyName	M. M. Rodrigues
person.givenName	André
person.givenName	Nuno
person.identifier.ciencia-id	F811-146F-4EE9
person.identifier.ciencia-id	6917-B121-4E34
person.identifier.orcid	0000-0001-5996-1074
person.identifier.orcid	0000-0001-9536-1017
person.identifier.scopus-author-id	7006052345
project.funder.identifier	http://doi.org/10.13039/501100001871
project.funder.name	Fundação para a Ciência e a Tecnologia
rcaap.cv.cienciaid	6917-B121-4E34 \| NUNO MIGUEL MORAIS RODRIGUES
rcaap.rights	closedAccess	en_US
relation.isAuthorOfPublication	ab4d7e6e-b391-49ba-a618-a52fc62c8837
relation.isAuthorOfPublication	b4ebe652-7f0e-4e67-adb0-d5ea29fc9e69
relation.isAuthorOfPublication.latestForDiscovery	b4ebe652-7f0e-4e67-adb0-d5ea29fc9e69
relation.isProjectOfPublication	a018adbc-131c-448f-acdc-fad90c525470
relation.isProjectOfPublication.latestForDiscovery	a018adbc-131c-448f-acdc-fad90c525470

Ficheiros

Principais

A mostrar 1 - 2 de 2

Nome:: FINAL_VERSION.pdf
Tamanho:: 1.19 MB
Formato:: Adobe Portable Document Format

Ver/Abrir

Nome:: ScalableGraph-GuidedTransformerforPointCloud_Accepted_version.pdf
Tamanho:: 3.31 MB
Formato:: Adobe Portable Document Format
Descrição:: This article has been accepted for publication in IEEE Transactions on Multimedia. This is the author's version which has not been fully edited and content may change prior to final publication.

Ver/Abrir

Licença

A mostrar 1 - 1 de 1

Nome:: license.txt
Tamanho:: 1.33 KB
Formato:: Item-specific license agreed upon to submission
Descrição:

Ver/Abrir

Coleções

ESTG - Artigos em revistas internacionais