Publication
Scalable Graph-Guided Transformer for Point Cloud Geometry Coding
| datacite.subject.fos | Engineering and Technology::Electrical Engineering, Electronic Engineering, Information Engineering | |
| datacite.subject.sdg | 09:Industry, Innovation and Infrastructure | |
| dc.contributor.author | Ghafari, Mohammadreza | |
| dc.contributor.author | Guarda, André F. R. | |
| dc.contributor.author | Rodrigues, Nuno M. M. | |
| dc.contributor.author | Pereira, Fernando | |
| dc.date.accessioned | 2025-12-12T16:07:45Z | |
| dc.date.available | 2025-12-12T16:07:45Z | |
| dc.date.issued | 2025 | en_US |
| dc.date.updated | 2025-12-12T13:24:48Z | |
| dc.description | ||
| dc.description.abstract | Attention models, particularly Transformers, have significantly advanced deep learning in fields like natural language processing and computer vision by capturing contextual relationships in both sequential and spatial data. This ability is valuable for Point Clouds (PCs), which are unstructured sets of points in 3D space. Transformers can effectively identify correlations between distant points, allowing them to focus on the most critical regions of the data. To demonstrate this capability, this paper proposes a novel, scalable Graph-Guided Transformer model, labeled 2GFormer, for static PC geometry. This model is built using a scalable architecture that leverages Graph Convolutions to enhance a Relational Neighborhood Self-Attention (RNSA) base layer model. Both models are integrated into the JPEG Pleno Learning-based Point Cloud Coding (JPEG PCC) standard, resulting in two attention-enabled codecs for static PC coding: JPEG RNSA and JPEG 2GFormer. While the JPEG RNSA codec delivers significant compression improvements for solid and dense PCs compared to the baseline JPEG PCC standard, JPEG 2GFormer extends these gains to solid, dense, and sparse PCs with only a marginal increase in model parameters. Additionally, JPEG 2GFormer outperforms both conventional and learning-based state-of-the-art PC codecs. These results position JPEG 2GFormer as a highly efficient solution for versatile PC coding. | eng |
| dc.description.sponsorship | This work was funded by the Fundação para a Ciência e a Tecnologia (FCT, Portugal) through the research project PTDC/EEI-COM/1125/2021, entitled “Deep Learning-based Point Cloud Representation.” | |
| dc.description.version | N/A | |
| dc.identifier.citation | M. Ghafari, A. F. R. Guarda, N. M. M. Rodrigues and F. Pereira, "Scalable Graph-Guided Transformer for Point Cloud Geometry Coding," in IEEE Transactions on Multimedia, doi: 10.1109/TMM.2025.3598605. | |
| dc.identifier.doi | 10.1109/tmm.2025.3598605 | en_US |
| dc.identifier.issn | 1520-9210 | en_US |
| dc.identifier.issn | 1941-0077 | en_US |
| dc.identifier.slug | cv-prod-4622938 | |
| dc.identifier.uri | http://hdl.handle.net/10400.8/15021 | |
| dc.language.iso | eng | |
| dc.peerreviewed | yes | |
| dc.publisher | IEEE | |
| dc.relation | Deep learning-based Point Cloud Representation | |
| dc.relation.hasversion | https://ieeexplore.ieee.org/document/11123804 | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | Graph Convolutions | |
| dc.subject | JPEG Pleno | |
| dc.subject | Point Cloud Coding | |
| dc.subject | Scalable Transformer | |
| dc.subject | Self-Attention | |
| dc.title | Scalable Graph-Guided Transformer for Point Cloud Geometry Coding | eng |
| dc.type | research article | en_US |
| dspace.entity.type | Publication | |
| oaire.awardTitle | Deep learning-based Point Cloud Representation | |
| oaire.awardURI | info:eu-repo/grantAgreement/FCT/3599-PPCDT/PTDC%2FEEI-COM%2F1125%2F2021/PT | |
| oaire.citation.endPage | 14 | |
| oaire.citation.startPage | 1 | |
| oaire.citation.title | IEEE Transactions on Multimedia | en_US |
| oaire.fundingStream | 3599-PPCDT | |
| oaire.version | http://purl.org/coar/version/c_ab4af688f83e57aa | |
| person.familyName | Guarda | |
| person.familyName | M. M. Rodrigues | |
| person.givenName | André | |
| person.givenName | Nuno | |
| person.identifier.ciencia-id | F811-146F-4EE9 | |
| person.identifier.ciencia-id | 6917-B121-4E34 | |
| person.identifier.orcid | 0000-0001-5996-1074 | |
| person.identifier.orcid | 0000-0001-9536-1017 | |
| person.identifier.scopus-author-id | 7006052345 | |
| project.funder.identifier | http://doi.org/10.13039/501100001871 | |
| project.funder.name | Fundação para a Ciência e a Tecnologia | |
| rcaap.cv.cienciaid | 6917-B121-4E34 | NUNO MIGUEL MORAIS RODRIGUES | |
| rcaap.rights | closedAccess | en_US |
| relation.isAuthorOfPublication | ab4d7e6e-b391-49ba-a618-a52fc62c8837 | |
| relation.isAuthorOfPublication | b4ebe652-7f0e-4e67-adb0-d5ea29fc9e69 | |
| relation.isAuthorOfPublication.latestForDiscovery | b4ebe652-7f0e-4e67-adb0-d5ea29fc9e69 | |
| relation.isProjectOfPublication | a018adbc-131c-448f-acdc-fad90c525470 | |
| relation.isProjectOfPublication.latestForDiscovery | a018adbc-131c-448f-acdc-fad90c525470 |
Files
Main
- Name:
- FINAL_VERSION.pdf
- Size:
- 1.19 MB
- Format:
- Adobe Portable Document Format
- Name:
- ScalableGraph-GuidedTransformerforPointCloud_Accepted_version.pdf
- Size:
- 3.31 MB
- Format:
- Adobe Portable Document Format
- Description:
- This article has been accepted for publication in IEEE Transactions on Multimedia. This is the author's version which has not been fully edited and content may change prior to final publication.
License
- Name:
- license.txt
- Size:
- 1.33 KB
- Format:
- Item-specific license agreed upon to submission