Publication
OPTIMIZING IMAGE-BASED TASKS IN MANUFACTURING WITH RGB-D FUSION
datacite.subject.fos | Engineering and Technology::Electrical Engineering, Electronic Engineering, Information Engineering | pt_PT |
dc.contributor.advisor | Pereira, António Manuel de Jesus | |
dc.contributor.advisor | Rodrigues, Nuno Carlos Sousa | |
dc.contributor.advisor | Grilo, Carlos Fernando de Almeida | |
dc.contributor.author | Carreira, Daniel Soares | |
dc.date.accessioned | 2024-12-04T14:50:32Z | |
dc.date.embargo | 2027-11-15 | |
dc.date.issued | 2024-11-15 | |
dc.description.abstract | The manufacturing industry is undergoing a significant transformation with the onset of the Fourth Industrial Revolution. A key aspect of this shift is the integration of advanced technologies, such as smart sensors and automation, into production processes. Within this context, 3D cameras have become invaluable, enabling manufacturers to capture precise surface measurements of the products. In response, the computer vision community has begun exploring new methods to combine depth information with color data, enhancing existing solutions for classification and design generation. By harnessing these advancements, manufacturers can streamline production lines, reduce waste, and elevate product quality. This dissertation presents two key innovations for classification and generation tasks: (1) a novel branched Convolutional Neural Network (CNN), which achieves state-of-the-art performance in RGB-Depth (RGB-D) image classification, and (2) a novel branched Generative Adversarial Network (GAN), inspired by the same branched architecture, that delivers state-of-the-art results on the Stanford Cars dataset. The core idea of this branched approach is to specialize each branch to handle a specific modality. In the experiments, the classification performance improved by approximately 1%, while achieving nearly three times the speed of the next best method. For image generation, results varied depending on the dataset. On the Stanford Cars benchmark, the model showed slight improvements in image quality and better diversity. | pt_PT |
dc.identifier.tid | 203745973 | pt_PT |
dc.identifier.uri | http://hdl.handle.net/10400.8/10281 | |
dc.language.iso | eng | pt_PT |
dc.subject | Branched CNN | pt_PT |
dc.subject | Branched GAN | pt_PT |
dc.subject | Image Classification | pt_PT |
dc.subject | Image Generation | pt_PT |
dc.subject | Image Manipulation | pt_PT |
dc.subject | RGB-D Fusion | pt_PT |
dc.title | OPTIMIZING IMAGE-BASED TASKS IN MANUFACTURING WITH RGB-D FUSION | pt_PT |
dc.type | master thesis | |
dspace.entity.type | Publication | |
rcaap.rights | restrictedAccess | pt_PT |
rcaap.type | masterThesis | pt_PT |
thesis.degree.name | Mestrado em Engenharia Informática - Computação Móvel | pt_PT |
Files
Original bundle
- Name: MasterDissertation_signed.pdf
- Size: 40.57 MB
- Format: Adobe Portable Document Format
License bundle
- Name: license.txt
- Size: 1.32 KB
- Format: Item-specific license agreed upon to submission