Publication

OPTIMIZING IMAGE-BASED TASKS IN MANUFACTURING WITH RGB-D FUSION

datacite.subject.fos: Engineering and Technology::Electrical Engineering, Electronic Engineering, Information Engineering [pt_PT]
dc.contributor.advisor: Pereira, António Manuel de Jesus
dc.contributor.advisor: Rodrigues, Nuno Carlos Sousa
dc.contributor.advisor: Grilo, Carlos Fernando de Almeida
dc.contributor.author: Carreira, Daniel Soares
dc.date.accessioned: 2024-12-04T14:50:32Z
dc.date.embargo: 2027-11-15
dc.date.issued: 2024-11-15
dc.description.abstract: The manufacturing industry is undergoing a significant transformation with the onset of the Fourth Industrial Revolution. A key aspect of this shift is the integration of advanced technologies, such as smart sensors and automation, into production processes. Within this context, 3D cameras have become invaluable, enabling manufacturers to capture precise surface measurements of their products. In response, the computer vision community has begun exploring new methods to combine depth information with color data, enhancing existing solutions for classification and design generation. By harnessing these advancements, manufacturers can streamline production lines, reduce waste, and elevate product quality. This dissertation presents two key innovations for classification and generation tasks: (1) a novel branched Convolutional Neural Network (CNN), which achieves state-of-the-art performance in RGB-Depth (RGB-D) image classification, and (2) a novel branched Generative Adversarial Network (GAN), inspired by the same branched architecture, that delivers state-of-the-art results on the Stanford Cars dataset. The core idea of this branched approach is to specialize each branch to handle a specific modality. In the experiments, the classification performance improved by approximately 1%, while achieving nearly three times the speed of the next best method. For image generation, results varied depending on the dataset. On the Stanford Cars benchmark, the model showed slight improvements in image quality and better diversity. [pt_PT]
dc.identifier.tid: 203745973 [pt_PT]
dc.identifier.uri: http://hdl.handle.net/10400.8/10281
dc.language.iso: eng [pt_PT]
dc.subject: Branched CNN [pt_PT]
dc.subject: Branched GAN [pt_PT]
dc.subject: Image Classification [pt_PT]
dc.subject: Image Generation [pt_PT]
dc.subject: Image Manipulation [pt_PT]
dc.subject: RGB-D Fusion [pt_PT]
dc.title: OPTIMIZING IMAGE-BASED TASKS IN MANUFACTURING WITH RGB-D FUSION [pt_PT]
dc.type: master thesis
dspace.entity.type: Publication
rcaap.rights: restrictedAccess [pt_PT]
rcaap.type: masterThesis [pt_PT]
thesis.degree.name: Mestrado em Engenharia Informática - Computação Móvel [pt_PT]

Files

Original bundle
Name: MasterDissertation_signed.pdf
Size: 40.57 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.32 KB
Format: Item-specific license agreed upon to submission