ALGORITHMS FOR BUBBLE SEGMENTATION IN LIQUID-GAS MIXTURES

Pinheiro, Rafael FernandesPinto, Rui FonsecaOliveira, João Rafael Freitas2025-12-102025-12-102025-11-03http://hdl.handle.net/10400.8/14967The accurate, automated analysis of liquid-gas mixtures is essential for advancing numerous scientific and industrial fields, with the precise segmentation of individual bubbles being a critical step. This thesis addresses the challenge of bubble instance segmentation using deep learning. A significant practical constraint was the availability of a large real-world dataset that lacked pixel-level annotations, making traditional supervised training infeasible. To overcome this, a methodology based on training models with synthetically generated data was developed. A comparative study of various Machine Learning architectures was conducted to identify an optimal approach for this task. The results demonstrate that a Mask R-CNN model with a ResNet-50 backbone provides the most effective performance, establishing a strong baseline for the segmentation task. The primary limitation observed was the performance degradation when applying the synthetically-trained model to real-world images, a result of the domain gap between the two datasets. This effect was particularly evident in images with dense or overlapping bubble formations. In summary, this research successfully validates the viability of leveraging synthetic data to train deep learning models for bubble segmentation. The work provides a foundational framework, establishes a robust performance benchmark, and identifies the domain gap as the central challenge for achieving high-fidelity results on realworld imagery.A análise precisa e automatizada de misturas líquido-gás é essencial para o avanço de inúmeros campos científicos e industriais, sendo a segmentação exata de bolhas individuais um passo crítico. Esta tese aborda o desafio da segmentação de instâncias de bolhas utilizando deep learning. Uma restrição prática significativa foi a disponibilidade de um grande conjunto de dados do mundo real que carecia de anotações, tornando o treino supervisionado tradicional inviável. Para superar esta limitação, foi desenvolvida uma metodologia baseada no treino de modelos com dados gerados sinteticamente. Foi realizado um estudo comparativo de várias arquiteturas de Machine Learning para identificar a abordagem ideal para esta tarefa. Os resultados demonstram que um modelo Mask R-CNN com uma backbone ResNet-50 proporciona o desempenho mais eficaz, estabelecendo uma base de referência sólida para a tarefa de segmentação. A principal limitação observada foi a degradação do desempenho ao aplicar o modelo treinado com dados sintéticos a imagens do mundo real, resultado dum domain-gap entre os dois conjuntos de dados. Este efeito foi particularmente evidente em imagens com formações de bolhas densas ou sobrepostas. Em suma, esta investigação valida com sucesso a viabilidade de utilizar dados sintéticos para treinar modelos de deep learning para a segmentação de bolhas. O trabalho fornece uma estrutura fundamental, estabelece um padrão de desempenho robusto e identifica o domain-gap como o desafio central para alcançar resultados de alta fidelidade em imagens do mundo real.engBubble segmentationInstance segmentationDeep learningSynthetic dataDomain adaptationLiquid-Gas mixturesALGORITHMS FOR BUBBLE SEGMENTATION IN LIQUID-GAS MIXTURESmaster thesis204075718