Compression of Multimodal Biomedical Images using Neural Networks

Financiador

Organização

Publicações

LOSSY COMPRESSION OF BIOMEDICAL IMAGES FOR COMPUTER VISION ANALYSIS

Publication . Paulo, Edgar da Silva; Faria, Sérgio Manuel Maciel de; Távora, Luís Miguel de Oliveira Pegado de Noronha e; Thomaz, Lucas Arrabal

The exponential increase in medical and biomedical data acquisition is compelled by technological advances, namely in the imaging field. However, this exponential growth brings with it challenges in terms of processing capacity, transmission, and data storage. In response to this growing demand, increasingly efficient solutions have emerged, especially through computer vision for automatic image analysis and compression algorithms. This dissertation aims, on the one hand, to evaluate the performance of computer vision systems on previously compressed biomedical images. On the other hand, it increases the useful range of image variations, almost lossless and lossy, decreasing the impact of the change added by this method on the performance of computer vision algorithms in biomedical image analysis. In this sense, YOLO and Detectron2 are employed to evaluate the impact of coding distortion on their ability to detect mitochondria in electron microscopy images. The results of this study reveal that although the distortion introduced by compression affects their detection performance, it is negligible at lower compression ratios. Furthermore, two proposals are presented to improve the useful compression ratio, keeping the images characteristics that allow to perform the automatic detection of mitochondria. On the one hand, it is demonstrated that the proposed training methodology, which incorporates compressed versions of the original data during training, mitigates the impact of distortion on the performance of computer vision algorithms; on the other hand, allocating higher quality levels to regions of interest, compared to background elements, helps to sustain high performance at compression rates where computer vision algorithms typically start to lose effectiveness. These approaches allow the extension of the compression range with little impact on detection performance, thus contributing to the improvement of data processing, storage, and transmission in biomedical applications.

2024-06-14Dissertação de mestrado

Acesso aberto

Ver mais

PROCESSING OF MICROSCOPY IMAGES IN THE COMPRESSED DOMAIN

Publication . Vasconcellos, Nicolas David Freire; Thomaz, Lucas Arrabal; Távora, Luís Miguel de Oliveira Pegado de Noronha e; Grilo, Carlos Fernando de Almeida; Miragaia, Rolando Lúcio Germano

This work presents a novel approach to the detection of mitochondria in microscopy images within the compressed domain, eliminating the need for full image decompression while maintaining high detection accuracy. The proposed architecture, Processing of Microscopy Images in the Compressed Domain (ProMIC), integrates a Domain Translator architecture, enabling object detection networks to operate directly on images’ latent representations. This approach significantly reduces computational costs while preserving essential visual information for detection tasks. The study also included a preliminary analysis of JPEG-AI compression efficiency against classical codecs (HEVC/H.265 and VVC/H.266) across multiple datasets. The proposed method builds upon a Base model inspired by a previous compresseddomain object detection approach. Key innovations in this work include: the use of a Lightweight Residual Block that enhances feature extraction from latent representations, improving detection robustness across different compression rates; a Pixel Domain Fine-Tuning process that enhances the Domain Adaptation of the model; and a Guided Domain Translator Training strategy that refines the Domain Translator to extract meaningful information directly from latent codes, minimising the gap between compressed and pixel-domain object detection models. Extensive experiments were conducted to compare the performance of both the Base model and the proposed architecture against a reference object detection model that operates on reconstructed images. Results on the Lucchi++ dataset show that ProMIC significantly outperforms the Base model. Notably, the proposed architecture exceeds the reference model in mean average precision, achieving a 2.31 percentage point improvement at the highest bitrate. This highlights the potential of compressed-domain image processing to capture meaningful features that may be lost in fully reconstructed images. Additionally, applying the ProMIC method reduced the complexity of the object detection system by 42.34% compared to the reference model. These findings validate the feasibility of direct object detection in latent codes, offering a computationally efficient alternative to traditional pixel-domain approaches. The proposed framework has promising applications in real-time processing, embedded systems, and large-scale AI-driven image analysis, paving the way for more efficient and scalable vision models that operate directly in the compressed domain.

2025-03-19Dissertação de mestrado

Acesso aberto

Ver mais

Explainable prototype-based image classification using adaptive feature extractors in medical images

Publication . Vasconcellos, Nicolas; Tavora, Luis M. N.; Miragaia, Rolando; Grilo, Carlos; Thomaz, Lucas

Prototype-based classifiers are a category of Explainable Artificial Intelligence methods that use representative samples from the data, called prototypes, to classify new inputs based on a similarity criterion. However, these methods often rely on pre-trained Convolutional Neural Networks as feature extractors, which may not be adapted for the specific type of data being used, thus not suited for identifying the most representative prototypes. In this paper, we propose a method named Explainable Prototype-based Image Classification, a cluster-oriented training strategy that enhances the performance and explainability of prototype-based classifiers. Our method uses a novel loss function, called Cluster Density Error, to fine-tune the feature extractor and preserve the most representative feature vectors in the latent space. We also use Principal Component Analysis-based approach to reduce the dimensionality and complexity of the feature vectors. We conduct experiments on four medical image datasets and compare the results with those from different prototype-based classifiers and state-of-the-art non-explainable learning methods. The proposed method demonstrated superior explainable capabilities and comparable classification performance to the compared methods. Specifically, the proposed method achieved up to 95.01% accuracy and 0.992 AUC using only 43 prototypes. This translated to an improvement in accuracy and AUC score of 21.54% and 9.06%, respectively, and a substantial reduction in the number of prototypes by 98,38%

2025-12Artigo científico

Acesso aberto

Ver mais

Entidade financiadora

Fundação para a Ciência e a Tecnologia

Programa de financiamento

3599-PPCDT

Número da atribuição

2022.09914.PTDC