Repository logo
 
Loading...
Project Logo
Research Project

Compression of Multimodal Biomedical Images using Neural Networks

Authors

Publications

LOSSY COMPRESSION OF BIOMEDICAL IMAGES FOR COMPUTER VISION ANALYSIS
Publication . Paulo, Edgar da Silva; Faria, Sérgio Manuel Maciel de; Távora, Luís Miguel de Oliveira Pegado de Noronha e; Thomaz, Lucas Arrabal
The exponential increase in medical and biomedical data acquisition is compelled by technological advances, namely in the imaging field. However, this exponential growth brings with it challenges in terms of processing capacity, transmission, and data storage. In response to this growing demand, increasingly efficient solutions have emerged, especially through computer vision for automatic image analysis and compression algorithms. This dissertation aims, on the one hand, to evaluate the performance of computer vision systems on previously compressed biomedical images. On the other hand, it increases the useful range of image variations, almost lossless and lossy, decreasing the impact of the change added by this method on the performance of computer vision algorithms in biomedical image analysis. In this sense, YOLO and Detectron2 are employed to evaluate the impact of coding distortion on their ability to detect mitochondria in electron microscopy images. The results of this study reveal that although the distortion introduced by compression affects their detection performance, it is negligible at lower compression ratios. Furthermore, two proposals are presented to improve the useful compression ratio, keeping the images characteristics that allow to perform the automatic detection of mitochondria. On the one hand, it is demonstrated that the proposed training methodology, which incorporates compressed versions of the original data during training, mitigates the impact of distortion on the performance of computer vision algorithms; on the other hand, allocating higher quality levels to regions of interest, compared to background elements, helps to sustain high performance at compression rates where computer vision algorithms typically start to lose effectiveness. These approaches allow the extension of the compression range with little impact on detection performance, thus contributing to the improvement of data processing, storage, and transmission in biomedical applications.
PROCESSING OF MICROSCOPY IMAGES IN THE COMPRESSED DOMAIN
Publication . Vasconcellos, Nicolas David Freire; Thomaz, Lucas Arrabal; Távora, Luís Miguel de Oliveira Pegado de Noronha e; Grilo, Carlos Fernando de Almeida; Miragaia, Rolando Lúcio Germano
This work presents a novel approach to the detection of mitochondria in microscopy images within the compressed domain, eliminating the need for full image decompression while maintaining high detection accuracy. The proposed architecture, Processing of Microscopy Images in the Compressed Domain (ProMIC), integrates a Domain Translator architecture, enabling object detection networks to operate directly on images’ latent representations. This approach significantly reduces computational costs while preserving essential visual information for detection tasks. The study also included a preliminary analysis of JPEG-AI compression efficiency against classical codecs (HEVC/H.265 and VVC/H.266) across multiple datasets. The proposed method builds upon a Base model inspired by a previous compresseddomain object detection approach. Key innovations in this work include: the use of a Lightweight Residual Block that enhances feature extraction from latent representations, improving detection robustness across different compression rates; a Pixel Domain Fine-Tuning process that enhances the Domain Adaptation of the model; and a Guided Domain Translator Training strategy that refines the Domain Translator to extract meaningful information directly from latent codes, minimising the gap between compressed and pixel-domain object detection models. Extensive experiments were conducted to compare the performance of both the Base model and the proposed architecture against a reference object detection model that operates on reconstructed images. Results on the Lucchi++ dataset show that ProMIC significantly outperforms the Base model. Notably, the proposed architecture exceeds the reference model in mean average precision, achieving a 2.31 percentage point improvement at the highest bitrate. This highlights the potential of compressed-domain image processing to capture meaningful features that may be lost in fully reconstructed images. Additionally, applying the ProMIC method reduced the complexity of the object detection system by 42.34% compared to the reference model. These findings validate the feasibility of direct object detection in latent codes, offering a computationally efficient alternative to traditional pixel-domain approaches. The proposed framework has promising applications in real-time processing, embedded systems, and large-scale AI-driven image analysis, paving the way for more efficient and scalable vision models that operate directly in the compressed domain.

Organizational Units

Description

Keywords

Contributors

Funders

Funding agency

Fundação para a Ciência e a Tecnologia

Funding programme

3599-PPCDT

Funding Award Number

2022.09914.PTDC

ID