Percorrer por autor "Faria, Sérgio M. M. de"
A mostrar 1 - 10 de 12
Resultados por página
Opções de ordenação
- Efficient depth map coding using linear residue approximation and a flexible prediction frameworkPublication . Lucas, Luis F. R.; Rodrigues, Nuno M. M.; Pagliari, Carla L.; Silva, Eduardo A. B. da; Faria, Sérgio M. M. deThe importance to develop more efficient 3D and multiview data representation algorithms results from the recent market growth for 3D video equipments and associated services. One of the most investigated formats is video+depth which uses depth image based rendering (DIBR) to combine the information of texture and depth, in order to create an arbitrary number of views in the decoder. Such approach requires that depth information must be accurately encoded. However, methods usually employed to encode texture do not seem to be suitable for depth map coding. In this paper we propose a novel depth map coding algorithm based on the assumption that depth images are piecewise-linear smooth signals. This algorithm is designed to encode sharp edges using a flexible dyadic block segmentation and hierarchical intra-prediction framework. The residual signal from this operation is aggregated into blocks which are approximated using linear modeling functions. Furthermore, the proposed algorithm uses a dictionary that increases the coding efficiency for previously used approximations. Experimental results for depth map coding show that synthesized views using the depth maps encoded by the proposed algorithm present higher PSNR than their counterparts, demonstrating the method’s efficiency.
- Efficient Recurrent Pattern Matching Video CodingPublication . Francisco, Nelson C.; Rodrigues, Nuno M. M.; B. da Silva, Eduardo A.; Bresciani de Carvalho, Murilo; Faria, Sérgio M. M. deIn this paper, we propose a pattern-matchingbased algorithm for video compression. This algorithm, named multidimensional multiscale parser (MMP)-Video, is based on the H.264/AVC video encoder, but uses a pattern-matching paradigm instead of the state-of-the-art transform-quantizationentropy encoding approach. The proposed method adopts the use of multiscale recurrent patterns to compress both spatial and temporal prediction residues, totally replacing the use of transforms and quantization. Experimental results show that the coding performance of MMP-Video is better than the one of H.264/AVC high profile, especially for medium to high bit-rates. The gains range up to 0.7 dB, showing that, in spite of its larger computational complexity, the use of multiscale recurrent pattern matching paradigm deserves being investigated as an alternative for video compression.
- Evaluation of Focus Metrics in Extended Depth-of-field ReconstructionPublication . Filipe, Jose N.; Távora, Luis M.N.; Assunção, Pedro A. A.; Fonseca-Pinto, Rui; Faria, Sérgio M. M. deThe performance of focus metrics in the evaluation of refocused images from light fields is investigated in this paper. To this aim, the paper presents a comprehensive study on the performance of a large set of different focus metrics (34 in total) in the evaluation of patch-based extended depth-of-field reconstructed images (e.g., all-in-focus). The new findings of this work demonstrate that optimal reconstruction of extended depth-of-field images, from light fields captured with focused plenoptic cameras, is not consistent with the focus level figures computed by available focus metrics. The results show that a higher focus level, as given by such computational methods, does not actually correspond to better all-in-focus images, thus indicating that currently available focus metrics are not suitable for evaluating the quality of extended depth-of-field reconstruction. The results obtained through subjective evaluation confirm that objective focus metrics fail to indicate the best focused images. Finally, the paper presents recommendations to adapt existing metrics to extended depth-of-field reconstruction in plenoptic imaging.
- Improving multiscale recurrent pattern image coding with least-squares prediction modePublication . Graziosi, Danillo B.; Rodrigues, Nuno M. M.; Silva, Eduardo A. B. da; Faria, Sérgio M. M. de; Carvalho, Murilo B. de; Faria, Sergio; M. M. Rodrigues, Nuno;The Multidimensional Multiscale Parser-based (MMP) image coding algorithm, when combined with flexible partitioning and predictive coding techniques (MMP-FP), provides state-of-the-art performance. In this paper we investigate the use of adaptive least-squares prediction in MMP. The linear prediction coefficients implicitly embed the local texture characteristics, and are computed based on a block’s causal neighborhood (composed of already reconstructed data). Thus, the intra prediction mode is adaptively adjusted according to the local context and no extra overhead is needed for signaling the coefficients. We add this new context-adaptive linear prediction mode to the other MMP prediction modes, that are based on the ones used in H.264/AVC; the best mode is chosen through rate-distortion optimization. Simulation results show that least-squares prediction is able to significantly increase MMP-FPs rate-distortion performance for smooth images, leading to better results than the ones of state-of-theart, transform-based methods. Yet with the addition of least-squares prediction MMP-FP presents no performance loss when used for encoding non-smooth images, such as text and graphics.
- Integer DCT Approximation With Arbitrary Size and Adjustable PrecisionPublication . Thomaz, Lucas A.; Assunção, Pedro A. A.; Tavora, Luis M. N.; Faria, Sérgio M. M. deThis letter proposes a method to obtain integer reversible discrete cosine transforms for generic transform-based coding schemes. The novelty of the proposed method, which is based on decomposition of the DCT-II matrix into two triangular and one diagonal matrices, is twofold: (i) the new matrices can be of arbitrary size, i.e., any square N\times N dimension, thus suitable for applications where non power-of-2 dimensions are required; (ii) they can be designed with adjustable precision in a trade-off with the number of representation bits. Furthermore, improvements are also proposed over the base scheme to avoid numerical issues when working with large matrices and to obtain more reliable approximations. The performance evaluation demonstrate the effectiveness of the proposed transforms to approximate the coding gain capabilities of the original DCT-II.
- Intra-prediction for color image coding using YUV correlationPublication . Lucas, Luís F.R.; Rodrigues, Nuno M. M.; Faria, Sérgio M. M. de; Silva, Eduardo A. B. da; Carvalho, Murilo B. de; Silva, Vitor M. M. daIn this paper we present a new algorithm for chroma prediction in YUV images, based on inter component correlation. Despite the YUV color space transformation for inter component decorrelation, some dependency still exists between the Y, U and V chroma components. This dependency has been previously used to predict the chrominance data from the reconstructed luminance. In this paper we show that a chrominance component can be more efficiently predicted by using the reconstructed data from both the luminance and the remaining chrominance signal. The proposed chroma prediction is implemented and tested using the Multidimensional Multiscale Parser (MMP) image encoding algorithm. It is shown that the new color prediction mode outperforms the originally proposed prediction methods. Furthermore, by using the new color prediction scheme, MMP is consistently better than the state-of-the-art H.264/AVC for coding both for the luminance and the chrominance image components.
- Light field image Coding using high-order intrablock predictionPublication . Monteiro, Ricardo J. S.; Nunes, Paulo J. L.; Rodrigues, Nuno M. M.; Faria, Sérgio M. M. deThis paper proposes a two-stage high-order intrablock prediction method for light field image coding. This method exploits the spatial redundancy in lenslet light field images by predicting each image block, through a geometric transformation applied to a region of the causal encoded area. Light field images comprise an array of microimages that are related by complex geometric transformations that cannot be efficiently compensated by state-of-the-art image coding techniques, which are usually based on low-order translational prediction models. The two-stage nature of the proposed method allows us to choose the order of the prediction model most suitable for each block, ranging from pure translations to projective or bilinear transformations, optimized according to an appropriate rate-distortion criterion. The proposed higher order intrablock prediction approach was integrated into a high efficiency video coding (HEVC) codec and evaluated for both unfocused and focused light field camera models, using different resolutions and microlens arrays. Experimental results show consistent bitrate savings, which can go up to 12.62%, when compared to a lower order intrablock prediction solution and 49.82% when compared to HEVC still picture coding.
- A method to improve HEVC lossless coding of volumetric medical imagesPublication . F.R. Guarda, André F. R.; Santos, João M.; Cruz, Luís A. da Silva; Assunção, Pedro A. A.; Rodrigues, Nuno M. M.; Faria, Sérgio M. M. deMedical imaging technology and applications are continuously evolving, dealing with images of increasing spatial and temporal resolutions, which enable more accurate processing, feature analysis and medical diagnosis. However, the increase in resolution also requires a growing amount of data to be stored, processed and exchanged or transmitted through networks. Despite the high coding efficiency achieved by the most recent image and video coding standards in lossy compression, they are not well suited for quality-critical medical image compression where either near-lossless or lossless coding is required. In this paper we propose a method to improve lossless coding of volumetric medical images, such as Magnetic Resonance and Computed Tomography, and medical sequences such as X-ray Angiography images, using the latest standard High Efficiency Video Encoder (HEVC). New pixel-wise prediction techniques are proposed to extend the current HEVC lossless tools, based on Least-Squares Prediction (LSP). Experimental results show a bitrate reduction of over 44%, when compared to DICOM recommended encoders, and 13.8% when compared to standard lossless HEVC, for 8 bpp volumetric images, and over 8% and 4.6%, respectively, for volumetric images using more than 8 bpp.
- Multiscale recurrent pattern matching approach for depth map codingPublication . Graziosi, Danillo B.; Rodrigues, Nuno M. M.; Pagliari, Carla L.; Silva, Eduardo A. B. da; Faria, Sérgio M. M. de; Perez, Marcelo M.; Carvalho, Murilo B. deIn this article we propose to compress depth maps using a coding scheme based on multiscale recurrent pattern matching and evaluate its impact on depth image based rendering (DIBR). Depth maps are usually converted into gray scale images and compressed like a conventional luminance signal. However, using traditional transform-based encoders to compress depth maps may result in undesired artifacts at sharp edges due to the quantization of high frequency coefficients. The Multidimensional Multiscale Parser (MMP) is a pattern matching-based encoder, that is able to preserve and efficiently encode high frequency patterns, such as edge information. This ability is critical for encoding depth map images. Experimental results for encoding depth maps show that MMP is much more efficient in a rate-distortion sense than standard image compression techniques such as JPEG2000 or H.264/AVC. In addition, the depth maps compressed with MMP generate reconstructed views with a higher quality than all other tested compression algorithms.
- Recurrent pattern matching based stereo image coding using linear predictorsPublication . Lucas, Luís F. R.; M. M. Rodrigues, Nuno; Pagliari, Carla L.; Silva, Eduardo A. B. da; Faria, Sérgio M. M. deThe Multidimensional Multiscale Parser (MMP) is a pattern-matching-based generic image encoding solution which has been investigated earlier for the compression of stereo images with successful results. While first MMP-based proposals for stereo image coding employed dictionary-based techniques for disparity compensation, posterior developments have demonstrated the advantage of using predictive methods. In this paper, we focus on recent investigations on the use of predictive methods in the MMP algorithm and propose a new prediction framework for efficient stereo image coding. This framework comprises an advanced intra directional prediction model and a new linear predictive scheme for efficient disparity compensation. The linear prediction model is the main novelty of this work, combining adaptive linear models estimated by least-squares algorithm with fixed linear models provided by the block-matching algorithm. The performance of the proposed intra prediction and disparity compensation methods when applied in an MMP encoder has been evaluated experimentally. Comparisons with the current stereo image coding standards showed that the proposed MMP algorithm significantly outperforms the Stereo High Profile of H.264/AVC standard. In addition, it presents a competitive performance relative to the MV-HEVC standard. These results also suggest that current stereo image coding standards may benefit from the proposed linear prediction scheme for disparity compensation, as an extension to the omnipresent block-matching solution.
