MCL Research on Learning-based Image Coding

By Zhiruo Zhou | February 20, 2022 | News

Traditional image coding has achieved great success within four decades. Image coding standards have been developed and widely used today such as JPEG and JPEG-2000. Furthermore, intra coding schemes of modern video coding standards also provide very effective image coding solutions. Several powerful tools have been used to de-correlate the pixel values:
1. Block transform coding, which is used in the majority of the codecs where images are partitioned into blocks of different sizes and pixel values in blocks are transformed from the spatial domain to the spectrum domain for energy compaction before quantization and entropy coding.
2. Intra prediction, as another powerful tool that reduces the pixel correlation using pixel values from neighboring blocks at a low cost. Residuals after intra prediction are still coded by block transform coding.

Recently, deep-learning-based compression methods have attracted a lot of attention due to their superior rate-distortion performance. Compared with the traditional codecs, learned based codec has the following characteristic:
1. Inter correlations: Traditional image codecs only explore correlation in the same image while learning-based image codecs can exploit correlation from other images (i.e., inter-image correlation).
2. Multi-scale representation: Traditional image codecs only capture the representation with variable block size while learning-based image codecs can exploit the multi-scale representation based on pooling. In other words, traditional image codecs primarily explore correlation at the block level while learning-based image codecs can exploit short, middle, and long-range correlations using the multi-scale representation.
3. Advanced loss functions: different loss functions can be easily designed in learning-based schemes to fit the human visual system (HVS) and attention can be introduced to the learning-based schemes conveniently.

To achieve low-complexity learning-based image coding, we propose a multi-grid multi-block-size vector quantization (MGBVQ) method based on these characteristics.
1. Input images are decomposed into different representations of variable resolutions through Lanczos interpolation. We can get a set of downsampled images and their corresponding downsample residuals with regard to its neighbor representations.
2. With this different representation available, we can easily capture the correlations using VQ. We capture the long-range correlation in small representations. And short-range correlation in large representations.
3. Components like adaptive codebook selection are used to provide better rate-distortion gain. For example, it can use a small number of codewords for a smooth/simple image while using many codewords for a complex image.
4. Currently, we are working on the context-guided sub-codebook selection which utilizes the pre-decoded representation to find the suitable sub-codebook design.

— By Yifan Wang

About the Author: Zhiruo Zhou

Thesis Title: Green Unsupervised Single Object Tracking: Technologies and Performance Evaluation, September 2023. Employment: Apple, Inc., San Diego, California, USA The 170th PhD from MCL

MCL Research on Learning-based Image Coding

Share This Story, Choose Your Platform!

About the Author: Zhiruo Zhou

You May Also Like

MCL Research on Motion YOLO

MCL Research on Multi-Stage XGBoost

Welcome New MCL Member James Zhan

Welcome New MCL Member Alek Yegazarian

Welcome New MCL Member Jimmy Xiao

Welcome New MCL Member Kevin Lim

Welcome New MCL Member Qi Cao

MCL Research on Image Classification

Congratulations to Wei Wang for Passing Her Defense!

Congratulations to Qingyang Zhou for Passing His Defense!

MCL Research on Prostate Segmentation

MCL Research on Green Image Super-resolution

MCL Research on Nuclei Segmentation

MCL Research on Seismic Data Processing

MCL Research on Image Denoising

MCL Research on Video-Text Retrieval

MCL Research on Enhanced Object Detection

MCL Research on Video Camouflaged Object Detection (VCOD)

MCL Research on Image Dehazing

Reunion of MCL Alumni at Southern California

MCL Research on Image Demosaicing

MCL Research on Transfer Learning

MCL Research on EDA

Professor Kuo Gave a Keynote at AIxMM 2025

Welcome New MCL Member Cynthia Huang

Congratulations to Ganning Zhao for Passing Her Defense!

Welcome to the Spring 2025 semester!

Happy New Year!

Merry Christmas!

MCL Research on Green Learning for Medical Imaging

Research on Green Image Segmentation

MCL’s Thanksgiving Luncheon

MCL Research on Radar Signal Processing: Jamming signal detection

Congratulations to Professor Kuo for Receiving NTU Distinguished Alumni Award

MCL Research on Feedforward Visual Attention

MCL Research on Word Embedding Dimension Reduction

Welcome New MCL Member Hong-En Chen

Welcome New MCL Member Laurence Palmer

Professor C.-C. Jay Kuo Named Inaugural Ming Hsieh Chair Holder

Welcome New MCL Member Alexander Jou

Welcome New MCL Member Qixin Hu

Welcome New MCL Member Youngrae Kim

Congratulations to Vasileios Magoulianitis for Passing His Defense

Congratulations to Zhanxuan Mei for Passing His Defense

MCL Research on Supervised Feature Learning

MCL Research on Green Saliency-guided Blind Image Quality Assessment (GSBIQA)

MCL Research on Green Raw Image Demosaicking

MCL Research on Green Saliency-guided Blind Image Quality Assessment (GSBIQA)

MCL Research on Prostate Lesion Detection from MRI Images

MCL Research on Green Image Super-resolution

Professor Kuo Attended ICME in Niagara Falls, Canada

MCL Research on 3D Perception with Large Foundational Models

MCL Research on Prostate MRI Image Segmentation

Professor Kuo Met MCL Alumni in Thailand

Professor Kuo visited Singapore

Professor Kuo met MCL Alumni in Taiwan

MCL Research on Nuclei Segmentation for Histological Images

MCL Research on Seismic Data Processing

MCL Research on Point Cloud Surface Reconstruction

Congratulations to Chengwei Wei for Passing His Defense

Congratulations on MCL Members Attending Ph.D. Hooding Ceremony

Welcome New MCL Member Dingyi Nie

MCL Research on Parsing Tree Construction

MCL Research on Video Camouflaged Object

MCL Research on Green Learning for Electronic Design Automation (EDA)

MCL Research on 3D Perception with Large Foundational Models

MCL Research on Green Image Coding

MCL Research on Green Point Cloud Surface Reconstruction

MCL Research on POS Tagging Prediction

MCL Research on Saliency Detection Method

Welcome New MCL Member Xuechun Hua

MCL Research on Transfer Learning

MCL Research on Image Demosaicing

MCL Research on LQBoost Regressor

MCL Research on SLMBoost Classifier

Congratulations to Xuejing Lei for Passing Her Defense