News

Welcome New MCL Member Jocelin Su

We are very happy to welcome a new MCL member, Jocelin Su. Here is a quick interview with Jocelin:

1. Could you briefly introduce yourself and your research interests?My name is Jocelin Su, a USC ECE Master’s student. I earned my B.S. from National Cheng Kung University. Working as a firmware engineer afterward sparked my passion for hardware-software integration. My research interests include signal processing, efficient machine learning, and edge computing. Outside of academics, I enjoy hiking, reading, and solving puzzles.2. What is your impression of MCL and USC?USC offers exceptional research resources and a strong alumni network. I am deeply impressed by MCL’s pioneering work in Green Learning. Its energy efficiency is ideal for edge devices. Furthermore, because Green Learning is fully explainable and mathematically transparent, it is uniquely suited for error-sensitive applications where traditional deep learning fails.3. What is your future expectation and plan in MCL?  I aim to collaborate with and learn from MCL’s Ph.D. members to master Green Learning’s methodology. My goal is to build reliable frameworks under hardware constraints, becoming a mature researcher who can confidently design and generalize these structured approaches to diverse signal processing and control problems.

By |June 28th, 2026|News|Comments Off on Welcome New MCL Member Jocelin Su|

Welcome New MCL Member Yijin Chen

We are very happy to welcome a new MCL member, Yijin Chen. Here is a quick interview with Yijin:

1. Could you briefly introduce yourself and your research interests?

My name is Yijin Chen. I joined MCL this summer. My main research interest is image recognition based on Green Learning, building mathematically interpretable visual recognition algorithms that can surpass the performance of deep networks without relying on backpropagation. Before joining MCL, I mainly focused on applications of artificial intelligence, as well as some work related to hardware, such as embedded systems and FPGA.

2. What is your impression of MCL and USC?

My first impression of MCL is that it is a very principled lab. Its emphasis on mathematical clarity, reproducible hard metrics, and the requirement that every module can be explained from beginning to end is very different from the “tune until it works” atmosphere I had encountered before. The USC campus itself is also beautiful, and the school’s culture is very appealing. I feel extremely proud to become a member of MCL and of USC.

3. What is your future expectation and plan in MCL?

MCL is a lab that places great emphasis on mathematical thinking, and I hope I can strengthen my own mathematical thinking here. Deep learning and Green Learning are two very different things, and I hope I can approach research on Green Learning from a more rational, more mathematically grounded perspective. We are currently working on advancing Green Learning into its next stage. In the long run, I hope I can make contributions to this work, and grow into a researcher who can explain every design choice from first principles.

By |June 21st, 2026|News|Comments Off on Welcome New MCL Member Yijin Chen|

Jie-En Yao Presented His Paper at CVPR 2026

CVPR 2026 was held in Denver, Colorado, bringing together thousands of researchers, engineers, and industry professionals from around the world. CVPR featured a broad range of technical sessions, keynote talks, tutorials, workshops, demonstrations, and industry exhibitions. 

I had the opportunity to present our paper, HCL-FF: Hierarchical and Contrastive Learning for Forward-Forward Algorithm, in the poster session. The work introduces a novel framework that combines hierarchical representation learning with contrastive objectives to improve the Forward-Forward algorithm, an alternative learning paradigm that seeks to move beyond conventional backpropagation. Throughout the session, I had engaging discussions with researchers working on biologically inspired learning, efficient neural network training, and alternative optimization methods.

Beyond the technical presentations, CVPR offered numerous networking opportunities. The industry exhibition featured demonstrations from leading technology companies and AI startups, showcasing state-of-the-art developments in computer vision applications ranging from autonomous systems and robotics to generative media and multimodal AI. The conference environment encouraged interactions between academic researchers and industry practitioners, creating a vibrant atmosphere for exchanging ideas.

By |June 14th, 2026|News|Comments Off on Jie-En Yao Presented His Paper at CVPR 2026|

Congratulations to Xinyu Wang for Passing her Defense

Congratulations to Xinyu Wang for passing her defense! Xinyu’s thesis is titled “ Towards Efficient Visual Perception: From Feature Learning to Visual Reasoning.” Here is a brief summary of her thesis:

Visual perception serves as a fundamental component of modern computer vision, enabling the interpretation of large-scale image and video data. As visual data continues to grow in complexity, there is an increasing demand for efficient and scalable frameworks that bridge low-level representations and high-level understanding. This dissertation addresses this challenge by exploring a unified trajectory of efficient visual perception, evolving from feature learning to spatio-temporal modeling and ultimately to scene-conditioned visual reasoning. This dissertation first introduces a statistics-based feature generation framework for image classification, built upon the Least-squares Normal Transform, which reformulates classification as a regression problem for efficient feature learning. It generates discriminative and complementary features, boosting decision learning and training convergence with low computational overhead. It then investigates a particularly challenging visual task, video camouflaged object detection. The proposed GreenVCOD is a lightweight framework that captures temporal context through a Temporal Neighborhood prediction cube, enabling implicit motion modeling without additional computational cost. Building upon this, IDM-VCOD introduces a dual-motion design that combines implicit semantic refinement with explicit motion alignment, along with a selective activation mechanism to balance accuracy and efficiency. Finally, this dissertation shifts toward a reasoning-based paradigm for visual perception. Camouflaged object detection is reformulated as a scene-conditioned pattern-deviation reasoning problem. By leveraging background-aware retrieval and prototype-based reasoning, it identifies subtle deviations without relying on pixel-level supervision. Overall, the proposed methods demonstrate that lightweight design, combined with structured statistical modeling and scene-conditioned reasoning, can effectively address challenging visual perception tasks without relying on heavy supervision or large-scale model training.

By |June 7th, 2026|News|Comments Off on Congratulations to Xinyu Wang for Passing her Defense|

Congratulations to Aolin Feng for Passing his Defense

Congratulations to Aolin Feng for passing his defense! Aolin’s thesis is titled “Green Image Coding: Principle, Implementation, and Performance Evaluation.” Here is a brief summary of his thesis:

This research introduces Green Image Coding (GIC), a novel framework for lightweight, modular, and scalable image compression founded on two pillars: multi-grid representation and vector quantization (VQ). Multi-grid representation decomposes images into hierarchical layers to reduce intra-layer content diversity, while VQ-based techniques encode each layer.

For the multi-grid representation pillar, we propose a multi-grid rate control theory that reduces high-dimensional optimization to sequential parameter decisions. Driven by this theory, a slope-matching-based rate control strategy is designed to improve scalability and rate-distortion (RD) balance. For the VQ pillar, we develop a suite of advanced techniques for efficient, scalable encoding: an RD-oriented codebook construction method built on tree-structured VQ (TSVQ) for multi-rate coding; a cascade VQ strategy to prevent early convergence in high-dimensional VQ; and a quadtree (QT) structure that combines multi-dimensional VQs, optimized via an iterative method to overcome statistical issues during rate-distortion-optimization (RDO).

Architecturally, the thesis evaluates single-grid versus multi-grid paradigms and analyzes independent parallel VQ versus residual-based cascade VQ regarding train-test match and content adaptivity. To mitigate compression artifacts, we propose a multi-tiered enhancement pipeline that decouples block-level structural learning (using residual VQ) from pixel-level refinements (using CNN-based post-processing).

Experimental results demonstrate that GIC achieves competitive coding efficiency with low theoretical complexity. Its modular, interpretable design offers strong potential for future improvement, functioning effectively as an extension module for existing codecs or as a foundation for future video coding research.

By |May 31st, 2026|News|Comments Off on Congratulations to Aolin Feng for Passing his Defense|

Congratulations to Kevin Yang for Receiving His PhD Degree

We would like to congratulate Kevin Yang for receiving his Ph.D. degree during the Viterbi Hooding Ceremony held on May 13, 2026, at the Bovard Auditorium. Here is a brief sharing of his Ph.D. experience at MCL:

My Ph.D. journey at the University of Southern California has been both challenging and rewarding. My dissertation, titled “Interpretable and Efficient Multimodal Data Interplay: Algorithms and Applications,” focuses on developing machine learning methods that better connect and understand information across different modalities, such as text and images. Through this research, I explored ways to improve the interpretability and efficiency of multimodal systems while designing algorithms that better align with human perception and understanding. This experience strengthened my passion for artificial intelligence and its potential to create meaningful real-world impact.

I am deeply grateful to my advisor, Professor C.-C. Jay Kuo, for his continuous guidance, encouragement, and support throughout my Ph.D. journey. His mentorship has profoundly influenced both my research perspective and personal growth. I would also like to sincerely thank my labmates for their collaboration, friendship, and inspiring discussions along the way. The supportive environment in our lab made challenging moments manageable and accomplishments even more meaningful. I am excited to continue this journey by joining LinkedIn as an AI Engineer in June 2026, where I hope to contribute to the development of impactful AI systems and applications. I will always cherish the experiences, lessons, and relationships built during my Ph.D. years.

By |May 24th, 2026|News|Comments Off on Congratulations to Kevin Yang for Receiving His PhD Degree|

PhD Hooding Ceremony 2026

Three MCL members attended the Viterbi PhD hooding ceremony on Wednesday, May 13, 2026, in the Bovard Auditorium. They were Tsung-Shan (Kevin) Yang, Aoling Feng, and Xinyu Wang. Congratulations to them on their accomplishments and on completing their PhD program at USC!

Tsung-Shan (Kevin) Yang received his Bachelor’s degree in Chemistry and Electrical Engineering from National Taiwan University (NTU) in 2019 and his Master’s degree in Electrical Engineering from NTU in 2021. His thesis is titled “Interpretable and Efficient Multi-Modal Data Interplay: Algorithms and Applications”. He has joined LinkedIn.

Aolin Feng received the B.S. degree in electronic information engineering and M.S. degree in information and communication engineering from the University of Science and Technology of China, Hefei, China, in 2019 and 2022, respectively. His thesis is titled “Green Image Coding: Principle, Implementation, and Performance Evaluation”. He will be joining Google.

Xinyu Wang received the B.S. degree in Electronic and Electrical Engineering from the University of Electronic Science and Technology of China (UESTC) in June 2019, and the M.S. degree in Electrical Engineering from the University of Southern California (USC) in 2021. She joined the Media Communications Lab in summer 2021. Her thesis is titled “Towards Efficient Visual Perception: From Feature Learning to Visual Reasoning”. She will be joining Google.

Congratulations to them all! We wish them all the best for their future!

By |May 17th, 2026|News|Comments Off on PhD Hooding Ceremony 2026|

MCL Research on LLM and Medical Application

Our recent work explores how multi-modality memory can improve medical agents for visual question answering. By storing and retrieving useful information from medical images, patient queries, and clinical reports, the agent can better understand complex cases and provide more accurate and context-aware responses. This direction shows the potential of memory-enhanced medical AI systems for supporting medical VQA tasks.

By |May 3rd, 2026|News|Comments Off on MCL Research on LLM and Medical Application|

MCL Research on Robust Machine Learning

Autoregressive video diffusion models can generate high-quality frames in real time, but are limited to short clips — push them further, and the KV cache silently discards past context, causing identity drift and quality collapse. We introduce MemRoPE, a training-free framework that solves this with two co-designed mechanisms: Memory Tokens compress evicted frames into evolving dual-rate EMA representations, while Online RoPE Indexing stores keys without positional encoding and applies it dynamically at attention time, keeping temporal aggregation mathematically valid. The result is unbounded video generation with a fixed-size cache — we demonstrate continuous one-hour generation that preserves subject identity and visual fidelity throughout.

By |April 26th, 2026|News|Comments Off on MCL Research on Robust Machine Learning|

MCL Research on Mouse Motion Behavior

Understanding animal motion behavior is important for understanding how the brain organizes memory, decision-making, and goal-directed action. In our research, we study mouse navigation behavior using the Morris Water Maze (MWM), a widely used behavioral paradigm for investigating spatial learning and memory in rodents. While conventional measures such as escape latency and path length provide useful summaries of performance, they often do not fully capture the rich and dynamic nature of movement during navigation.

Our research focuses on understanding mouse motion behavior at a finer temporal scale. Instead of treating each trial as a single behavioral unit, we examine how navigation strategies evolve within a trial, since mice may shift among exploration, wall-following, scanning, circling, and more direct platform-oriented movement over time. These within-trial changes can offer deeper insight into learning processes, behavioral flexibility, and group differences that may be overlooked by aggregate metrics alone.

To support this goal, we develop an interpretable and lightweight computational framework for analyzing tracked trajectories from behavioral videos. Our approach analyzes motion continuously over time and identifies sub-trajectory-level navigational states. The pipeline first corrects for tracking irregularities through uniform resampling and smoothing, then derives geometry- and kinematics-based descriptors such as curvature, displacement, turning behavior, and target alignment. These features are mapped to human-readable behavioral categories through a hierarchical rule-based inference process, followed by temporal refinement to reduce fragmented or implausible label switching.

This work emphasizes interpretability and practical usability in neuroscience research. By providing point-wise annotations and visually verifiable outputs, the framework enables behavioral phenotyping and hypothesis-driven analysis of strategy transitions and learning dynamics.

By |April 19th, 2026|News|Comments Off on MCL Research on Mouse Motion Behavior|