Congratulations to Shangwen Li for Passing PhD Defense

By Ruiyuan Lin | December 18, 2016 | News

Congratulations to Shangwen Li for passing his defense on December 1, 2016. His Ph.D. thesis is entitled “Multimodal Image Retrieval and Object Classification Using Deep Learning Features”.

Abstract of thesis:

Computer vision has achieved a major breakthrough in recent years with the advancement of deep learning based methods. However, its performance is still yet to be claimed as robust for practical applications, and more advanced methods on top of deep learning architecture are needed. This work targets at using deep learning features to tackle two major computer vision problems: Multimodal Image Retrieval and Object Classification.

Multimodal Image Retrieval (MIR) aims at building the alignment between the visual and textual modalities, thus reduce the well-known “semantic gap” in image retrieval problem. As the most widely existing textual information of images, tag plays an important semantic role in MIR framework. However, treating all tags in an image as equally important may result in misalignment between visual and textual domains, leading to bad retrieval performance. To address this problem and build a robust retrieval system, we propose an MIR framework that embeds tag importance as the textual feature. In the first part, we propose an MIR system, called Multimodal Image Retrieval with Tag Importance Prediction (MIR/TIP), to embed the automatically predicted object tag importance in image retrieval. To achieve this goal, a discounted probability metric is first presented to measure the object tag importance from human sentence descriptions. Using this as ground truth, a structured object tag importance prediction model is proposed. The proposed model integrates visual, semantic, and context cues to achieve robust object tag importance prediction performance. Our experimental results demonstrate that, by embedding the predicted object tag importance, significant performance gain can be obtained in terms of both objective and subjective evaluation. In the second part, the MIR/TIP system is extended to account “scene”, which is another important aspect of image. To jointly measure the scene and object tag importance, the discounted probability metric is modified to consider the grammatical role of the scene tag in the human annotated sentence. The structured model is modified to predict the scene and object tag importance at the same time. Our experimental results demonstrate that the robustness of MIR system is greatly enhanced by our predicted scene and object tag importance.

Object classification is a long-standing problem in the computer vision field, which serves as the foundation for other problems such as object detection, scene classification, and image annotation. As the number of object categories continues to increase, it is inevitable to have certain categories that are more confusing than others due to the proximity of their samples in the feature space. In the third part, we conduct a detail analysis on confusing categories and propose a confusing categories identification and resolution (CCIR) scheme, which can be applied to any CNN-based object classification baseline method to further improve its performance. In the CCIR scheme, we first present a procedure to cluster confusing object categories together to form a confusion set automatically. Then, a binary-tree-structured (BTS) clustering method is adopted to split a confusion set into multiple subsets. A classifier is subsequently learned within each subset to enhance its performance. Experimental results on the ImageNet ILSVRC2012 dataset show that the proposed CCIR scheme can offer a significant performance gain over the AlexNet and the VGG16.

We are so glad to have him share his Ph.D. experience with us. Here is his sharing.

Ph.D. experience:

First, I would like to thank Professor Kuo for offering me this valuable PhD experience. I have learnt many critical thinking skills along with lifelong wisdom from Professor Kuo’s pre-seminar sharing. The weekly report mechanism also teaches me the importance of self-discipline. I admire Professor Kuo’s ability in managing such a large research group with such diversity. The alumni network is really a great asset for all members of MCL lab. Last, his dedicated attitude to research is something that I need to learn throughout my life.

PhD is definitely a rewarding experience. It not only prepares you technically for your future career, but also strengthens your mind. In my opinion, self-motivation, persistence, endurance, consideration are key factors leading to a successful PhD life. PhD life is surely full of frustration, but the self-satisfaction of passing the finishing line is indescribable. I would like to thank my labmates as well for their insight discussion and encouragement. The friendship with them is an indispensable part of PhD life.

Congratulations again to Shangwen and we wish him all the best in his future career.

About the Author: Ruiyuan Lin

Thesis Title: Experimental Analysis and Feedforward Design of Neural Networks, March 2021. Employment: OPPO US Research Center, Bellevue, WA, USA The 159th PhD from MCL

Congratulations to Shangwen Li for Passing PhD Defense

Share This Story, Choose Your Platform!

About the Author: Ruiyuan Lin

You May Also Like

Welcome New MCL Member James Zhan

Welcome New MCL Member Alek Yegazarian

Welcome New MCL Member Jimmy Xiao

Welcome New MCL Member Kevin Lim

Welcome New MCL Member Qi Cao

MCL Research on Image Classification

Congratulations to Wei Wang for Passing Her Defense!

Congratulations to Qingyang Zhou for Passing His Defense!

MCL Research on Prostate Segmentation

MCL Research on Green Image Super-resolution

MCL Research on Nuclei Segmentation

MCL Research on Seismic Data Processing

MCL Research on Image Denoising

MCL Research on Video-Text Retrieval

MCL Research on Enhanced Object Detection

MCL Research on Video Camouflaged Object Detection (VCOD)

MCL Research on Image Dehazing

Reunion of MCL Alumni at Southern California

MCL Research on Image Demosaicing

MCL Research on Transfer Learning

MCL Research on EDA

Professor Kuo Gave a Keynote at AIxMM 2025

Welcome New MCL Member Cynthia Huang

Congratulations to Ganning Zhao for Passing Her Defense!

Welcome to the Spring 2025 semester!

Happy New Year!

Merry Christmas!

MCL Research on Green Learning for Medical Imaging

Research on Green Image Segmentation

MCL’s Thanksgiving Luncheon

MCL Research on Radar Signal Processing: Jamming signal detection

Congratulations to Professor Kuo for Receiving NTU Distinguished Alumni Award

MCL Research on Feedforward Visual Attention

MCL Research on Word Embedding Dimension Reduction

Welcome New MCL Member Hong-En Chen

Welcome New MCL Member Laurence Palmer

Professor C.-C. Jay Kuo Named Inaugural Ming Hsieh Chair Holder

Welcome New MCL Member Alexander Jou

Welcome New MCL Member Qixin Hu

Welcome New MCL Member Youngrae Kim

Congratulations to Vasileios Magoulianitis for Passing His Defense

Congratulations to Zhanxuan Mei for Passing His Defense

MCL Research on Supervised Feature Learning

MCL Research on Green Saliency-guided Blind Image Quality Assessment (GSBIQA)

MCL Research on Green Raw Image Demosaicking

MCL Research on Green Saliency-guided Blind Image Quality Assessment (GSBIQA)

MCL Research on Prostate Lesion Detection from MRI Images

MCL Research on Green Image Super-resolution

Professor Kuo Attended ICME in Niagara Falls, Canada

MCL Research on 3D Perception with Large Foundational Models

MCL Research on Prostate MRI Image Segmentation

Professor Kuo Met MCL Alumni in Thailand

Professor Kuo visited Singapore

Professor Kuo met MCL Alumni in Taiwan

MCL Research on Nuclei Segmentation for Histological Images

MCL Research on Seismic Data Processing

MCL Research on Point Cloud Surface Reconstruction

Congratulations to Chengwei Wei for Passing His Defense

Congratulations on MCL Members Attending Ph.D. Hooding Ceremony

Welcome New MCL Member Dingyi Nie

MCL Research on Parsing Tree Construction

MCL Research on Video Camouflaged Object

MCL Research on Green Learning for Electronic Design Automation (EDA)

MCL Research on 3D Perception with Large Foundational Models

MCL Research on Green Image Coding

MCL Research on Green Point Cloud Surface Reconstruction

MCL Research on POS Tagging Prediction

MCL Research on Saliency Detection Method

Welcome New MCL Member Xuechun Hua

MCL Research on Transfer Learning

MCL Research on Image Demosaicing

MCL Research on LQBoost Regressor

MCL Research on SLMBoost Classifier

Congratulations to Xuejing Lei for Passing Her Defense

Congratulations to Yifan Wang for Passing His Defense

Welcome to Join MCL as an Intern Sanket Kumbhar