News

  • Permalink Gallery

    Professor Kuo received IEEE Computer Society 2019 Edward J. McCluskey Technical Achievement Award

Professor Kuo received IEEE Computer Society 2019 Edward J. McCluskey Technical Achievement Award

Dr. C.-C. Jay Kuo, Distinguished Professor of Electrical Engineering and Computer Science, and the Director of the Multimedia Communications Laboratory at USC, has been selected to receive the IEEE Computer Society 2019 Edward J. McCluskey Technical Achievement Award, for “outstanding contributions to multimedia computing technologies and their applications.”

Professor Kuo is a world-renowned technical leader in multimedia computing technologies, systems and applications with an enduring impact on both academic and industry realms in the last three decades. He has made seminal contributions to video coding technologies in three areas: fast motion search, H.264 rate control, and perceptual coding. Professor Kuo’s deblocking filter and rate control technologies are widely used in video capturing devices such as smart phone cameras. Furthermore, he conducted extensive work in applying wavelets to image processing such as texture analysis, curve representation, fractal analysis, watermarking and data hiding. Recently, he has focused on machine learning, artificial intelligence, and computer vision and has developed a mathematical model that shed light on the mysterious behavior of deep learning networks.

Professor Kuo said, “It is a great honor to be named as the recipient of the IEEE Computer Society 2019 Edward J. McCluskey Technical Achievement Award. There are many outstanding researchers in the field, and I am truly humbled for this recognition.”.

For more, please click https://www.computer.org/press-room/2019-news/2019-edward-j-mccluskey-technical-achievement-award-c-c-jay-cuo

By |March 11th, 2019|News|Comments Off on Professor Kuo received IEEE Computer Society 2019 Edward J. McCluskey Technical Achievement Award|

MCL Research on Explainable Deep Learning

The deep learning technologies such as convolutional neural networks (CNNs) and recurrent neural networks (RNNs) have great impacts on modern machine learning due to their impressive performance in many application fields that involve learning, modeling, and processing of complex sensing data. Yet, the working principle of deep learning remains mysterious. Furthermore, it has several well-known weaknesses: 1) vulnerability to adversarial attacks, 2) demanding heavy supervision, 3) generalizability from one domain to the other. Professor Kuo and his PhD students at Media Communications Lab (MCL) have been working on explainable deep learning since 2014 and published a sequence of pioneering papers on this topic.

Explanation of nonlinear activation, convolutional filters and discriminability of trained features of CNNs [1]-[3]. The role of CNN’s nonlinear activation function is well explained in [1] at the first time. That is, the nonlinear activation operation is used to resolve the sign confusion problem due to the cascade of convolutional operations in multiple layers. This work received the 2018 best paper award from the Journal of Visual Communication and Image Representation. The convolutional filters is viewed as a rectified correlations on a sphere (RECOS) and CNN’s operation is interpreted as a multi-layer RECOS transform in [2].  The discriminability of trained features of a CNN at different convolution layers is analyzed using two quantitative metrics in [3] – the Gaussian confusion measure (GCM) and the cluster purity measure (CPM), The analysis is validated by experimental results.

Saak transform and its application to adversarial attacks [4]-[5]. Being inspired by deep learning, we  develop  a new mathematical transform called the Saak (Subspace approximation with augmented kernels) transform in [4]. The Saak and inverse Saak transforms provide signal analysis and synthesis tools, respectively. CNNs are known to [...]

By |March 4th, 2019|News|Comments Off on MCL Research on Explainable Deep Learning|

MCL Research on CNN Ensembles

CNN technology provides state-of-the-art solutions to many image processing and computer vision problems. Given a CNN architecture, all of its parameters are determined by the stochastic gradient descent (SGD) algorithm through backpropagation (BP). The BP training demands a high computational cost. Furthermore, most CNN publications are application-oriented. There is a limited amount of progress after the classical result in [1]. Examples include explainable CNNs [2,3,4] and feedforward designs without backpropagation [5,6].

The determination of CNN model parameters in the one-pass feedforward (FF) manner was recently proposed by Kuo et al. in [6]. It derives network parameters of a target layer based on statistics of output data from its previous layer. No BP is used at all. This feedforward design provides valuable insights into the CNN operational mechanism. Besides, under the same network architecture, its training complexity is significantly lower than that of the BP design CNN. FF-designed and BP-designed CNNs are denoted by FF-CNNs and BP-CNNs, respectively.

We focus on solving the image classification problem based on the feedforward-designed convolutional neural networks (FF-CNNs) [6]. An ensemble method that fuses the output decision vectors of multiple FF-CNNs to solve the image classification problem is proposed. To enhance the performance of the ensemble system, it is critical to increasing the diversity of FF-CNN models. To achieve this objective, we introduce diversities by adopting three strategies: 1) different parameter settings in convolutional layers, 2) flexible feature subsets fed into the Fully-connected (FC) layers, and 3) multiple image embeddings of the same input source. Furthermore, we partition input samples into easy and hard ones based on their decision confidence scores. As a result, we can develop a new ensemble system tailored to hard samples to further boost classification accuracy.  Although [...]

By |February 26th, 2019|News|Comments Off on MCL Research on CNN Ensembles|

Welcome New MCL Member Haoxuan You

Welcome New MCL Member Haoxuan You! Here is an short interview with Haoxuan:

1. Could you briefly introduce yourself and your research interests?

My name is Haoxuan You. I received my bachelor’s degree at Xidian University in China last July and spent my gap year at Tsinghua University as a research assistant. Now I am having my six-month visiting at USC. Generally speaking, my research interest lies in computer vision and machine learning. I hope to model the 3D environment by effectively 3D data processing.

2. What is your impression about MCL and USC?

The first impression of USC to me is the sunshine and beautiful campus. Then I found MCL a warm and big family with very kind people. The seminar lunch, the group meeting all give me a good opportunity to get involved in and help me to broaden my horizon. I feel really lucky to be with these lovely friends.

3. What is your future expectation and plan in MCL?

I hope to have a better understanding of how to be a good researcher and independent thinker from Prof. Kuo and other MCL members. And I wish to do some more valuable work such as the interpretable network in 3D data processing and 3D object detection.

By |February 16th, 2019|News|Comments Off on Welcome New MCL Member Haoxuan You|

Welcome New MCL Member Prof. Yongfei Zhang

Welcome new MCL member Prof. Yongfei Zhang! Prof. Zhang now is a visiting scholar at Media Communications Lab (MCL) at University of Southern California (USC) under the supervision of Prof. C.-C. Jay Kuo. He is an Associate Professor with the Beijing Key Laboratory of Digital Media, School of Computer Science and Engineering, and the State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, Beijing, China. He was a visiting scholar at University of Missouri, Columbia and Sept. 2007 to Sept. 2009.

Here is a short interview with Prof. Zhang:

1. Could you briefly introduce yourself and your research interests?

I am Yongfei Zhang, an Associate Professor in Beihang University. I received my B.S. degree in Electrical Engineering and Ph.D. degree in Pattern Recognition and Intelligent Systems, both from Beihang University, Beijing, China, in 2005 and 2011 respectively. My current research interests include Real-time High Efficiency Video Coding/Decoding, Computer Vision (Image Retrieval, Vehicle/Person Re-ID), Construction & Referencing of Domain-Specific Knowledge Graph.

2. What is your impression about MCL and USC?

MCL is a large and warm group with great director and students. Group routine activities, like individual meetings and weekly group seminars, are clearly scheduled and well organized with the contributions of each MCL member, which make the research and work very efficient. Besides, Prof. Kuo’s passion for the academic research and group culture development impress me a lot.

3. What is your future expectation and plan in MCL?

Great thanks to Prof. Kuo for hosting me in the big family of MCL. I wish I could enhance myself, with the help and guidance of Prof. Kuo, on teaching, research, and group management. Also, I’d like to make friends with all MCL members. Academically, I wish I could proceed/cooperate on [...]

By |February 9th, 2019|News|Comments Off on Welcome New MCL Member Prof. Yongfei Zhang|
  • Permalink Gallery

    Congratulations to Yueru Chen for passing her Qualifying Exam

Congratulations to Yueru Chen for passing her Qualifying Exam

Congratulations to Yueru Chen for passing her Qualifying Exam on Jan. 23, 2019! The title of her Ph.D. thesis proposal is “OBJECT CLASSIFICATION BASED ON NEURAL-NETWORK-INSPIRED IMAGE TRANSFORMS”. Her Qualifying Exam Committee includes: Jay Kuo (Chair), Antonio Ortega, Shri Narayanan, Keith Chugg and Ulrich Neumann (Outside Member).

Abstract of thesis proposal:

Convolutional neural networks (CNNs) have recently demonstrated impressive performance in image classification and change the way building feature extractors from carefully handcrafted design to automatically deep learned from a large labeled dataset. However, a great majority of current CNN literature are application-oriented, and there is no clear understanding and theoretical foundation to explain the outstanding performance and indicate the way to improve. In this thesis proposal, we focus on solving the image classification problem based on the neural-network-inspired Saak (Subspace approximation with augmented kernels) transform and Saab (Subspace approximation with adjusted bias) transform.

Based on the lossy Saak transform, we firstly proposed an efficient, scalable and robust approach to the handwritten digits recognition problem. We conduct a comparative study on the performance of the LeNet-5 and the Saak-transform-based solutions in terms of scalability and robustness as well as the efficiency of lossless and lossy Saak transform under a comparable accuracy level. We also develop an ensemble method that fuses the output decision vectors of Saab-transform-based decision system (i.e. the FF-CNN model) to solve the image classification problem. To enhance the performance of the ensemble system, it is critical to increasing the diversity of FF-CNN models. To achieve this objective, we introduce diversities by adopting three strategies: 1) different parameter settings in convolutional layers, 2) flexible feature subsets fed into the Fully-connected (FC) layers, and 3) multiple image embeddings of the same input source. Furthermore, we partition [...]

By |February 5th, 2019|News|Comments Off on Congratulations to Yueru Chen for passing her Qualifying Exam|
  • Permalink Gallery

    Congratulations to Heming Zhang for Passing Her Qualifying Exam

Congratulations to Heming Zhang for Passing Her Qualifying Exam

Congratulations to Heming Zhang for Passing Her Qualifying Exam on Jan. 24, 2019! Her thesis proposal is titled with “LOCAL-AWARE DEEP LEARNING: METHODOLOGY AND APPLICATIONS”. Her Qualifying Exam Committee includes: Jay Kuo (Chair), Antonio Ortega, Keith Jenkins, Sandy Sawchuk and Stefanos Nikolaidis (Outside Member).

Abstract of thesis proposal:

Deep learning techniques utilize networks with multiple layers cascaded to map the inputs to desired outputs. To map the entire inputs to desired outputs, useful information should be extracted through the layers. During the mapping, feature extraction and prediction are jointly performed. We do not have direct control for feature extraction. Consequently, some useful information, especially local information, is also discarded in the process.

In this thesis proposal, we specifically study local-aware deep learning techniques
with: 1) multi-modal attention mechanism; 2) local cues reasoning; 3) local region
characteristics analysis. Specifically, we design a multi-modal attention mechanism for generative visual dialogue system, which simultaneously attends to multi-modal inputs and utilizes extracted local information to generate dialogue responses. We propose a proposal network for fast face detection system for mobile devices, which detects salient facial parts and
uses them as local cues for detection of entire faces. We extract representative fashion features by analyzing local regions, which contain local fashion details of humans’ interests.

By |February 4th, 2019|News|Comments Off on Congratulations to Heming Zhang for Passing Her Qualifying Exam|
  • Permalink Gallery

    Congratulations to Fenxiao (Jessica) Chen for Passing Her Qualifying Exam

Congratulations to Fenxiao (Jessica) Chen for Passing Her Qualifying Exam

Congratulations to Fenxiao (Jessica) Chen for Passing His Qualifying Exam on 01/22/19! Her thesis proposal is titled with “GRAPH EMBEDDING WITH DEEP LEARNING TECHNIQUES”. Her Qualifying Exam Committee include: Jay Kuo (Chair), Sandy Sawchuk, Panos Georgiou, Viktor Prasanna and Xiong Ren (Outside Member).

Abstract of thesis proposal:

Graph representation learning is an important task nowadays due to the fact that most real-world data naturally comes in the form of graphs in many applications. Graph data often come in high-dimensional irregular form which makes them more difficult to analyze than the traditional low-dimensional data. Graph embedding has been widely used to convert graph data into a lower dimensional space while preserving the intrinsic properties of the original data.

In this thesis proposal, we specifically study two graph embedding problems: 1) Developing effective and graph embedding techniques can provide researcher with deeper understanding of the collected data more efficiently; 2) Use the embedded information to conduct applications such as node classification and link prediction.

To find an efficient way to learn and encode graph into a low dimensional embedding. We first present a novel Deepwalk-assisted Graph PCA (DGPCA) method is proposed for processing language network data represented by graphs. This method can generate a precise text representation for nodes (or vertices) in language networks. Unlike other existing work, our learned low dimensional vector representations add flexibility in exploring vertices neighborhood information while reducing noise contained in the original data. To demonstrate the effectiveness, we use DGPCA to classify vertices that contain text information in three language networks. Experimentally, DGPCA is shown to perform well on the language datasets in comparison to several state-of-the-art benchmarking methods.

To solve the node prediction problem, we present A novel graph-to-tree conversion mechanism called the deep [...]

By |February 4th, 2019|News|Comments Off on Congratulations to Fenxiao (Jessica) Chen for Passing Her Qualifying Exam|
  • Permalink Gallery

    Congratulations to Junting Zhang for Passing Her Qualifying Exam

Congratulations to Junting Zhang for Passing Her Qualifying Exam

Congratulations to Junting Zhang for Passing His Qualifying Exam on 01/17/19. Her thesis proposal is titled with “IMAGE KNOWLEDGE TRANSFER WITH DEEP LEARNING TECHNIQUES”. Her Qualifying Exam committee includes: Jay Kuo (Chair), Sandy Sawchuk, Keith Jenkins, Panos Georgiou and Ulrich Neumann (Outside Member).

Abstract of thesis proposal:

In recent years, we have witnessed tremendous success in training deep neural networks to learn a surprisingly accurate mapping from input signals to outputs, whether they are images, languages, genetic sequences, etc. from large amounts of labeled data. One fatal characteristics of the current dominant learning paradigm is that it learns in isolation: given a carefully constructed training dataset, it runs a machine learning algorithm on the dataset to produce a model that is then used in its specific intended application. It has no intention to exploit the dependencies and relations among different tasks and domains, nor the effective techniques to retain, accumulate, and transfer knowledge gained from past learning experiences to solve new problems in the new scenarios.

The learning environments are typically static and strictly constrained. For supervised learning, labeling of training data is often done manually, which is prohibitively expensive in terms of labor resource and time, especially when the required label is fine-grained or it requires knowledge from a domain expert. Considering the real world is too complex with infinite possible tasks, it is almost impossible to label sufficient number of examples for every possible task or application. Furthermore, the world also changes constantly, and appearance of instances or the label of the same instance may vary from time to time, the labeling thus needs to be done continually, which is a daunting task for humans.

On the contrary, humans learn in a different way, where transfer [...]

By |February 4th, 2019|News|Comments Off on Congratulations to Junting Zhang for Passing Her Qualifying Exam|

Congratulations to Ye Wang for Passing His Qualifying Exam

Congratulations to Ye Wang for Passing His Qualifying Exam on 01/16/2019! His thesis proposal is titled with “VIDEO OBJECT SEGMENTATION AND TRACKING WITH DEEP LEARNING TECHNIQUES”. His Qualifying Exam committee includes: Jay Kuo (Chair), Sandy Sawchuk, Antonio Ortega, Shri Narayanan, and Joseph Lim.

Abstract of thesis proposal:

Unsupervised video object segmentation is a crucial application in video analysis without knowing any prior information about the objects. It becomes tremendously challenging when multiple objects occur and interact in a given video clip. In this thesis proposal, a novel unsupervised video object segmentation approach via distractor-aware online adaptation (DOA) is proposed. DOA models spatial-temporal consistency in video sequences by capturing background dependencies from adjacent frames. Instance proposals are generated by the instance segmentation network for each frame and then selected by motion information as hard negatives if they exist and positives. To adopt high-quality hard negatives, the block matching algorithm is then applied to preceding frames to track the associated hard negatives. General negatives are also introduced in case that there are no hard negatives in the sequence and experiments demonstrate both kinds of negatives (distractors) are complementary. Finally, we conduct DOA using the positive, negative, and hard negative masks to update the foreground/background segmentation. The proposed approach achieves state-of-the-art results on two benchmark datasets, DAVIS 2016 and FBMS-59 datasets.

In addition, this thesis proposal reports a visible and thermal drone monitoring system that integrates deep-learning-based detection and tracking modules. The biggest challenge in adopting deep learning methods for drone detection is the paucity of training drone images especially thermal drone images. To address this issue, we develop two data augmentation techniques. One is a model-based drone augmentation technique that automatically generates visible drone images with a bounding box [...]

By |January 28th, 2019|News|Comments Off on Congratulations to Ye Wang for Passing His Qualifying Exam|