USC Media Communications Lab

Permalink Gallery
MCL Research on Point-cloud Analysis

MCL Research on Point-cloud Analysis

With the rise of visualization, animation and autonomous driving applications, the demand for 3D point cloud analysis and understanding has rapidly increased. Point Cloud is a kind of data obtained from lidar scanning which contains abundant 3D information. Our research directions about point cloud in autonomous driving are object detection, segmentation and classification.

Due to its unstructured and unordered properties, people usually transfer point cloud into other data types such as mesh, voxel and multi-view. But the transformation must cause information lost. Recently, several deep-learning-solutions such as PointNet/Pointnet++ [1, 2] tailored to point clouds provide a more efficient and flexible way to handle 3D data. Some successful results for object classification and parts and semantic scene segmentation have been demonstrated. However, object and scene understanding with Convolutional Neural Networks (CNNs) on 3D volumetric data is still limited due to its high memory requirement and computational cost. This brings a challenge for autonomous driving since it requires real-time and concise processing of the observed scenes and objects.

An interpretable CNN design based on the feedforward (FF) methodology [3] without any backpropagation (BP) was recently proposed by the Media Communications Lab at USC. The FF design offers a complementary approach to CNN filter weights selection. We are now designing a feed-forward (FF) network for both object classification and indoor scene segmentation. The advantages of the FF design methodology are multiple folds. It is completely interpretable. It demands much less training complexity and training data. Furthermore, it can be generalized to weakly supervised or unsupervised learning scenarios in a straightforward manner. The latter is extremely important in real world application scenarios since data labeling is very tedious and expensive.

References:

R. Qi, H. Su, K. Mo, and L. J. Guibas. [...]

By Xuejing Lei|April 1st, 2019|News|Comments Off|

Permalink Gallery
MCL Research on Domain Adaptation

MCL Research on Domain Adaptation

Domain Adaptation is a sort of transfer learning, which is aimed to learn a model from source data distribution and apply to the target data of different distribution. Basically, the tasks in source and target domains are the same, such as both are image classification task or both are image segmentation task. There are three types of domain adaptation, differing in how many target samples are labeled with ground truth labels. In the supervised domain adaptation and the semi-supervised domain adaptation, all or part of target data is labeled respectively, while all target data is unlabeled in the unsupervised domain adaptation.

There are several classical methods supposed to solve domain shift problems by feature alignment in the unsupervised domain adaptation. [1] maps data of source and target domains into one subspace learned by reducing the distribution distance measured by maximum mean discrepancy. [2] aligns eigenvectors of two domains by learning a linear mapping function. [3] utilizes geometric and statistical changes between source and target domain to build an infinite number of subspaces and integrates them together. With the increasing popularity of deep learning, there are plenty of methods[4,5,6] utilize CNN or GAN in domain adaptation. But those methods demand a high computation cost due to back-propagation and GAN related methods are unstable in training. Besides, generalizability from one domain to the other is weak in deep learning based methods.

Professor Kuo proposed several explanations on explainable deep learning since 2014. The Saak and Saab transform gives a way to extract feature representation of images and original images can be reconstructed from the feature representation through inverse transform. This gives us a new way to handle domain adaptation task. We are now working on aligning Saab features [...]

By Xuejing Lei|March 25th, 2019|News|Comments Off|

Permalink Gallery
MCL Research on Active Learning

MCL Research on Active Learning

Deep learning has shown its effectiveness in various computer vision tasks. However, a large amount of labeled data is usually needed for deep learning approaches. Active learning can help reduce the labeling efforts by choosing the most informative samples to label and thus achieves a comparable performance with less labeled data.

There are two major types of active learning strategy: uncertainty based and diversity based.

The core idea of uncertainty based methods is to label those samples that are most uncertain to the existing model trained on current labeled set. For example, an image with a prediction of 50 percent cat is empirically considered to be more valuable than an image with a prediction of 99 percent cat, where the former has larger uncertainty. Besides uncertainty metrics from information theory like entropy, Beluch et al. [1] proposes to use an ensemble to estimate the uncertainty of unlabeled images and achieves good results in ImageNet dataset.

In contrast, diversity based methods rely on an assumption that a more diverse set of images chosen as the training set can lead to better performance. Sener et al. [2] formalizes the active learning problem into a core-set problem and achieves competitive performance in CIFAR-10 dataset. Mixed-integer programming is used to solve their objective function.

Our current research focuses on balancing the two factors (uncertainty and diversity) in a explainable way.

References:
[1] Beluch, William H., et al. “The power of ensembles for active learning in image classification.” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018.
[2] Sener, Ozan, and Silvio Savarese. “Active learning for convolutional neural networks: A core-set approach.” (2018).

Author: Yeji Shen

By Xuejing Lei|March 18th, 2019|News|Comments Off|

Permalink Gallery
Professor Kuo received IEEE Computer Society 2019 Edward J. McCluskey Technical Achievement Award

Professor Kuo received IEEE Computer Society 2019 Edward J. McCluskey Technical Achievement Award

Dr. C.-C. Jay Kuo, Distinguished Professor of Electrical Engineering and Computer Science, and the Director of the Multimedia Communications Laboratory at USC, has been selected to receive the IEEE Computer Society 2019 Edward J. McCluskey Technical Achievement Award, for “outstanding contributions to multimedia computing technologies and their applications.”

Professor Kuo is a world-renowned technical leader in multimedia computing technologies, systems and applications with an enduring impact on both academic and industry realms in the last three decades. He has made seminal contributions to video coding technologies in three areas: fast motion search, H.264 rate control, and perceptual coding. Professor Kuo’s deblocking filter and rate control technologies are widely used in video capturing devices such as smart phone cameras. Furthermore, he conducted extensive work in applying wavelets to image processing such as texture analysis, curve representation, fractal analysis, watermarking and data hiding. Recently, he has focused on machine learning, artificial intelligence, and computer vision and has developed a mathematical model that shed light on the mysterious behavior of deep learning networks.

Professor Kuo said, “It is a great honor to be named as the recipient of the IEEE Computer Society 2019 Edward J. McCluskey Technical Achievement Award. There are many outstanding researchers in the field, and I am truly humbled for this recognition.”.

For more, please click https://www.computer.org/press-room/2019-news/2019-edward-j-mccluskey-technical-achievement-award-c-c-jay-cuo

By Xuejing Lei|March 11th, 2019|News|Comments Off|

Permalink Gallery
MCL Research on Explainable Deep Learning

MCL Research on Explainable Deep Learning

The deep learning technologies such as convolutional neural networks (CNNs) and recurrent neural networks (RNNs) have great impacts on modern machine learning due to their impressive performance in many application fields that involve learning, modeling, and processing of complex sensing data. Yet, the working principle of deep learning remains mysterious. Furthermore, it has several well-known weaknesses: 1) vulnerability to adversarial attacks, 2) demanding heavy supervision, 3) generalizability from one domain to the other. Professor Kuo and his PhD students at Media Communications Lab (MCL) have been working on explainable deep learning since 2014 and published a sequence of pioneering papers on this topic.

Explanation of nonlinear activation, convolutional filters and discriminability of trained features of CNNs [1]-[3]. The role of CNN’s nonlinear activation function is well explained in [1] at the first time. That is, the nonlinear activation operation is used to resolve the sign confusion problem due to the cascade of convolutional operations in multiple layers. This work received the 2018 best paper award from the Journal of Visual Communication and Image Representation. The convolutional filters is viewed as a rectified correlations on a sphere (RECOS) and CNN’s operation is interpreted as a multi-layer RECOS transform in [2]. The discriminability of trained features of a CNN at different convolution layers is analyzed using two quantitative metrics in [3] – the Gaussian confusion measure (GCM) and the cluster purity measure (CPM), The analysis is validated by experimental results.

Saak transform and its application to adversarial attacks [4]-[5]. Being inspired by deep learning, we develop a new mathematical transform called the Saak (Subspace approximation with augmented kernels) transform in [4]. The Saak and inverse Saak transforms provide signal analysis and synthesis tools, respectively. CNNs are known to [...]

By Xuejing Lei|March 4th, 2019|News|Comments Off|

Permalink Gallery
Farewell to Dr. Xinfeng Zhang and Dr. Chao Yang

Farewell to Dr. Xinfeng Zhang and Dr. Chao Yang

Dr. Xinfeng Zhang and Dr. Chao Yang are currently Postdoctoral Research Fellows at the MCL. They will complete their one-year stay and go back to China at the end of October.

Dr. Xinfeng Zhang received his PhD degree from Institute of Computing Technology, Chinese Academy of Sciences while Dr. Chao Yang received his PhD degree from Shanghai University. They are both experts in video coding. They joined the MCL in November 2017 and participated in two industrial projects: “Perceptual Video Coding based on Visual Attention Mechanism” (sponsored by Huawei) and “Joint Image Coding and Visual Understanding” (sponsored by Netflix, Tencent and Mediatek). They have done an excellent job in both projects, which leads to two journal papers under review.

MCL Director, Dr. C.-C. Jay Kuo, said that “It is our great pleasure to have Dr. Zhang and Dr. Yang to be around in our lab for the last year. They have made very important contributions. I do wish them the very best in their future career development.”

Dr. Xinfeng Zhang said that “It is a wonderful year for me in MCL, which is a prestigious research lab but also a family with love. I appreciate Prof. Kuo very much for the professional advices in my research, the strong support for my faculty job applications and the sincere guidance for life and career. Moreover, I am very pleased to know all of the MCL members and become friends with you. Especially, thank Prof. Li, Dr. Yang, Haiqiang for the good research cooperation, and thank Bing Li and Bin Wang for the helps in my life of USC. Thanks, and best wishes for our MCL members!”

And Dr. Chao Yang said that “It’s a great honor to be here working [...]

By Xuejing Lei|October 21st, 2018|News|Comments Off|

Permalink Gallery
MCL Director, Dr. C.-C. Jay Kuo, Delivered Viterbi Special Guest Speech at Technion, Israel Institute of Technology

MCL Director, Dr. C.-C. Jay Kuo, Delivered Viterbi Special Guest Speech at Technion, Israel Institute of Technology

After a short stay in Athens for ICIP 2018, Professor Kuo flew to Israel and visited Technion, Israel Institute of Technology. He delivered the Viterbi Special Guest Lecture titled with “Unveil Convolutional Neural Networks and Go Beyond” on October 11. The talk was very well received.

The Technion – Israel Institute of Technology – is a public research university in Haifa, Israel. The university was established in 1912 during the Ottoman Empire, which was more than 35 years before the State of Israel. The Technion is the oldest university in Israel. It is ranked the best university in Israel and in the whole of the Middle East.

There is a close tie between USC and Technion through Dr. Andrew J. Viterbi. Dr. Viterbi received a Technion Honorary Doctorate in 2000. He has been a Distinguished Visiting Professor of Electrical Engineering at the Technion since then. Dr. Viterbi announced a $50 million gift to secure and enhance the Technion-Israel Institute of Technology’s leadership position in electrical and computer engineering in Israel and globally in 2015. He is a member of the Technion Board of Governors.

Professor Kuo said, “It was my great honor to have this opportunity to be a bridge between the USC Viterbi School of Engineering and the Technion Viterbi Faculty of Electrical Engineering. It is very meaningful to have more interactions and faculty/student exchanges between these two world top universities.” Professor Kuo’s visit was sponsored by the Technion Rubiner/Viterbi Fund. He used the same office of Dr. Andrew Viterbi during his stay. Professor Kuo’s visit was hosted by Professor Josh Zeevi, who is a world renowned expert in vision and image sciences.

By Xuejing Lei|October 14th, 2018|News|Comments Off|

Permalink Gallery
Professor Kuo Delivered Plenary Speech on Interpretable CNNs at ICIP 2018

Professor Kuo Delivered Plenary Speech on Interpretable CNNs at ICIP 2018

The 25th IEEE International Conference on Image Processing (ICIP) was held in the Megaron Athens International Conference Centre, Athens, Greece, from October 7-10, 2018. ICIP is the world’s largest and most comprehensive technical conference focused on image and video processing and computer vision. The theme of ICIP 2018 will be “Imaging beyond imagination”. The conference features world-class speakers, tutorials, exhibits, and a vision technology showcase.

MCL Director, Professor C.-C. Jay Kuo, delivered the Plenary Speech entitled with “Unveil Convolutional Neural Networks (CNNs) and Go Beyond” on Oct. 8, 2018. Professor Kuo has worked on theoretical understanding of CNNs since 2015 and published a sequence of papers on this topic. This speech contained main results of his research endeavor.

Specifically, Professor Kuo described a new interpretable feedforward CNN design methodology. It does not demand any backpropagation. This design adopts a data-centric approach and derives network parameters of the current layer based on data statistics from the output of the previous layer. This process continues layer after layer in one pass. The feedforward design leads to a CNN that has a classification performance close to the one designed by backpropagation. Yet, it is more robust with respect to adversarial attacks. Above all, it is mathematically transparent.

Professor Kuo has a paper entitled with “Interpretable Convolutional Neural Networks via Feedforward Design” as the main reference. It can be downloaded from the arXiv: https://arxiv.org/abs/1810.02786.

By Xuejing Lei|October 8th, 2018|News|Comments Off|

Permalink Gallery
Welcome New MCL Member Thiyagarajan Ramanathan

Welcome New MCL Member Thiyagarajan Ramanathan

1. Could you briefly introduce yourself and your research interests?

I’m pursuing my Master Degree in Electrical Engineering at USC with specialization in Image Processing and Machine Learning. I joined MCL in August 2018. My current research interests lie in the field of Machine Learning for Image Processing applications. I have worked on many applications of image processing such as Object detection, Vehicle tracking and Behavioral cloning using Deep models. I’m very interested in Natural Language Processing as well.

2. What is your impression about MCL and USC?

USC has been an amazing environment for my growth. Though I have been here only for a year, I have gained so much of knowledge and I’m very grateful to USC for that. The people at MCL are forward-thinking and very intelligent. The research at MCL is cutting edge and I am very happy and excited to be a part of the lab.

3. What is your future expectation and plan in MCL?

I want to interact more with Professor Kuo and all the students of the lab. I want to improve my research skills and expand my knowledge.

By Xuejing Lei|September 30th, 2018|News|Comments Off|

Permalink Gallery
Welcome New MCL Member Manasa Manohara

Welcome New MCL Member Manasa Manohara

1. Could you briefly introduce yourself and your research interests?

I am Manasa Manohara, a Master’s student at USC pursuing EE with a focus on Computer Vision. I finished my Bachelor’s from M.S.Ramaiah Institute of Technology in Bangalore, India. I worked as a Development Consultant at SAP India for about a year before I joined USC. I am excited about Computer Vision and Machine Learning, and how a combination of these technologies can be applied to solve everyday problems. I am particularly interested in Image and video segmentation and reconstruction. In my free time, I love to dance and read books.

2. What is your impression about MCL and USC?

USC provides a lot of opportunities for the students to grow not just in terms of academics but also encourages to pursue their personal interests. It has a good mix of people who are passionate about what they do. I hope I can make the most of what USC has to offer.

MCL is doing a lot of good work in the field of Computer Vision and I would love to learn and contribute to their research along with interacting with my peers to explore their fields of interest.

3. What is your future expectation and plan in MCL?

I hope I can enhance my skills in the fields of Image Processing and Deep learning. I wish to accelerate and solve everyday persistent problems with an exciting mixture of the above technologies.

By Xuejing Lei|September 23rd, 2018|News|Comments Off|

Previous 1 2 345 Next

XuejingLei

MCL Research on Point-cloud Analysis

MCL Research on Point-cloud Analysis

MCL Research on Domain Adaptation

MCL Research on Domain Adaptation

MCL Research on Active Learning

MCL Research on Active Learning

Professor Kuo received IEEE Computer Society 2019 Edward J. McCluskey Technical Achievement Award

Professor Kuo received IEEE Computer Society 2019 Edward J. McCluskey Technical Achievement Award

MCL Research on Explainable Deep Learning

MCL Research on Explainable Deep Learning

Farewell to Dr. Xinfeng Zhang and Dr. Chao Yang

Farewell to Dr. Xinfeng Zhang and Dr. Chao Yang

MCL Director, Dr. C.-C. Jay Kuo, Delivered Viterbi Special Guest Speech at Technion, Israel Institute of Technology

MCL Director, Dr. C.-C. Jay Kuo, Delivered Viterbi Special Guest Speech at Technion, Israel Institute of Technology

Professor Kuo Delivered Plenary Speech on Interpretable CNNs at ICIP 2018

Professor Kuo Delivered Plenary Speech on Interpretable CNNs at ICIP 2018

Welcome New MCL Member Thiyagarajan Ramanathan

Welcome New MCL Member Thiyagarajan Ramanathan

Welcome New MCL Member Manasa Manohara

Welcome New MCL Member Manasa Manohara

Recent Posts