Monthly Archives: August 2015

  • Permalink Gallery

    Congratulations to three MCL PhD students for passing their defense

Congratulations to three MCL PhD students for passing their defense

Congratulations to Harshad Kadu, Joe Yuchieh Lin and Xiang Fu, who passed their defense last week! Following are their thesis abstracts and they also shared their PhD experiences.

Thesis: Advanced Techniques for Human Action Classification and Text Localization (Harshad Kadu)

The thesis contains two main research topics:

1) Automatic human action classification with mocap data

We propose a TSVQ based multi-resolution string representation scheme that transforms the time-series of human poses into codeword sequences. Temporal and spatial features extracted from these sequences are combined together using novel fusion methods to achieve superior performance.

2) Text localization in natural scene images

Stable extremal region operator detects regions of interest in the proposed multi-stage incremental region classification framework. Geometric filtering, context-based text grouping and ensemble classifier stages, remove false positives and group related text regions into words.

 

Thesis: Experimental Design and Evaluation Methodology for Human-Centric Visual Quality Assessment (Joe Yuchieh Lin)
The problem of human-centric visual quality assessment (VQA) is extensively studied in this thesis. Our study includes three major topics: 1) design of a dataset for streaming video quality assessment, 2) development of a new and effective video quality assessment index, 3) exploration of a new methodology for human visual quality assessment based on the notion of just-noticeable-differences (JND).
 

Thesis: Advanced Visual Segmentation Techniques: Algorithm Design and Performance Analysis (Xiang Fu)

Two research topics are covered in the dissertation:

1) How to interactively represent and segment the object(s) in a video?

An interactive video object segmentation framework is proposed to handle complex and diverse scenes with large motion and occlusions, which outperforms the state-of-the-art Adobe After Effect.

2) How to reliably generate automatic image segmentation?

Region-Dependent Spectral Graph is designed to fuse contour, surface, and depth cues according to the type of regions. Depth cue can further merge the regions in the textured area, which is never applied for segmentation before.

Contour-guided Color [...]

By |August 31st, 2015|News|Comments Off on Congratulations to three MCL PhD students for passing their defense|

Interview with new MCL member Yuhang Song

MCL has a new Ph.D student, Yuhang Song in Fall 2015. Let’s give him a warm welcome!

Yuhang received B.S. degree from the EE Department of Tsinghua University in Fall, 2015. He decided to join USC MCL to pursue his Ph.D degree beginning from this Fall semester. We had a brief interview with him.

What is your first impression of USC and MCL?

USC is a beautiful campus with high cultural diversity, and MCL is a big group with friendly mates. It really impressed me a lot during the first group meeting, where team members talked about their current work. Their enthusiasm about research and technology led a strong influence on me.

Could you briefly introduce yourself? (Previous research experience, project experience, research interest and expertise)

I had a couple of research experience about multiple fields, including pedestrian action categorization, image retrieval, structural light coding and time series analysis.  After my attendance of this lab, I want to keep an open mind on a variety of topics of computer vision and machine learning, and to dive into certain interesting topic after several tries. 

What’s your future expectation for MCL?

As I’m new here, I hope to learn more about the ongoing projects and state-of -the-art technology in the lab. After that, I hope to devote myself to developing my research abilities, as well as communication skills. It will be a great journey in my life during this period in USC and MCL, and I’ll do my best to be a good team member and keep good relationship with my colleagues.

By |August 23rd, 2015|News|Comments Off on Interview with new MCL member Yuhang Song|

Interview with new MCL member Hsin-Ying Lee

MCL has a new Ph.D student, Hsin-Ying Lee in Fall 2015. Let’s give Hsin-Ying a warm welcome!

Hsin-Ying received B.S. degree from the EE Department of National Taiwan University in 2014 Spring. He got the USC-Taiwan Fellowship award and decided to join USC MCL to pursue his Ph.D degree beginning from this Fall semester. We had a brief interview with him.

What is your first impression of USC and MCL?

USC is quite an impassioned and multicultural place. People in MCL are full of enthusiasms for cutting-edge research.

Could you briefly introduce yourself? (Previous research experience, project experience, research interest and expertise)

My previous research field was electronic design automation (EDA). I led a team to win two international computer-aided design (CAD) contests. I had joined a MediaTek project about datapath extraction and placement. Now I would like to dive into computer vision and start a new journey of research. Currently, my research interests are biomedical related subjects, depth estimation from single image, etc..

What’s your future expectation for MCL?

Because I am just a beginner in computer vision, I have a lot to learn. I hope I can develop some advanced and useful solutions for computer vision problem. On the other hand, I am looking forward to getting to know all team members in the lab and becoming friends with all of you. It is my honor to work with so many brilliant colleagues and I hope I will be a good teammate as well.

By |August 16th, 2015|News|Comments Off on Interview with new MCL member Hsin-Ying Lee|
  • Permalink Gallery

    NVIDIA Supports MCL in Building CNN for Computer Vision Research

NVIDIA Supports MCL in Building CNN for Computer Vision Research

MCL is joining the research of Convolutional Neural Network (CNN). With the donation of one K40 GPU from NVIDIA, MCL will create its first GPU server to train and test various CNN architectures. K40 is the prestigious GPU that is widely used in parallel computing environment. It features 12 GB memory and 2880 CUDA cores that can significantly accelerate the speed of modern computer vision algorithms. When ECC is turned off and clock frequency is slightly boosted, it is capable of processing 20 iterations of Caffe training in 19.2 seconds.

MCL director, Prof. C.-C. Jay Kuo, appreciates NVIDIA’s generous donation and has assigned Phd students, Hao Xu and Qin Huang to build a powerful server to house the K40. Prof. C.-C. Jay Kuo is interested in solving various computer vision problems. He has a vision that there should be a systematic way to provide unified solution to all types of computer vision problems. To achieve that goal, Prof. C.-C. Jay Kuo believes that we need to better understand the strength of the automatically trained feature sets.

MCL has a detailed road map for its future development of Convolutional Neural Network, and one crucial step is to study the features trained through it. MCL will use Caffe and Theano libraries to test various CNN architectures, from the classic AlexNet to the deeper and more accurate VGG net. The visualization of features trained from these architectures will be carefully examined using deConv network. Finally, MCL wants to build a clear understanding towards both the automatically trained features themselves and the reason for why are they selected by the CNN.

By |August 9th, 2015|News|Comments Off on NVIDIA Supports MCL in Building CNN for Computer Vision Research|
  • Permalink Gallery

    MCL interns and students presented their case studies in the entrepreneurship short course

MCL interns and students presented their case studies in the entrepreneurship short course

Three MCL students presented their case studies about business startups in industry in the last lecture of the entrepreneurship course this summer.

Chien-Yi Wang presented Adobe. Adobe has historically focused upon the creation of multimedia and creativity software products, with a more recent trend towards rich Internet application software development, including digital marketing and digital media solutions. The company is famous for its development on the “Portable Document Format”, which is adopted worldwide as a common medium for electronic documents. After the hallmark technology of PDF standard, they were devoted to develop powerful software and services for people to create suitable digital content, deploy it across media and devices, measure and optimize it over time and achieve greater business success. Adobe has a unique business model which could optimize the revenue by distributing their products in several ways. The company spent lots of money on the marketing side which allowed them to stand out among their competitors. Nowadays, their products all go on the cloud for customer to subscribe and make people easier to collaborate and use via different platforms. Adobe is also leading computer vision researches which could benefit the whole digital media community.

Eddy Wu presented Fitbit Inc. Fitbit is one of the most well-known technology companies that produce activity trackers – the wireless wearable devices that measure data such as number of steps, quality of sleep, and other personal metrics. Fitbit is a fast growing company since its establishment; its revenue increased from $14.5 million in 2011 to $745 million in 2014. Fitbit went public in Jun, 2015, and the shares jumped to about $30 a piece at IPO day and valued the company over $6 billion. There are many reasons for its huge success. First of all, while its competitors put more efforts on the hardware development, [...]

By |August 2nd, 2015|News|Comments Off on MCL interns and students presented their case studies in the entrepreneurship short course|