Let us hear what he has to say about his defense and an abstract of his thesis.

Online object tracking is one of the fundamental computer vision problems. It is commonly used in real world applications such as traffic control and safety in video surveillance, autonomous vehicle, robotic navigation, medical imaging, etc. It is a very challenging problem due to multiple time-varying attributes in video sequences.

In this research, we investigate two different kinds of tracking problems: single object tracking (SOT) and multiple object tracking (MOT). First, we attempt to achieve online single object tracking using both spatial and motion cues with two novel methods. One is a traditional framework called “temporal prediction and spatial refinement (TPSR)” tracker, consisting of three cascaded modules: pre-processing (PP), temporal prediction (TP) and spatial refinement (SR). Another one is based on convolutional neural network architecture that treats the tracking as a detection problem, called ”Motion-Guided Convolutional Neural Network (MGNet) Tracker”. It has two innovations: 1) adoption of a motion-guided candidate selection (MCS) scheme based on a dynamic prediction model, and 2) usage of a RGB- plus-motion 5-channel input to the convolutional neural network (CNN). From the two proposed methods, we have showed the advantages of combining spatial information and motion cue together to improve the tracking performance.

Second, from the proposed SOT technique, we build an online multiple object tracking system with advanced model update and matching. This method treats the MOT problem as an online tracking problem, rather than the global optimization framework. There are three major components in this tracking system: 1) a system platform built upon multiple CNN single object trackers in MOT environment; 2) the proposed advanced online update strategy including incremental and aggressive update mode; 3) [...]