Data Modeling Strategies for Imbalanced Learning in TRECVID Search Task

Jelena Tesic

Intelligent Information Analysis
IBM
T.J. Watson Research Center


Date: Friday, February 23, 2007
Place: Buchanan 1930
Time: 2:00 pm 3:00 pm


Abstract:

In this talk, we will present a novel approach to the difficult problem of querying video databases using visual topics with few examples. Typically with visual topics the examples are not sufficiently diverse to create robust model of the user's need in descriptor's space. As a result, direct modeling using the provided topic examples as training data is inadequate. Otherwise, systems resort to multiple content-based searches using each example in turn, which typically provides poor results.

 

We explore the relevance of visual concept modelling and how they help refine the query topics. First, we propose a new technique of leveraging unlabeled data to expand the diversity of the topic examples as well as provide a robust set of positive and negative visual examples that allow direct modeling in the descriptor space by using development set for the modelling task. The proposed method outperforms TRECVID 2006 baseline visual and text fusion by over 18%. Second, we propose a way to leverage underlying semantics contained in the visual query topic examples to improve the search. Moreover, we explore the visual context in fusion with text and visual search baseline and examine how this component can help disambiguate word and visual senses. We apply the proposed methods in the overall video search system, and show how the underlined semantics of the dataset can significantly improve the overall visual search results, and enhance performance of other modalities by at least %15.

 

 

 

JELENA TESIC is a Research Staff Member in the Intelligent Information Analysis Group at IBM T. J. Watson Research Center, Hawthorne, New York. She received the Dipl. Ing. degree from the School of Electrical Engineering, University of Belgrade, Serbia, in 1998, and M.S. and Ph.D. degrees in Electrical and Computer Engineering from University of California, Santa Barbara, in 1999 and 2004, respectively.

 

Jelena's current research work is in the areas of multimedia management, content and social network analysis, scalable indexing, learning, and semantic mining. She has made several contributions to boost the top performance of the IBM team in 2004, 2005, and 2006 TREC Video Retrieval benchmark organized by NIST. Her prior work includes topic in computer vision, image processing, and pattern recognition applied to various scientific multimedia collections.

 

 

Host: Professor B.S. Manjunath, Electrical and Computer Engineering