Object Recognition by Scene
Alignment
Antonio Torralba
Electrical Engineering and Computer
Science
MIT
Date: Friday, May 30, 2008
Place: Harold Frank Hall, 4164
Time: 2:00 pm — 3:00 pm
Abstract:
Object detection and recognition is generally
posed as a matching problem between the object representation and the
image features (e.g., aligning pictorial cues, shape correspondence, constellations
of parts, etc.) while rejecting the background features using an outlier
process. In this work, we take a different approach: we formulate the
object detection problem as a problem of aligning elements of the entire
scene. The background, instead of being treated as a set of outliers,
is used to guide the detection process. Our approach relies on the observation
that when we have a big enough database then we can find with high probability
some images in the database very close to a query image, as in similar
scenes with similar objects arranged in similar spatial configurations.
If the images in the retrieval set are
partially labeled, then we can transfer the knowledge of the labeling
to the query image, and the problem of object recognition becomes a problem
of aligning scene regions. But, can we find a dataset large enough to
cover a large number of scene configurations? Given an input image, how
do we find a good retrieval set, and, finally, how we do transfer the
labels to the input image? We will use two datasets; 1) the LabelMe dataset,
which contains more than 10,000 labeled images with over 180,000 annotated
objects. 2) The tiny images dataset: A dataset of weakly labeled images
with more than 79,000,000 images. We use this database to perform object
and scene classification, examining performance over a range of semantic
levels.
Work in collaboration with Rob Fergus, Bryan
Russell, Ce Liu and William
T. Freeman.
Additional information and links to relevant
papers can be found at:
URL: http://people.csail.mit.edu/torralba/tinyimages/
A review paper describing some of the background
for this work is
available here: http://cvcl.mit.edu/Papers/OlivaTorralbaTICS2007.pdf
Code, and additional material can be found
here: http://people.csail.mit.edu/torralba/code/spatialenvelope/
ANTONIO TORRALBA joined
the Department of Electrical Engineering and Computer Science at MIT in
August 2007 as an Assistant Professor and a member of the Computer Science
and Artificial Intelligence Laboratory. He earned a degree in telecommunications
engineering from the Technical University of Catalonia (Spain) in 1994.
He received his Ph.D. in Image and Signal Processing from the Institut
National Polytechnique, Grenoble, France, in 2000. Then, he moved to Boston
as a postdoctoral fellow at MIT. In 2004, he became a research scientist
at the Computer Science and Artificial Intelligence Laboratory.
Host: Prof. Miguel Eckstein, Dept. of Psychology
|