Self-taught Object Localization with Deep Networks

Loris Bazzani - Dartmouth College
Date and time
Thursday, July 23, 2015 at 5:00 PM - 16:45 rinfresco; 17:00 inizio seminario
Ca' Vignal 2, Floor 1°, Lecture Hall L
Programme Director
Marco Cristani
External reference
Publication date
July 13, 2015
Computer Science  


The reliance on plentiful and detailed manual annotations for training is a critical limitation of the current state of the art in object localization and detection. This talk presents self-taught object localization, a novel approach that leverages deep convolutional networks trained for whole-image recognition to localize objects in images without additional human supervision, i.e., without using any ground-truth bounding boxes for training. The key idea is to analyze the change in the recognition scores when artificially masking out different regions of the image. The masking out of a region that contains an object typically causes a significant drop in recognition. This idea is embedded into an agglomerative clustering technique that generates self-taught localization hypotheses. Our experiments on a challenging dataset of 200 classes indicate that our automatically-generated annotations are accurate enough to train object detectors yielding to recognition results remarkably close to those obtained by training on manually-annotated bounding boxes.

© 2002 - 2021  Verona University
Via dell'Artigliere 8, 37129 Verona  |  P. I.V.A. 01541040232  |  C. FISCALE 93009870234