Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection

Andreas Opelt; Axel Pinz; Andrew Zisserman

Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection

Andreas Opelt, Axel Pinz, Andrew Zisserman

Publikation: Beitrag in einer Fachzeitschrift › Artikel › Begutachtung

Abstract

We present a novel algorithmic approach to object categorization and detection that can learn category specific detectors, using Boosting, from a visual alphabet of shape and appearance. The alphabet itself is learnt incrementally during this process. The resulting representation consists of a set of category-specific descriptors—basic shape features are represented by boundary-fragments, and appearance is represented by patches—where each descriptor in combination
with centroid vectors for possible object centroids (geometry) forms an alphabet entry. Our experimental results highlight several qualities of this novel representation. First, we demonstrate the power of purely shape-based representation with excellent categorization and detection results using a Boundary-Fragment-Model (BFM), and investigate the capabilities of such a model to handle changes in scale and viewpoint, as well as intra- and inter-class variability. Second, we show that incremental learning of a BFM for many categories leads to a sub-linear growth of visual alphabet entries by sharing of shape features, while this generalization over categories at the same time often improves categorization performance (over independently learning the
categories). Finally, the combination of basic shape and appearance (boundary-fragments and patches) features can further improve results. Certain feature types are preferred by certain categories, and for some categories we achieve
the lowest error rates that have been reported so far.

Originalsprache	englisch
Seiten (von - bis)	16-44
Fachzeitschrift	International Journal of Computer Vision
Jahrgang	80
Ausgabenummer	1
Publikationsstatus	Veröffentlicht - 2008

Treatment code (Nähere Zuordnung)

Basic - Fundamental (Grundlagenforschung)

Andere Dateien und Links

Dieses zitieren

@article{567996a880524c738be22b263c83675d,

title = "Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection",

abstract = "We present a novel algorithmic approach to object categorization and detection that can learn category specific detectors, using Boosting, from a visual alphabet of shape and appearance. The alphabet itself is learnt incrementally during this process. The resulting representation consists of a set of category-specific descriptors—basic shape features are represented by boundary-fragments, and appearance is represented by patches—where each descriptor in combinationwith centroid vectors for possible object centroids (geometry) forms an alphabet entry. Our experimental results highlight several qualities of this novel representation. First, we demonstrate the power of purely shape-based representation with excellent categorization and detection results using a Boundary-Fragment-Model (BFM), and investigate the capabilities of such a model to handle changes in scale and viewpoint, as well as intra- and inter-class variability. Second, we show that incremental learning of a BFM for many categories leads to a sub-linear growth of visual alphabet entries by sharing of shape features, while this generalization over categories at the same time often improves categorization performance (over independently learning thecategories). Finally, the combination of basic shape and appearance (boundary-fragments and patches) features can further improve results. Certain feature types are preferred by certain categories, and for some categories we achievethe lowest error rates that have been reported so far.",

keywords = "Generic object recognition, Object categorization, Category representation, Visual alphabet, Boosting",

author = "Andreas Opelt and Axel Pinz and Andrew Zisserman",

note = "online 13. Mai 2008",

year = "2008",

language = "English",

volume = "80",

pages = "16--44",

journal = "International Journal of Computer Vision",

issn = "1573-1405 ",

publisher = "Springer Vieweg",

number = "1",

}

TY - JOUR

T1 - Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection

AU - Opelt, Andreas

AU - Pinz, Axel

AU - Zisserman, Andrew

N1 - online 13. Mai 2008

PY - 2008

Y1 - 2008

N2 - We present a novel algorithmic approach to object categorization and detection that can learn category specific detectors, using Boosting, from a visual alphabet of shape and appearance. The alphabet itself is learnt incrementally during this process. The resulting representation consists of a set of category-specific descriptors—basic shape features are represented by boundary-fragments, and appearance is represented by patches—where each descriptor in combinationwith centroid vectors for possible object centroids (geometry) forms an alphabet entry. Our experimental results highlight several qualities of this novel representation. First, we demonstrate the power of purely shape-based representation with excellent categorization and detection results using a Boundary-Fragment-Model (BFM), and investigate the capabilities of such a model to handle changes in scale and viewpoint, as well as intra- and inter-class variability. Second, we show that incremental learning of a BFM for many categories leads to a sub-linear growth of visual alphabet entries by sharing of shape features, while this generalization over categories at the same time often improves categorization performance (over independently learning thecategories). Finally, the combination of basic shape and appearance (boundary-fragments and patches) features can further improve results. Certain feature types are preferred by certain categories, and for some categories we achievethe lowest error rates that have been reported so far.

AB - We present a novel algorithmic approach to object categorization and detection that can learn category specific detectors, using Boosting, from a visual alphabet of shape and appearance. The alphabet itself is learnt incrementally during this process. The resulting representation consists of a set of category-specific descriptors—basic shape features are represented by boundary-fragments, and appearance is represented by patches—where each descriptor in combinationwith centroid vectors for possible object centroids (geometry) forms an alphabet entry. Our experimental results highlight several qualities of this novel representation. First, we demonstrate the power of purely shape-based representation with excellent categorization and detection results using a Boundary-Fragment-Model (BFM), and investigate the capabilities of such a model to handle changes in scale and viewpoint, as well as intra- and inter-class variability. Second, we show that incremental learning of a BFM for many categories leads to a sub-linear growth of visual alphabet entries by sharing of shape features, while this generalization over categories at the same time often improves categorization performance (over independently learning thecategories). Finally, the combination of basic shape and appearance (boundary-fragments and patches) features can further improve results. Certain feature types are preferred by certain categories, and for some categories we achievethe lowest error rates that have been reported so far.

KW - Generic object recognition

KW - Object categorization

KW - Category representation

KW - Visual alphabet

KW - Boosting

UR - http://www.springerlink.com/content/a2178qj29p551755/

M3 - Article

SN - 1573-1405

VL - 80

SP - 16

EP - 44

JO - International Journal of Computer Vision

JF - International Journal of Computer Vision

IS - 1

ER -

Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection

Abstract

Treatment code (Nähere Zuordnung)

Andere Dateien und Links

Fingerprint

Dieses zitieren