Generic Object Recognition with Boosting

Andreas Opelt; Axel Pinz; Michael Fussenegger; Peter Auer

Generic Object Recognition with Boosting

Andreas Opelt, Axel Pinz, Michael Fussenegger, Peter Auer

Institut für Grundlagen der Informationsverarbeitung (7080)

Publikation: Beitrag in einer Fachzeitschrift › Artikel › Begutachtung

Abstract

This paper explores the power and the limitations of weakly supervised categorization. We present a complete framework that starts with the extraction of various local regions of either discontinuity or homogeneity. A variety of local descriptors can be applied to form a set of feature vectors for each local region. Boosting is used to learn a subset of such feature vectors (weak hypotheses) and to combine them into one final hypothesis for each visual category. This combination of individual extractors and descriptors leads to recognition rates that are superior to other approaches which use only one specific extractor/descriptor setting. To explore the limitation of our system, we had to set up new, highly complex image databases that show the objects of interest at varying scales and poses, in cluttered background, and under considerable occlusion. We obtain classification results up to 81 percent ROC-equal error rate on the most complex of our databases. Our approach outperforms all comparable solutions on common databases.

Originalsprache	englisch
Seiten (von - bis)	416-431
Fachzeitschrift	IEEE Transactions on Pattern Analysis and Machine Intelligence
Jahrgang	28
Ausgabenummer	3
Publikationsstatus	Veröffentlicht - 2006

Fields of Expertise

Information, Communication & Computing

Dieses zitieren

@article{4be11606587c4f5c8198887ab1fb3fea,

title = "Generic Object Recognition with Boosting",

abstract = "This paper explores the power and the limitations of weakly supervised categorization. We present a complete framework that starts with the extraction of various local regions of either discontinuity or homogeneity. A variety of local descriptors can be applied to form a set of feature vectors for each local region. Boosting is used to learn a subset of such feature vectors (weak hypotheses) and to combine them into one final hypothesis for each visual category. This combination of individual extractors and descriptors leads to recognition rates that are superior to other approaches which use only one specific extractor/descriptor setting. To explore the limitation of our system, we had to set up new, highly complex image databases that show the objects of interest at varying scales and poses, in cluttered background, and under considerable occlusion. We obtain classification results up to 81 percent ROC-equal error rate on the most complex of our databases. Our approach outperforms all comparable solutions on common databases.",

keywords = "Boosting, object categorization, object localization",

author = "Andreas Opelt and Axel Pinz and Michael Fussenegger and Peter Auer",

year = "2006",

language = "English",

volume = "28",

pages = "416--431",

journal = "IEEE Transactions on Pattern Analysis and Machine Intelligence",

issn = "0162-8828",

publisher = "IEEE Computer Society",

number = "3",

}

TY - JOUR

T1 - Generic Object Recognition with Boosting

AU - Opelt, Andreas

AU - Pinz, Axel

AU - Fussenegger, Michael

AU - Auer, Peter

PY - 2006

Y1 - 2006

N2 - This paper explores the power and the limitations of weakly supervised categorization. We present a complete framework that starts with the extraction of various local regions of either discontinuity or homogeneity. A variety of local descriptors can be applied to form a set of feature vectors for each local region. Boosting is used to learn a subset of such feature vectors (weak hypotheses) and to combine them into one final hypothesis for each visual category. This combination of individual extractors and descriptors leads to recognition rates that are superior to other approaches which use only one specific extractor/descriptor setting. To explore the limitation of our system, we had to set up new, highly complex image databases that show the objects of interest at varying scales and poses, in cluttered background, and under considerable occlusion. We obtain classification results up to 81 percent ROC-equal error rate on the most complex of our databases. Our approach outperforms all comparable solutions on common databases.

AB - This paper explores the power and the limitations of weakly supervised categorization. We present a complete framework that starts with the extraction of various local regions of either discontinuity or homogeneity. A variety of local descriptors can be applied to form a set of feature vectors for each local region. Boosting is used to learn a subset of such feature vectors (weak hypotheses) and to combine them into one final hypothesis for each visual category. This combination of individual extractors and descriptors leads to recognition rates that are superior to other approaches which use only one specific extractor/descriptor setting. To explore the limitation of our system, we had to set up new, highly complex image databases that show the objects of interest at varying scales and poses, in cluttered background, and under considerable occlusion. We obtain classification results up to 81 percent ROC-equal error rate on the most complex of our databases. Our approach outperforms all comparable solutions on common databases.

KW - Boosting

KW - object categorization

KW - object localization

M3 - Article

SN - 0162-8828

VL - 28

SP - 416

EP - 431

JO - IEEE Transactions on Pattern Analysis and Machine Intelligence

JF - IEEE Transactions on Pattern Analysis and Machine Intelligence

IS - 3

ER -

Generic Object Recognition with Boosting

Abstract

Fields of Expertise

Fingerprint

Dieses zitieren