Active learning for deep detection neural networks

Hamed Habibi Aghdam; Abel Gonzales-Garcia; Antonio M. Lopez; Joost van de Weijer

Institute of Computer Graphics and Vision (7100)

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

Abstract

The cost of drawing object bounding boxes (i.e. labeling) for millions of images is prohibitively high. For instance, labeling pedestrians in a regular urban image could take 35 seconds on average. Active learning aims to reduce the cost of labeling by selecting only those images that are informative to improve the detection network accuracy. In this paper, we propose a method to perform active learning of object detectors based on convolutional neural networks. We propose a new image-level scoring process to rank unlabeled images for their automatic selection, which clearly outperforms classical scores. The proposed method can be applied to videos and sets of still images. In the former case, temporal selection rules can complement our scoring process. As a relevant use case, we extensively study the performance of our method on the task of pedestrian detection. Overall, the experiments show that the proposed method performs better than random selection. © 2019 IEEE.
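
The selection step described in the abstract (score each unlabeled image, rank, send the top-ranked ones for labeling) can be illustrated with a minimal sketch. This is not the paper's scoring process: it assumes a classical entropy-based uncertainty score as a stand-in, and the names `binary_entropy`, `image_score`, `select_for_labeling`, the `pool` data, and the `budget` parameter are all hypothetical.

```python
import math


def binary_entropy(p: float) -> float:
    """Entropy in bits of a Bernoulli probability; 0 when the detector is certain."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


def image_score(detection_probs: list[float]) -> float:
    """Assumed image-level score: mean entropy of the detector's confidences.

    A classical uncertainty score used only for illustration,
    not the scoring process proposed in the paper.
    """
    if not detection_probs:
        return 0.0
    return sum(binary_entropy(p) for p in detection_probs) / len(detection_probs)


def select_for_labeling(unlabeled: dict[str, list[float]], budget: int) -> list[str]:
    """Rank unlabeled images by score (highest first) and keep the top `budget`."""
    ranked = sorted(unlabeled, key=lambda name: image_score(unlabeled[name]), reverse=True)
    return ranked[:budget]


# Hypothetical detector confidences per unlabeled image.
pool = {
    "frame_001": [0.99, 0.98],        # confident detections, low score
    "frame_002": [0.55, 0.48, 0.90],  # mostly uncertain detections
    "frame_003": [0.50],              # maximally uncertain detection
}
print(select_for_labeling(pool, budget=2))  # ['frame_003', 'frame_002']
```

In a full active learning cycle, the selected images would be labeled, added to the training set, and the detector retrained before the next round of scoring.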

Original language	English
Title of host publication	17th IEEE/CVF International Conference on Computer Vision
Pages	3671-3679
Publication status	Published - 2019

Cite this

@inproceedings{d9688d4fd90643c28c1b2625e4c8e55c,

title = "Active learning for deep detection neural networks",

abstract = "The cost of drawing object bounding boxes (i.e. labeling) for millions of images is prohibitively high. For instance, labeling pedestrians in a regular urban image could take 35 seconds on average. Active learning aims to reduce the cost of labeling by selecting only those images that are informative to improve the detection network accuracy. In this paper, we propose a method to perform active learning of object detectors based on convolutional neural networks. We propose a new image-level scoring process to rank unlabeled images for their automatic selection, which clearly outperforms classical scores. The proposed method can be applied to videos and sets of still images. In the former case, temporal selection rules can complement our scoring process. As a relevant use case, we extensively study the performance of our method on the task of pedestrian detection. Overall, the experiments show that the proposed method performs better than random selection. {\textcopyright} 2019 IEEE.",

author = "{Habibi Aghdam}, Hamed and Gonzales-Garcia, Abel and Lopez, Antonio M. and {van de Weijer}, Joost",

year = "2019",

language = "English",

pages = "3671--3679",

booktitle = "17th IEEE/CVF International Conference on Computer Vision",

}

TY - CONF

T1 - Active learning for deep detection neural networks

AU - Habibi Aghdam, Hamed

AU - Gonzales-Garcia, Abel

AU - Lopez, Antonio M.

AU - van de Weijer, Joost

PY - 2019

Y1 - 2019

N2 - The cost of drawing object bounding boxes (i.e. labeling) for millions of images is prohibitively high. For instance, labeling pedestrians in a regular urban image could take 35 seconds on average. Active learning aims to reduce the cost of labeling by selecting only those images that are informative to improve the detection network accuracy. In this paper, we propose a method to perform active learning of object detectors based on convolutional neural networks. We propose a new image-level scoring process to rank unlabeled images for their automatic selection, which clearly outperforms classical scores. The proposed method can be applied to videos and sets of still images. In the former case, temporal selection rules can complement our scoring process. As a relevant use case, we extensively study the performance of our method on the task of pedestrian detection. Overall, the experiments show that the proposed method performs better than random selection. © 2019 IEEE.

M3 - Conference paper

SP - 3671

EP - 3679

BT - 17th IEEE/CVF International Conference on Computer Vision

ER -