In this paper, we tackle the problem of geolocalization in urban environments, overcoming the accuracy limitations of sensors such as GPS, compass, and accelerometer. To this end, we adopt recent findings in image segmentation and machine learning and combine them with the valuable information provided by 2.5D building maps. In particular, we first extract building façades and their edges and use this information to estimate the orientation and location that best align an input image to a 3D rendering of the given 2.5D map. As this step builds on a learned semantic segmentation procedure, rich training data is required. Thus, we also discuss how the required training data can be generated efficiently via a 3D tracking system.
Title of host publication: Proceedings of the OAGM/AAPR & ARW Joint Workshop (OAGM/AAPR & ARW)
Publication status: Published - 2017
Event: OAGM/AAPR ARW 2017: Joint Workshop on “Vision, Automation & Robotics” - Palais Eschenbach, Wien, Austria
Duration: 10 May 2017 → 12 May 2017
Conference: OAGM/AAPR ARW 2017
Abbreviated title: OAGM/AAPR ARW 2017
Period: 10/05/17 → 12/05/17