Scene Understanding and 3D Imagination: A Comparison between Machine Learning and Human Cognition

Michael Schoosleitner; Torsten Ullrich

doi:10.5220/0009350002310238

Publication: Contribution to book/report/conference proceedings › Conference paper › Peer-reviewed

Abstract

Spatial perception and three-dimensional imagination are important characteristics for many construction tasks in civil engineering. In order to support people in these tasks, worldwide research is being carried out on assistance systems based on machine learning and augmented reality. In this paper, we examine the machine learning component and compare it to human performance. The test scenario is to recognize a partly assembled model, identify its current status, i.e. the current instruction step, and to return the next step. Thus, we created a database of 2D images containing the complete set of instruction steps of the corresponding 3D model. Afterwards, we trained the deep neural network RotationNet with these images. Usually, machine learning approaches are compared to each other; our contribution evaluates the machine learning results against human performance tested in a survey: in a clean-room setting the survey and RotationNet results are comparable and neither is significantly better. The real-world results show that the machine learning approaches need further improvements.
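
The test scenario described in the abstract (recognize the current instruction step, then return the next one) can be illustrated with a minimal Python sketch. It is not the authors' pipeline and does not reproduce RotationNet: the function name `next_instruction_step` and the per-step score list are assumptions made purely for illustration.

```python
from typing import Optional, Sequence


def next_instruction_step(step_scores: Sequence[float]) -> Optional[int]:
    """Return the 1-based index of the next instruction step, or None when done.

    step_scores[i] is a hypothetical classifier score for "the model is in
    the state reached after instruction step i + 1" (an assumption, not the
    output format of any particular network).
    """
    if not step_scores:
        raise ValueError("step_scores must not be empty")
    # Identify the current status: the step with the highest score.
    current = max(range(len(step_scores)), key=step_scores.__getitem__) + 1
    # Return the following step, unless the model is already fully assembled.
    return current + 1 if current < len(step_scores) else None


# Hypothetical scores for a model with four instruction steps.
scores = [0.05, 0.80, 0.10, 0.05]
print(next_instruction_step(scores))
```

In such a setup, the scores would come from whatever classifier is in use; the sketch only shows the step from a recognized status to the next instruction.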

Original language	English
Title	Proceedings of the International Joint Conference on Computer Vision and Computer Graphics Theory and Applications
Editors	Manuela Chessa, Alexis Paljic, Jose Braz
Publisher	SciTePress
Pages	231-238
Number of pages	8
Volume	2, HUCAPP
ISBN (electronic)	978-989-758-402-2
DOIs	https://doi.org/10.5220/0009350002310238
Publication status	Published - 2020
Event	16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications: VISIGRAPP 2021 - Virtual, Austria. Duration: 8 Feb 2021 → 10 Feb 2021

Publication series

Name	VISIGRAPP 2020 - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
Volume	2

Conference

Conference	16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
Country/Territory	Austria
City	Virtual
Period	8/02/21 → 10/02/21

Access to document

10.5220/0009350002310238 (License: CC BY-NC-ND 4.0)

Other files and links

http://www.scopus.com/inward/record.url?scp=85083516330&partnerID=8YFLogxK

Cite this

Schoosleitner, M., & Ullrich, T. (2020). Scene Understanding and 3D Imagination: A Comparison between Machine Learning and Human Cognition. In M. Chessa, A. Paljic, & J. Braz (Eds.), Proceedings of the International Joint Conference on Computer Vision and Computer Graphics Theory and Applications (Vol. 2, HUCAPP, pp. 231-238). (VISIGRAPP 2020 - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications; Vol. 2). SciTePress. https://doi.org/10.5220/0009350002310238

Scene Understanding and 3D Imagination: A Comparison between Machine Learning and Human Cognition. / Schoosleitner, Michael; Ullrich, Torsten.
Proceedings of the International Joint Conference on Computer Vision and Computer Graphics Theory and Applications. Ed. / Manuela Chessa; Alexis Paljic; Jose Braz. Vol. 2, HUCAPP. SciTePress, 2020. pp. 231-238 (VISIGRAPP 2020 - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications; Vol. 2).

Schoosleitner, M & Ullrich, T 2020, Scene Understanding and 3D Imagination: A Comparison between Machine Learning and Human Cognition. in M Chessa, A Paljic & J Braz (eds), Proceedings of the International Joint Conference on Computer Vision and Computer Graphics Theory and Applications. vol. 2, HUCAPP, VISIGRAPP 2020 - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, vol. 2, SciTePress, pp. 231-238, 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, Virtual, Austria, 8/02/21. https://doi.org/10.5220/0009350002310238

Schoosleitner M, Ullrich T. Scene Understanding and 3D Imagination: A Comparison between Machine Learning and Human Cognition. In Chessa M, Paljic A, Braz J, editors, Proceedings of the International Joint Conference on Computer Vision and Computer Graphics Theory and Applications. Vol. 2, HUCAPP. SciTePress. 2020. p. 231-238. (VISIGRAPP 2020 - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications). doi: 10.5220/0009350002310238

Schoosleitner, Michael; Ullrich, Torsten. / Scene Understanding and 3D Imagination: A Comparison between Machine Learning and Human Cognition. Proceedings of the International Joint Conference on Computer Vision and Computer Graphics Theory and Applications. Ed. / Manuela Chessa; Alexis Paljic; Jose Braz. Vol. 2, HUCAPP. SciTePress, 2020. pp. 231-238 (VISIGRAPP 2020 - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications).

@inproceedings{8db6fe838d5c45108c385ecde2107252,
  title = "Scene Understanding and 3D Imagination: A Comparison between Machine Learning and Human Cognition",
  abstract = "Spatial perception and three-dimensional imagination are important characteristics for many construction tasks in civil engineering. In order to support people in these tasks, worldwide research is being carried out on assistance systems based on machine learning and augmented reality. In this paper, we examine the machine learning component and compare it to human performance. The test scenario is to recognize a partly assembled model, identify its current status, i.e. the current instruction step, and to return the next step. Thus, we created a database of 2D images containing the complete set of instruction steps of the corresponding 3D model. Afterwards, we trained the deep neural network RotationNet with these images. Usually, machine learning approaches are compared to each other; our contribution evaluates the machine learning results against human performance tested in a survey: in a clean-room setting the survey and RotationNet results are comparable and neither is significantly better. The real-world results show that the machine learning approaches need further improvements.",
  author = "Michael Schoosleitner and Torsten Ullrich",
  year = "2020",
  doi = "10.5220/0009350002310238",
  language = "English",
  volume = "2, HUCAPP",
  series = "VISIGRAPP 2020 - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications",
  publisher = "SciTePress",
  pages = "231--238",
  editor = "Manuela Chessa and Alexis Paljic and Jose Braz",
  booktitle = "Proceedings of the International Joint Conference on Computer Vision and Computer Graphics Theory and Applications",
  address = "Portugal",
  note = "16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications : VISIGRAPP 2021 ; Conference date: 08-02-2021 Through 10-02-2021",
}

TY - GEN
T1 - Scene Understanding and 3D Imagination: A Comparison between Machine Learning and Human Cognition
AU - Schoosleitner, Michael
AU - Ullrich, Torsten
PY - 2020
Y1 - 2020
N2 - Spatial perception and three-dimensional imagination are important characteristics for many construction tasks in civil engineering. In order to support people in these tasks, worldwide research is being carried out on assistance systems based on machine learning and augmented reality. In this paper, we examine the machine learning component and compare it to human performance. The test scenario is to recognize a partly assembled model, identify its current status, i.e. the current instruction step, and to return the next step. Thus, we created a database of 2D images containing the complete set of instruction steps of the corresponding 3D model. Afterwards, we trained the deep neural network RotationNet with these images. Usually, machine learning approaches are compared to each other; our contribution evaluates the machine learning results against human performance tested in a survey: in a clean-room setting the survey and RotationNet results are comparable and neither is significantly better. The real-world results show that the machine learning approaches need further improvements.
AB - Spatial perception and three-dimensional imagination are important characteristics for many construction tasks in civil engineering. In order to support people in these tasks, worldwide research is being carried out on assistance systems based on machine learning and augmented reality. In this paper, we examine the machine learning component and compare it to human performance. The test scenario is to recognize a partly assembled model, identify its current status, i.e. the current instruction step, and to return the next step. Thus, we created a database of 2D images containing the complete set of instruction steps of the corresponding 3D model. Afterwards, we trained the deep neural network RotationNet with these images. Usually, machine learning approaches are compared to each other; our contribution evaluates the machine learning results against human performance tested in a survey: in a clean-room setting the survey and RotationNet results are comparable and neither is significantly better. The real-world results show that the machine learning approaches need further improvements.
UR - http://www.scopus.com/inward/record.url?scp=85083516330&partnerID=8YFLogxK
U2 - 10.5220/0009350002310238
DO - 10.5220/0009350002310238
M3 - Conference paper
VL - 2, HUCAPP
T3 - VISIGRAPP 2020 - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
SP - 231
EP - 238
BT - Proceedings of the International Joint Conference on Computer Vision and Computer Graphics Theory and Applications
A2 - Chessa, Manuela
A2 - Paljic, Alexis
A2 - Braz, Jose
PB - SciTePress
T2 - 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
Y2 - 8 February 2021 through 10 February 2021
ER -
