GerIE - An Open Information Extraction System for the German Language

Akim Bassa, Mark Kröll, Roman Kern

Research output: Contribution to journalArticlepeer-review

Abstract

Open Information Extraction (OIE) allows to extract relations from a text without the need of domain-speci_c training data. To date, most of the research on OIE has been focused to the English language and little or no research has been conducted on other languages, including German. To tackle this problem, we developed GerIE, an OIE system for the German language. We surveyed the literature on OIE in order to identify concepts that may apply to the German language. Our system is based on the output of a German dependency parser and a number of handcrafted rules to extract the propositions. To evaluate the system, we created two dedicated datasets: one derived from news articles and the other devised from texts from an encyclopedia. Our system achieves F-measures of up to 0.89 for correctly-preprocessed sentences.
Original languageEnglish
Pages (from-to)2-24
Number of pages23
JournalJournal of Universal Computer Science
Volume24
Issue number1
DOIs
Publication statusPublished - 2018

Fingerprint

Dive into the research topics of 'GerIE - An Open Information Extraction System for the German Language'. Together they form a unique fingerprint.

Cite this