Generating Tailored Classification Schemas for German Patents

Oliver Pimas, Stefan Klampfl, Thomas Kohl, Roman Kern, Mark Kröll

Publication: Contribution to book/report/conference proceedings; Conference paper; Peer-reviewed

Abstract

Patents and patent applications are important parts of a company’s intellectual property. Companies therefore put a lot of effort into designing and maintaining an internal structure for organizing their own patent portfolios, and into keeping track of competitors’ patent portfolios. Yet official classification schemas offered by patent offices (i) are often too coarse and (ii) cannot be mapped, for instance, to a company’s functions, applications, or divisions. In this work, we present a first step towards generating tailored classification schemas. To automate the generation process, we apply key term extraction and topic modelling algorithms to 2,131 publications of German patent applications. To infer categories, we apply topic modelling to the patent collection. We evaluate the mapping of the topics found via Latent Dirichlet Allocation to the classes assigned to the patents in the collection by the domain expert.
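
One way to picture the evaluation step mentioned in the abstract is a purity-style score: each topic is mapped to the expert class that most of its documents carry, and the score is the share of documents that agree with that mapping. This is only a sketch under assumptions. The abstract does not say which measure the authors used, and the function name `topic_class_purity`, the topic ids, and the class labels below are hypothetical.

```python
from collections import Counter, defaultdict


def topic_class_purity(dominant_topics, expert_classes):
    """Map each topic to its most frequent expert class; return (mapping, purity)."""
    if len(dominant_topics) != len(expert_classes):
        raise ValueError("need one topic and one class per document")
    if not dominant_topics:
        raise ValueError("need at least one document")

    # Count, for every topic, how often each expert class occurs among its documents.
    counts = defaultdict(Counter)
    for topic, label in zip(dominant_topics, expert_classes):
        counts[topic][label] += 1

    # Assign each topic the class that most of its documents carry.
    mapping = {topic: c.most_common(1)[0][0] for topic, c in counts.items()}

    # Purity: share of documents whose class equals the class mapped to their topic.
    agree = sum(c[mapping[topic]] for topic, c in counts.items())
    return mapping, agree / len(dominant_topics)


# Hypothetical toy data: dominant topic per patent and the expert-assigned class.
topics = [0, 0, 0, 1, 1, 2]
classes = ["brakes", "brakes", "sensors", "sensors", "sensors", "housing"]
mapping, purity = topic_class_purity(topics, classes)
```

In this toy input, the dominant topic per document would come from a fitted topic model; a purity close to 1 would suggest that the inferred topics line up well with the expert's classes.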
Original language: English
Title: International Conference on Applications of Natural Language to Information Systems
Pages: 230 - 238
ISBN (electronic): 978-331941753-0
DOIs
Publication status: Published - 2016
Event: 21st International Conference on Applications of Natural Language to Information Systems: NLDB 2016 - Salford, United Kingdom
Duration: 22 June 2016 - 24 June 2016

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 9612
ISSN (electronic): 0302-9743

Conference

Conference: 21st International Conference on Applications of Natural Language to Information Systems
Country/Territory: United Kingdom
City: Salford
Period: 22/06/16 - 24/06/16

Fields of Expertise

  • Information, Communication & Computing
