Automatic News Article Generation from Legislative Proceedings: A Phenom-Based Approach

Anastasiia Klimashevskaia, Richa Gadgil, Thomas Gerrity, Foaad Khosmood*, Christian Gütl, Patrick Howe

*Korrespondierende/r Autor/-in für diese Arbeit

Publikation: Beitrag in Buch/Bericht/KonferenzbandBeitrag in einem KonferenzbandBegutachtung

Abstract

Algorithmic journalism refers to automatic AI-constructed news stories. There have been successful commercial implementations for news stories in sports, weather, financial reporting and similar domains with highly structured, well defined tabular data sources. Other domains such as local reporting have not seen adoption of algorithmic journalism, and thus no automated reporting systems are available in these categories which can have important implications for the industry. In this paper, we demonstrate a novel approach for producing news stories on government legislative activity, an area that has not widely adopted algorithmic journalism. Our data source is state legislative proceedings, primarily the transcribed speeches and dialogue from floor sessions and committee hearings in US State legislatures. Specifically, we create a library of potential events called phenoms. We systematically analyze the transcripts for the presence of phenoms using a custom partial order planner. Each phenom, if present, contributes some natural language text to the generated article: either stating facts, quoting individuals or summarizing some aspect of the discussion. We evaluate two randomly chosen articles with a user study on Amazon Mechanical Turk with mostly Likert scale questions. Our results indicate a high degree of achievement for accuracy of facts and readability of final content with 13 of 22 users in the first article and 19 of 20 subjects of the second article agreeing or strongly agreeing that the articles included the most important facts of the hearings. Other results strengthen this finding in terms of accuracy, focus and writing quality.

Originalspracheenglisch
TitelStatistical Language and Speech Processing - 9th International Conference, SLSP 2021, Proceedings
Redakteure/-innenLuis Espinosa-Anke, Carlos Martín-Vide, Irena Spasic
Herausgeber (Verlag)Springer Science and Business Media Deutschland GmbH
Seiten15-26
Seitenumfang12
ISBN (Print)9783030895785
DOIs
PublikationsstatusVeröffentlicht - 2021
Veranstaltung9th International Conference on Statistical Language and Speech Processing, SLSP 2021 - Cardiff, Großbritannien / Vereinigtes Königreich
Dauer: 23 Nov. 202125 Nov. 2021

Publikationsreihe

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Band13062 LNAI
ISSN (Print)0302-9743
ISSN (elektronisch)1611-3349

Konferenz

Konferenz9th International Conference on Statistical Language and Speech Processing, SLSP 2021
Land/GebietGroßbritannien / Vereinigtes Königreich
OrtCardiff
Zeitraum23/11/2125/11/21

ASJC Scopus subject areas

  • Theoretische Informatik
  • Informatik (insg.)

Fingerprint

Untersuchen Sie die Forschungsthemen von „Automatic News Article Generation from Legislative Proceedings: A Phenom-Based Approach“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren