Poluautomatsko stvaranje priča/sadržaja za geografski poslužitelj

Rizwan Mehmood; Hermann Maurer

Poluautomatsko stvaranje priča/sadržaja za geografski poslužitelj

Translated title of the contribution: Semi-automatic story generation for a geographic server

Rizwan Mehmood, Hermann Maurer

Institute of Interactive Systems and Data Science (7060)

Research output: Contribution to journal › Article › peer-review

Abstract

Most existing servers providing geographic data tend to offer various numeric data. We started to work on a new type of geographic server, motivated by four major issues: (i) How to handle figures when different databases present different values; (ii) How to build up sizeable collections of pictures with detailed descriptions; (iii) How to update rapidly changing information, such as personnel holding important functions, and (iv) how to describe countries not just by using trivial facts, but stories typical of the country involved. We have discussed and partially resolved issues (i) and (ii) in previous papers; we have decided to deal with (iii), regional updates, by tying in an international consortium whose members would either help themselves or find individuals to do so. It is issue (iv), how to generate non-trivial stories typical of a country, that we decided to tackle both manually (the consortium has by now generated around 200 stories), and by developing techniques for semi-automatic story generation, which is the topic of this paper. The basic idea was first to define sets of reasonably reliable servers that may differ from region to region, to extract “interesting facts” from the servers, and combine them in a raw version of a report that would require some manual cleaning-up (hence: semi-automatic). It may sound difficult to extract “interesting facts” from Web pages, but it is quite possible to define heuristics to do so, never exceeding the few lines allowed for quotation purposes. One very simple rule we adopted was this: ‘Look for sentences with superlatives!’ If a sentence contains words like “biggest”, “highest”, “most impressive” etc. it is likely to contain an interesting fact. With a little imagination, we have been able to establish a set of such rules. We will show that the stories can be completely different. For some countries, historical facts may dominate; for others, the beauty of landscapes; for others, cultural or economic achievements, and for yet others, unusual facts concerning Nobel Prize winners, food, entertainment, sports, other activities, national symbols, special laws, and so on. The results can be checked on by clicking on any country in the category “Special Information” under “Surprising Facts”. All examples shown in this paper were chosen fairly arbitrarily from over 1 90 examples, to show that the system is indeed working. There are two points to mention here: (a) it is a work in progress, yet has reached a very useable size; (b) the basic ideas can be applied to any area. The choice of geography was due to the wealth of data and interest in this area, but if our algorithms overlook some important facts, this is less critical than applied to types of medical treatment, etc.

Translated title of the contribution	Semi-automatic story generation for a geographic server
Original language	Croatian
Pages (from-to)	12-25
Number of pages	14
Journal	Kartografija i Geoinformacije
Volume	16
Issue number	27
Publication status	Published - 1 Jun 2017

Keywords

Geographic server
Story generation

ASJC Scopus subject areas

Geophysics
Geology
Earth and Planetary Sciences (miscellaneous)

Access to Document

https://hrcak.srce.hr/185928Licence: CC BY-SA 4.0

Cite this

@article{05a13f9bbd064abd82e649fb5c0e04eb,

title = "Poluautomatsko stvaranje pri{\v c}a/sadr{\v z}aja za geografski poslu{\v z}itelj",

abstract = "Most existing servers providing geographic data tend to offer various numeric data. We started to work on a new type of geographic server, motivated by four major issues: (i) How to handle figures when different databases present different values; (ii) How to build up sizeable collections of pictures with detailed descriptions; (iii) How to update rapidly changing information, such as personnel holding important functions, and (iv) how to describe countries not just by using trivial facts, but stories typical of the country involved. We have discussed and partially resolved issues (i) and (ii) in previous papers; we have decided to deal with (iii), regional updates, by tying in an international consortium whose members would either help themselves or find individuals to do so. It is issue (iv), how to generate non-trivial stories typical of a country, that we decided to tackle both manually (the consortium has by now generated around 200 stories), and by developing techniques for semi-automatic story generation, which is the topic of this paper. The basic idea was first to define sets of reasonably reliable servers that may differ from region to region, to extract “interesting facts” from the servers, and combine them in a raw version of a report that would require some manual cleaning-up (hence: semi-automatic). It may sound difficult to extract “interesting facts” from Web pages, but it is quite possible to define heuristics to do so, never exceeding the few lines allowed for quotation purposes. One very simple rule we adopted was this: {\textquoteleft}Look for sentences with superlatives!{\textquoteright} If a sentence contains words like “biggest”, “highest”, “most impressive” etc. it is likely to contain an interesting fact. With a little imagination, we have been able to establish a set of such rules. We will show that the stories can be completely different. For some countries, historical facts may dominate; for others, the beauty of landscapes; for others, cultural or economic achievements, and for yet others, unusual facts concerning Nobel Prize winners, food, entertainment, sports, other activities, national symbols, special laws, and so on. The results can be checked on by clicking on any country in the category “Special Information” under “Surprising Facts”. All examples shown in this paper were chosen fairly arbitrarily from over 1 90 examples, to show that the system is indeed working. There are two points to mention here: (a) it is a work in progress, yet has reached a very useable size; (b) the basic ideas can be applied to any area. The choice of geography was due to the wealth of data and interest in this area, but if our algorithms overlook some important facts, this is less critical than applied to types of medical treatment, etc.",

keywords = "Geographic server, Story generation",

author = "Rizwan Mehmood and Hermann Maurer",

year = "2017",

month = jun,

day = "1",

language = "kroatisch",

volume = "16",

pages = "12--25",

journal = "Kartografija i Geoinformacije",

issn = "1333-896X",

publisher = "Hrvatsko Kartografsko Dru{\v s}tvo ",

number = "27",

}

TY - JOUR

T1 - Poluautomatsko stvaranje priča/sadržaja za geografski poslužitelj

AU - Mehmood, Rizwan

AU - Maurer, Hermann

PY - 2017/6/1

Y1 - 2017/6/1

N2 - Most existing servers providing geographic data tend to offer various numeric data. We started to work on a new type of geographic server, motivated by four major issues: (i) How to handle figures when different databases present different values; (ii) How to build up sizeable collections of pictures with detailed descriptions; (iii) How to update rapidly changing information, such as personnel holding important functions, and (iv) how to describe countries not just by using trivial facts, but stories typical of the country involved. We have discussed and partially resolved issues (i) and (ii) in previous papers; we have decided to deal with (iii), regional updates, by tying in an international consortium whose members would either help themselves or find individuals to do so. It is issue (iv), how to generate non-trivial stories typical of a country, that we decided to tackle both manually (the consortium has by now generated around 200 stories), and by developing techniques for semi-automatic story generation, which is the topic of this paper. The basic idea was first to define sets of reasonably reliable servers that may differ from region to region, to extract “interesting facts” from the servers, and combine them in a raw version of a report that would require some manual cleaning-up (hence: semi-automatic). It may sound difficult to extract “interesting facts” from Web pages, but it is quite possible to define heuristics to do so, never exceeding the few lines allowed for quotation purposes. One very simple rule we adopted was this: ‘Look for sentences with superlatives!’ If a sentence contains words like “biggest”, “highest”, “most impressive” etc. it is likely to contain an interesting fact. With a little imagination, we have been able to establish a set of such rules. We will show that the stories can be completely different. For some countries, historical facts may dominate; for others, the beauty of landscapes; for others, cultural or economic achievements, and for yet others, unusual facts concerning Nobel Prize winners, food, entertainment, sports, other activities, national symbols, special laws, and so on. The results can be checked on by clicking on any country in the category “Special Information” under “Surprising Facts”. All examples shown in this paper were chosen fairly arbitrarily from over 1 90 examples, to show that the system is indeed working. There are two points to mention here: (a) it is a work in progress, yet has reached a very useable size; (b) the basic ideas can be applied to any area. The choice of geography was due to the wealth of data and interest in this area, but if our algorithms overlook some important facts, this is less critical than applied to types of medical treatment, etc.

AB - Most existing servers providing geographic data tend to offer various numeric data. We started to work on a new type of geographic server, motivated by four major issues: (i) How to handle figures when different databases present different values; (ii) How to build up sizeable collections of pictures with detailed descriptions; (iii) How to update rapidly changing information, such as personnel holding important functions, and (iv) how to describe countries not just by using trivial facts, but stories typical of the country involved. We have discussed and partially resolved issues (i) and (ii) in previous papers; we have decided to deal with (iii), regional updates, by tying in an international consortium whose members would either help themselves or find individuals to do so. It is issue (iv), how to generate non-trivial stories typical of a country, that we decided to tackle both manually (the consortium has by now generated around 200 stories), and by developing techniques for semi-automatic story generation, which is the topic of this paper. The basic idea was first to define sets of reasonably reliable servers that may differ from region to region, to extract “interesting facts” from the servers, and combine them in a raw version of a report that would require some manual cleaning-up (hence: semi-automatic). It may sound difficult to extract “interesting facts” from Web pages, but it is quite possible to define heuristics to do so, never exceeding the few lines allowed for quotation purposes. One very simple rule we adopted was this: ‘Look for sentences with superlatives!’ If a sentence contains words like “biggest”, “highest”, “most impressive” etc. it is likely to contain an interesting fact. With a little imagination, we have been able to establish a set of such rules. We will show that the stories can be completely different. For some countries, historical facts may dominate; for others, the beauty of landscapes; for others, cultural or economic achievements, and for yet others, unusual facts concerning Nobel Prize winners, food, entertainment, sports, other activities, national symbols, special laws, and so on. The results can be checked on by clicking on any country in the category “Special Information” under “Surprising Facts”. All examples shown in this paper were chosen fairly arbitrarily from over 1 90 examples, to show that the system is indeed working. There are two points to mention here: (a) it is a work in progress, yet has reached a very useable size; (b) the basic ideas can be applied to any area. The choice of geography was due to the wealth of data and interest in this area, but if our algorithms overlook some important facts, this is less critical than applied to types of medical treatment, etc.

KW - Geographic server

KW - Story generation

UR - http://www.scopus.com/inward/record.url?scp=85028470695&partnerID=8YFLogxK

M3 - Artikel

AN - SCOPUS:85028470695

SN - 1333-896X

VL - 16

SP - 12

EP - 25

JO - Kartografija i Geoinformacije

JF - Kartografija i Geoinformacije

IS - 27

ER -

Poluautomatsko stvaranje priča/sadržaja za geografski poslužitelj

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this