On Entropy-Based Data Mining

Andreas Holzinger, Matthias Hörtenhuber, Christopher Mayer, Martin Bachler, Siegfried Wassertheurer, Armando Pinho, David Koslicki

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

In the real world, we are confronted not only with complex and high-dimensional data sets, but usually with noisy, incomplete and uncertain data, where the application of traditional methods of knowledge discovery and data mining always entail the danger of modeling artifacts. Originally, information entropy was introduced by Shannon (1949), as a measure of uncertainty in the data. But up to the present, there have emerged many different types of entropy methods with a large number of different purposes and possible application areas. In this paper, we briefly discuss the applicability of entropy methods for the use in knowledge discovery and data mining, with particular emphasis on biomedical data. We present a very short overview of the state-of-the-art, with focus on four methods: Approximate Entropy (ApEn), Sample Entropy (SampEn), Fuzzy Entropy (FuzzyEn), and Topological Entropy (FiniteTopEn). Finally, we discuss some open problems and future research challenges.
LanguageEnglish
Title of host publicationInteractive Knowledge Discovery and Data Mining in Biomedical Informatics, LNCS 8401
Place of PublicationHeidelberg, Berlin, New York
PublisherSpringer
Pages209-226
Volume8401
Edition1
ISBN (Print)978-3-662-43967-8
DOIs
StatusPublished - 2014

Fingerprint

Data mining
Entropy

Keywords

  • Information Entropy
  • Data Mining
  • Health Informatics
  • Knowledge Discovery
  • Topological Entropy

ASJC Scopus subject areas

  • Computational Theory and Mathematics

Fields of Expertise

  • Information, Communication & Computing

Treatment code (Nähere Zuordnung)

  • Basic - Fundamental (Grundlagenforschung)
  • Application

Cite this

Holzinger, A., Hörtenhuber, M., Mayer, C., Bachler, M., Wassertheurer, S., Pinho, A., & Koslicki, D. (2014). On Entropy-Based Data Mining. In Interactive Knowledge Discovery and Data Mining in Biomedical Informatics, LNCS 8401 (1 ed., Vol. 8401, pp. 209-226). Heidelberg, Berlin, New York: Springer. DOI: 10.1007/978-3-662-43968-5_12

On Entropy-Based Data Mining. / Holzinger, Andreas; Hörtenhuber, Matthias; Mayer, Christopher; Bachler, Martin; Wassertheurer, Siegfried; Pinho, Armando; Koslicki, David.

Interactive Knowledge Discovery and Data Mining in Biomedical Informatics, LNCS 8401. Vol. 8401 1. ed. Heidelberg, Berlin, New York : Springer, 2014. p. 209-226.

Research output: Chapter in Book/Report/Conference proceedingChapter

Holzinger, A, Hörtenhuber, M, Mayer, C, Bachler, M, Wassertheurer, S, Pinho, A & Koslicki, D 2014, On Entropy-Based Data Mining. in Interactive Knowledge Discovery and Data Mining in Biomedical Informatics, LNCS 8401. 1 edn, vol. 8401, Springer, Heidelberg, Berlin, New York, pp. 209-226. DOI: 10.1007/978-3-662-43968-5_12
Holzinger A, Hörtenhuber M, Mayer C, Bachler M, Wassertheurer S, Pinho A et al. On Entropy-Based Data Mining. In Interactive Knowledge Discovery and Data Mining in Biomedical Informatics, LNCS 8401. 1 ed. Vol. 8401. Heidelberg, Berlin, New York: Springer. 2014. p. 209-226. Available from, DOI: 10.1007/978-3-662-43968-5_12
Holzinger, Andreas ; Hörtenhuber, Matthias ; Mayer, Christopher ; Bachler, Martin ; Wassertheurer, Siegfried ; Pinho, Armando ; Koslicki, David. / On Entropy-Based Data Mining. Interactive Knowledge Discovery and Data Mining in Biomedical Informatics, LNCS 8401. Vol. 8401 1. ed. Heidelberg, Berlin, New York : Springer, 2014. pp. 209-226
@inbook{b30f635152134bb28ceed25864622602,
title = "On Entropy-Based Data Mining",
abstract = "In the real world, we are confronted not only with complex and high-dimensional data sets, but usually with noisy, incomplete and uncertain data, where the application of traditional methods of knowledge discovery and data mining always entail the danger of modeling artifacts. Originally, information entropy was introduced by Shannon (1949), as a measure of uncertainty in the data. But up to the present, there have emerged many different types of entropy methods with a large number of different purposes and possible application areas. In this paper, we briefly discuss the applicability of entropy methods for the use in knowledge discovery and data mining, with particular emphasis on biomedical data. We present a very short overview of the state-of-the-art, with focus on four methods: Approximate Entropy (ApEn), Sample Entropy (SampEn), Fuzzy Entropy (FuzzyEn), and Topological Entropy (FiniteTopEn). Finally, we discuss some open problems and future research challenges.",
keywords = "Information Entropy, Data Mining, Health Informatics, Knowledge Discovery, Topological Entropy",
author = "Andreas Holzinger and Matthias H{\"o}rtenhuber and Christopher Mayer and Martin Bachler and Siegfried Wassertheurer and Armando Pinho and David Koslicki",
year = "2014",
doi = "10.1007/978-3-662-43968-5_12",
language = "English",
isbn = "978-3-662-43967-8",
volume = "8401",
pages = "209--226",
booktitle = "Interactive Knowledge Discovery and Data Mining in Biomedical Informatics, LNCS 8401",
publisher = "Springer",
edition = "1",

}

TY - CHAP

T1 - On Entropy-Based Data Mining

AU - Holzinger,Andreas

AU - Hörtenhuber,Matthias

AU - Mayer,Christopher

AU - Bachler,Martin

AU - Wassertheurer,Siegfried

AU - Pinho,Armando

AU - Koslicki,David

PY - 2014

Y1 - 2014

N2 - In the real world, we are confronted not only with complex and high-dimensional data sets, but usually with noisy, incomplete and uncertain data, where the application of traditional methods of knowledge discovery and data mining always entail the danger of modeling artifacts. Originally, information entropy was introduced by Shannon (1949), as a measure of uncertainty in the data. But up to the present, there have emerged many different types of entropy methods with a large number of different purposes and possible application areas. In this paper, we briefly discuss the applicability of entropy methods for the use in knowledge discovery and data mining, with particular emphasis on biomedical data. We present a very short overview of the state-of-the-art, with focus on four methods: Approximate Entropy (ApEn), Sample Entropy (SampEn), Fuzzy Entropy (FuzzyEn), and Topological Entropy (FiniteTopEn). Finally, we discuss some open problems and future research challenges.

AB - In the real world, we are confronted not only with complex and high-dimensional data sets, but usually with noisy, incomplete and uncertain data, where the application of traditional methods of knowledge discovery and data mining always entail the danger of modeling artifacts. Originally, information entropy was introduced by Shannon (1949), as a measure of uncertainty in the data. But up to the present, there have emerged many different types of entropy methods with a large number of different purposes and possible application areas. In this paper, we briefly discuss the applicability of entropy methods for the use in knowledge discovery and data mining, with particular emphasis on biomedical data. We present a very short overview of the state-of-the-art, with focus on four methods: Approximate Entropy (ApEn), Sample Entropy (SampEn), Fuzzy Entropy (FuzzyEn), and Topological Entropy (FiniteTopEn). Finally, we discuss some open problems and future research challenges.

KW - Information Entropy

KW - Data Mining

KW - Health Informatics

KW - Knowledge Discovery

KW - Topological Entropy

UR - http://link.springer.com/chapter/10.1007%2F978-3-662-43968-5_12

U2 - 10.1007/978-3-662-43968-5_12

DO - 10.1007/978-3-662-43968-5_12

M3 - Chapter

SN - 978-3-662-43967-8

VL - 8401

SP - 209

EP - 226

BT - Interactive Knowledge Discovery and Data Mining in Biomedical Informatics, LNCS 8401

PB - Springer

CY - Heidelberg, Berlin, New York

ER -