Automatic Bayesian Density Analysis

Antonio Vergari; Alejandro Molina; Robert Peharz; Zoubin Ghahramani; Kristian Kersting; Isabel Valera

doi:10.1609/aaai.v33i01.33015207

Automatic Bayesian Density Analysis

Antonio Vergari, Alejandro Molina, Robert Peharz, Zoubin Ghahramani, Kristian Kersting, Isabel Valera

Institute of Theoretical Computer Science (7080)

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

Abstract

Making sense of a dataset in an automatic and unsupervised fashion is a challenging problem in statistics and AI. Classical approaches for exploratory data analysis are usually not flexible enough to deal with the uncertainty inherent to real-world data: they are often restricted to fixed latent interaction models and homogeneous likelihoods; they are sensitive to missing, corrupt and anomalous data; moreover, their expressiveness generally comes at the price of intractable inference. As a result, supervision from statisticians is usually needed to find the right model for the data. However, since domain experts are not necessarily also experts in statistics, we propose Automatic Bayesian Density Analysis (ABDA) to make exploratory data analysis accessible at large. Specifically, ABDA allows for automatic and efficient missing value estimation, statistical data type and likelihood discovery, anomaly detection and dependency structure mining, on top of providing accurate density estimation. Extensive empirical evidence shows that ABDA is a suitable tool for automatic exploratory analysis of mixed continuous and discrete tabular data.

Original language	English
Title of host publication	Proceedings of The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19)
DOIs	https://doi.org/10.1609/aaai.v33i01.33015207
Publication status	Published - 2019

Access to Document

10.1609/aaai.v33i01.33015207

Cite this

@inproceedings{6eac8a2913194796a1e978989e31dd79,

title = "Automatic Bayesian Density Analysis",

abstract = "Making sense of a dataset in an automatic and unsupervised fashion is a challenging problem in statistics and AI. Classical approaches for exploratory data analysis are usually not flexible enough to deal with the uncertainty inherent to real-world data: they are often restricted to fixed latent interaction models and homogeneous likelihoods; they are sensitive to missing, corrupt and anomalous data; moreover, their expressiveness generally comes at the price of intractable inference. As a result, supervision from statisticians is usually needed to find the right model for the data. However, since domain experts are not necessarily also experts in statistics, we propose Automatic Bayesian Density Analysis (ABDA) to make exploratory data analysis accessible at large. Specifically, ABDA allows for automatic and efficient missing value estimation, statistical data type and likelihood discovery, anomaly detection and dependency structure mining, on top of providing accurate density estimation. Extensive empirical evidence shows that ABDA is a suitable tool for automatic exploratory analysis of mixed continuous and discrete tabular data.",

author = "Antonio Vergari and Alejandro Molina and Robert Peharz and Zoubin Ghahramani and Kristian Kersting and Isabel Valera",

year = "2019",

doi = "10.1609/aaai.v33i01.33015207",

language = "English",

booktitle = "Proceedings of The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19)",

}

TY - GEN

T1 - Automatic Bayesian Density Analysis

AU - Vergari, Antonio

AU - Molina, Alejandro

AU - Peharz, Robert

AU - Ghahramani, Zoubin

AU - Kersting, Kristian

AU - Valera, Isabel

PY - 2019

Y1 - 2019

N2 - Making sense of a dataset in an automatic and unsupervised fashion is a challenging problem in statistics and AI. Classical approaches for exploratory data analysis are usually not flexible enough to deal with the uncertainty inherent to real-world data: they are often restricted to fixed latent interaction models and homogeneous likelihoods; they are sensitive to missing, corrupt and anomalous data; moreover, their expressiveness generally comes at the price of intractable inference. As a result, supervision from statisticians is usually needed to find the right model for the data. However, since domain experts are not necessarily also experts in statistics, we propose Automatic Bayesian Density Analysis (ABDA) to make exploratory data analysis accessible at large. Specifically, ABDA allows for automatic and efficient missing value estimation, statistical data type and likelihood discovery, anomaly detection and dependency structure mining, on top of providing accurate density estimation. Extensive empirical evidence shows that ABDA is a suitable tool for automatic exploratory analysis of mixed continuous and discrete tabular data.

AB - Making sense of a dataset in an automatic and unsupervised fashion is a challenging problem in statistics and AI. Classical approaches for exploratory data analysis are usually not flexible enough to deal with the uncertainty inherent to real-world data: they are often restricted to fixed latent interaction models and homogeneous likelihoods; they are sensitive to missing, corrupt and anomalous data; moreover, their expressiveness generally comes at the price of intractable inference. As a result, supervision from statisticians is usually needed to find the right model for the data. However, since domain experts are not necessarily also experts in statistics, we propose Automatic Bayesian Density Analysis (ABDA) to make exploratory data analysis accessible at large. Specifically, ABDA allows for automatic and efficient missing value estimation, statistical data type and likelihood discovery, anomaly detection and dependency structure mining, on top of providing accurate density estimation. Extensive empirical evidence shows that ABDA is a suitable tool for automatic exploratory analysis of mixed continuous and discrete tabular data.

UR - https://research.tue.nl/en/publications/0ed34309-670e-4719-935a-2548fdb9e5f8

U2 - 10.1609/aaai.v33i01.33015207

DO - 10.1609/aaai.v33i01.33015207

M3 - Conference paper

BT - Proceedings of The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19)

ER -

Automatic Bayesian Density Analysis

Abstract

Access to Document

Other files and links

Cite this