A geometric perspective on information plane analysis

Mina Basirat; Bernhard C. Geiger; Peter M. Roth

doi:10.3390/e23060711

A geometric perspective on information plane analysis

Mina Basirat, Bernhard C. Geiger, Peter M. Roth^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

Information plane analysis, describing the mutual information between the input and a hidden layer and between a hidden layer and the target over time, has recently been proposed to analyze the training of neural networks. Since the activations of a hidden layer are typically continuous-valued, this mutual information cannot be computed analytically and must thus be estimated, resulting in apparently inconsistent or even contradicting results in the literature. The goal of this paper is to demonstrate how information plane analysis can still be a valuable tool for analyzing neural network training. To this end, we complement the prevailing binning estimator for mutual information with a geometric interpretation. With this geometric interpretation in mind, we evaluate the impact of regularization and interpret phenomena such as underfitting and overfitting. In addition, we investigate neural network learning in the presence of noisy data and noisy labels.

Original language	English
Article number	711
Journal	Entropy
Volume	23
Issue number	6
DOIs	https://doi.org/10.3390/e23060711
Publication status	Published - Jun 2021

Keywords

Adaptive and fixed binning
Image classification
Information plane analysis
Neural networks

ASJC Scopus subject areas

Information Systems
Mathematical Physics
Physics and Astronomy (miscellaneous)
Electrical and Electronic Engineering

Access to Document

10.3390/e23060711Licence: CC BY 4.0

Cite this

@article{460d7a2180de4d98870d7d3652e3ea9b,

title = "A geometric perspective on information plane analysis",

abstract = "Information plane analysis, describing the mutual information between the input and a hidden layer and between a hidden layer and the target over time, has recently been proposed to analyze the training of neural networks. Since the activations of a hidden layer are typically continuous-valued, this mutual information cannot be computed analytically and must thus be estimated, resulting in apparently inconsistent or even contradicting results in the literature. The goal of this paper is to demonstrate how information plane analysis can still be a valuable tool for analyzing neural network training. To this end, we complement the prevailing binning estimator for mutual information with a geometric interpretation. With this geometric interpretation in mind, we evaluate the impact of regularization and interpret phenomena such as underfitting and overfitting. In addition, we investigate neural network learning in the presence of noisy data and noisy labels.",

keywords = "Adaptive and fixed binning, Image classification, Information plane analysis, Neural networks",

author = "Mina Basirat and Geiger, {Bernhard C.} and Roth, {Peter M.}",

note = "Publisher Copyright: {\textcopyright} 2021 by the authors. Licensee MDPI, Basel, Switzerland.",

year = "2021",

month = jun,

doi = "10.3390/e23060711",

language = "English",

volume = "23",

journal = "Entropy",

issn = "1099-4300",

publisher = "MDPI AG",

number = "6",

}

TY - JOUR

T1 - A geometric perspective on information plane analysis

AU - Basirat, Mina

AU - Geiger, Bernhard C.

AU - Roth, Peter M.

PY - 2021/6

Y1 - 2021/6

N2 - Information plane analysis, describing the mutual information between the input and a hidden layer and between a hidden layer and the target over time, has recently been proposed to analyze the training of neural networks. Since the activations of a hidden layer are typically continuous-valued, this mutual information cannot be computed analytically and must thus be estimated, resulting in apparently inconsistent or even contradicting results in the literature. The goal of this paper is to demonstrate how information plane analysis can still be a valuable tool for analyzing neural network training. To this end, we complement the prevailing binning estimator for mutual information with a geometric interpretation. With this geometric interpretation in mind, we evaluate the impact of regularization and interpret phenomena such as underfitting and overfitting. In addition, we investigate neural network learning in the presence of noisy data and noisy labels.

AB - Information plane analysis, describing the mutual information between the input and a hidden layer and between a hidden layer and the target over time, has recently been proposed to analyze the training of neural networks. Since the activations of a hidden layer are typically continuous-valued, this mutual information cannot be computed analytically and must thus be estimated, resulting in apparently inconsistent or even contradicting results in the literature. The goal of this paper is to demonstrate how information plane analysis can still be a valuable tool for analyzing neural network training. To this end, we complement the prevailing binning estimator for mutual information with a geometric interpretation. With this geometric interpretation in mind, we evaluate the impact of regularization and interpret phenomena such as underfitting and overfitting. In addition, we investigate neural network learning in the presence of noisy data and noisy labels.

KW - Adaptive and fixed binning

KW - Image classification

KW - Information plane analysis

KW - Neural networks

UR - http://www.scopus.com/inward/record.url?scp=85107929852&partnerID=8YFLogxK

U2 - 10.3390/e23060711

DO - 10.3390/e23060711

M3 - Article

AN - SCOPUS:85107929852

SN - 1099-4300

VL - 23

JO - Entropy

JF - Entropy

IS - 6

M1 - 711

ER -

A geometric perspective on information plane analysis

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this