Lung Sound Classification Using Snapshot Ensemble of Convolutional Neural Networks

Truc Nguyen; Franz Pernkopf

doi:10.1109/EMBC44109.2020.9176076

Lung Sound Classification Using Snapshot Ensemble of Convolutional Neural Networks

Truc Nguyen, Franz Pernkopf

Institute of Signal Processing and Speech Communication (4420)

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

Abstract

We propose a robust and efficient lung sound classification system using a snapshot ensemble of convolutional neural networks (CNNs). A robust CNN architecture is used to extract high-level features from log mel spectrograms. The CNN architecture is trained on a cosine cycle learning rate schedule. Capturing the best model of each training cycle allows to obtain multiple models settled on various local optima from cycle to cycle at the cost of training a single mode. Therefore, the snapshot ensemble boosts performance of the proposed system while keeping the drawback of expensive training of ensembles moderate. To deal with the class-imbalance of the dataset, temporal stretching and vocal tract length perturbation (VTLP) for data augmentation and the focal loss objective are used. Empirically, our system outperforms state-of-the-art systems for the prediction task of four classes (normal, crackles, wheezes, and both crackles and wheezes) and two classes (normal and abnormal (i.e. crackles, wheezes, and both crackles and wheezes)) and achieves 78.4% and 83.7% ICBHI specific micro-averaged accuracy, respectively. The average accuracy is repeated on ten random splittings of 80% training and 20% testing data using the ICBHI 2017 dataset of respiratory cycles.

Original language	English
Title of host publication	42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society
Subtitle of host publication	Enabling Innovative Technologies for Global Healthcare, EMBC 2020
Publisher	Institute of Electrical and Electronics Engineers
Pages	760-763
Number of pages	4
ISBN (Electronic)	9781728119908
DOIs	https://doi.org/10.1109/EMBC44109.2020.9176076
Publication status	Published - Jul 2020
Event	42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society: EMBC 2020 - Virtuell, Montreal, Canada Duration: 20 Jul 2020 → 24 Jul 2020

Publication series

Name	Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
Volume	2020-July
ISSN (Print)	1557-170X

Conference

Conference	42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society
Abbreviated title	EMBC 2020
Country/Territory	Canada
City	Virtuell, Montreal
Period	20/07/20 → 24/07/20

ASJC Scopus subject areas

Signal Processing
Biomedical Engineering
Computer Vision and Pattern Recognition
Health Informatics

Access to Document

10.1109/EMBC44109.2020.9176076

Cite this

Nguyen, T., & Pernkopf, F. (2020). Lung Sound Classification Using Snapshot Ensemble of Convolutional Neural Networks. In 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society: Enabling Innovative Technologies for Global Healthcare, EMBC 2020 (pp. 760-763). Article 9176076 (Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS; Vol. 2020-July). Institute of Electrical and Electronics Engineers. https://doi.org/10.1109/EMBC44109.2020.9176076

Lung Sound Classification Using Snapshot Ensemble of Convolutional Neural Networks. / Nguyen, Truc; Pernkopf, Franz.
42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society: Enabling Innovative Technologies for Global Healthcare, EMBC 2020. Institute of Electrical and Electronics Engineers, 2020. p. 760-763 9176076 (Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS; Vol. 2020-July).

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

Nguyen, T & Pernkopf, F 2020, Lung Sound Classification Using Snapshot Ensemble of Convolutional Neural Networks. in 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society: Enabling Innovative Technologies for Global Healthcare, EMBC 2020., 9176076, Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS, vol. 2020-July, Institute of Electrical and Electronics Engineers, pp. 760-763, 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society, Virtuell, Montreal, Quebec, Canada, 20/07/20. https://doi.org/10.1109/EMBC44109.2020.9176076

Nguyen T, Pernkopf F. Lung Sound Classification Using Snapshot Ensemble of Convolutional Neural Networks. In 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society: Enabling Innovative Technologies for Global Healthcare, EMBC 2020. Institute of Electrical and Electronics Engineers. 2020. p. 760-763. 9176076. (Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS). doi: 10.1109/EMBC44109.2020.9176076

Nguyen, Truc ; Pernkopf, Franz. / Lung Sound Classification Using Snapshot Ensemble of Convolutional Neural Networks. 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society: Enabling Innovative Technologies for Global Healthcare, EMBC 2020. Institute of Electrical and Electronics Engineers, 2020. pp. 760-763 (Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS).

@inproceedings{365cbd7b4f0b4d35a801cf20f4df8f58,

title = "Lung Sound Classification Using Snapshot Ensemble of Convolutional Neural Networks",

abstract = "We propose a robust and efficient lung sound classification system using a snapshot ensemble of convolutional neural networks (CNNs). A robust CNN architecture is used to extract high-level features from log mel spectrograms. The CNN architecture is trained on a cosine cycle learning rate schedule. Capturing the best model of each training cycle allows to obtain multiple models settled on various local optima from cycle to cycle at the cost of training a single mode. Therefore, the snapshot ensemble boosts performance of the proposed system while keeping the drawback of expensive training of ensembles moderate. To deal with the class-imbalance of the dataset, temporal stretching and vocal tract length perturbation (VTLP) for data augmentation and the focal loss objective are used. Empirically, our system outperforms state-of-the-art systems for the prediction task of four classes (normal, crackles, wheezes, and both crackles and wheezes) and two classes (normal and abnormal (i.e. crackles, wheezes, and both crackles and wheezes)) and achieves 78.4% and 83.7% ICBHI specific micro-averaged accuracy, respectively. The average accuracy is repeated on ten random splittings of 80% training and 20% testing data using the ICBHI 2017 dataset of respiratory cycles.",

author = "Truc Nguyen and Franz Pernkopf",

year = "2020",

month = jul,

doi = "10.1109/EMBC44109.2020.9176076",

language = "English",

series = "Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS",

publisher = "Institute of Electrical and Electronics Engineers",

pages = "760--763",

booktitle = "42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society",

address = "United States",

note = "42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society : EMBC 2020, EMBC 2020 ; Conference date: 20-07-2020 Through 24-07-2020",

}

TY - GEN

T1 - Lung Sound Classification Using Snapshot Ensemble of Convolutional Neural Networks

AU - Nguyen, Truc

AU - Pernkopf, Franz

PY - 2020/7

Y1 - 2020/7

N2 - We propose a robust and efficient lung sound classification system using a snapshot ensemble of convolutional neural networks (CNNs). A robust CNN architecture is used to extract high-level features from log mel spectrograms. The CNN architecture is trained on a cosine cycle learning rate schedule. Capturing the best model of each training cycle allows to obtain multiple models settled on various local optima from cycle to cycle at the cost of training a single mode. Therefore, the snapshot ensemble boosts performance of the proposed system while keeping the drawback of expensive training of ensembles moderate. To deal with the class-imbalance of the dataset, temporal stretching and vocal tract length perturbation (VTLP) for data augmentation and the focal loss objective are used. Empirically, our system outperforms state-of-the-art systems for the prediction task of four classes (normal, crackles, wheezes, and both crackles and wheezes) and two classes (normal and abnormal (i.e. crackles, wheezes, and both crackles and wheezes)) and achieves 78.4% and 83.7% ICBHI specific micro-averaged accuracy, respectively. The average accuracy is repeated on ten random splittings of 80% training and 20% testing data using the ICBHI 2017 dataset of respiratory cycles.

AB - We propose a robust and efficient lung sound classification system using a snapshot ensemble of convolutional neural networks (CNNs). A robust CNN architecture is used to extract high-level features from log mel spectrograms. The CNN architecture is trained on a cosine cycle learning rate schedule. Capturing the best model of each training cycle allows to obtain multiple models settled on various local optima from cycle to cycle at the cost of training a single mode. Therefore, the snapshot ensemble boosts performance of the proposed system while keeping the drawback of expensive training of ensembles moderate. To deal with the class-imbalance of the dataset, temporal stretching and vocal tract length perturbation (VTLP) for data augmentation and the focal loss objective are used. Empirically, our system outperforms state-of-the-art systems for the prediction task of four classes (normal, crackles, wheezes, and both crackles and wheezes) and two classes (normal and abnormal (i.e. crackles, wheezes, and both crackles and wheezes)) and achieves 78.4% and 83.7% ICBHI specific micro-averaged accuracy, respectively. The average accuracy is repeated on ten random splittings of 80% training and 20% testing data using the ICBHI 2017 dataset of respiratory cycles.

UR - http://www.scopus.com/inward/record.url?scp=85091015126&partnerID=8YFLogxK

U2 - 10.1109/EMBC44109.2020.9176076

DO - 10.1109/EMBC44109.2020.9176076

M3 - Conference paper

AN - SCOPUS:85091015126

T3 - Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS

SP - 760

EP - 763

BT - 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society

PB - Institute of Electrical and Electronics Engineers

T2 - 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society

Y2 - 20 July 2020 through 24 July 2020

ER -

Lung Sound Classification Using Snapshot Ensemble of Convolutional Neural Networks

Abstract

Publication series

Conference

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this