DOI: 10.1145/3194658.3194671
Short paper

Learning Image-based Representations for Heart Sound Classification

Published: 23 April 2018

ABSTRACT

Machine-learning-based heart sound classification represents an efficient technology that can help reduce the burden of manual auscultation through the automatic detection of abnormal heart sounds. In this regard, we investigate the efficacy of using Convolutional Neural Networks (CNNs) pre-trained on large-scale image data for the classification of Phonocardiogram (PCG) signals by learning deep PCG representations. First, the PCG files are segmented into chunks of equal length. Then, we extract a scalogram image from each chunk using a wavelet transformation. Next, the scalogram images are fed into either a pre-trained CNN, or the same network fine-tuned on heart sound data. Deep representations are then extracted from a fully connected layer of each network, and classification is achieved by a static classifier. Alternatively, the scalogram images are fed into an end-to-end CNN formed by adapting a pre-trained network via transfer learning. Key results indicate that our deep PCG representations extracted from a fine-tuned CNN perform strongest on our heart sound classification task, achieving 56.2% mean accuracy. Compared to a baseline accuracy of 46.9%, obtained using conventional audio processing features and a support vector machine, this is a significant relative improvement of 19.8% (p < .001 in a one-tailed z-test).
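To make the pipeline described above concrete, the following is a minimal Python sketch of one plausible realisation: equal-length PCG chunks are converted into wavelet scalogram images, a pre-trained ImageNet CNN supplies activations from a fully connected layer as the deep representation, and a support vector machine acts as the static classifier. The chunk length, wavelet choice (a Morlet wavelet), scale range, VGG16 backbone, and fully-connected-layer index are illustrative assumptions, not the authors' exact configuration.

```python
# Hedged sketch of the abstract's pipeline: chunk a PCG recording, render each
# chunk as a scalogram image, extract deep representations from a pre-trained
# CNN, and classify them with an SVM. All hyperparameters below are assumptions.
import numpy as np
import pywt                               # PyWavelets, for the continuous wavelet transform
import torch
import torchvision.models as models
import torchvision.transforms as T
from PIL import Image
from sklearn.svm import SVC


def chunk_signal(pcg, sr, seconds=4.0):
    """Split a 1-D PCG signal into equal-length chunks (4 s is an assumed length)."""
    n = int(seconds * sr)
    return [pcg[i:i + n] for i in range(0, len(pcg) - n + 1, n)]


def scalogram_image(chunk, sr, scales=np.arange(1, 128)):
    """Continuous wavelet transform of one chunk, rendered as an RGB scalogram image."""
    coefs, _ = pywt.cwt(chunk, scales, "morl", sampling_period=1.0 / sr)
    mag = np.abs(coefs)
    mag = (mag - mag.min()) / (mag.max() - mag.min() + 1e-12)   # normalise to [0, 1]
    return Image.fromarray((255 * mag).astype(np.uint8)).convert("RGB")


# Pre-trained ImageNet CNN used as a fixed feature extractor (VGG16 is an assumption).
cnn = models.vgg16(pretrained=True).eval()
preprocess = T.Compose([
    T.Resize((224, 224)),
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])


def deep_representation(img):
    """4096-dimensional activations from the second fully connected layer of VGG16."""
    x = preprocess(img).unsqueeze(0)
    with torch.no_grad():
        feats = cnn.avgpool(cnn.features(x))
        fc2 = cnn.classifier[:4](torch.flatten(feats, 1))       # through the second Linear layer
    return fc2.squeeze(0).numpy()


def train_static_classifier(chunks, labels, sr):
    """Fit a linear SVM (the 'static classifier') on the deep PCG representations."""
    X = np.stack([deep_representation(scalogram_image(c, sr)) for c in chunks])
    clf = SVC(kernel="linear")
    clf.fit(X, labels)
    return clf
```

The fine-tuned and end-to-end variants mentioned in the abstract would replace the fixed extractor above with a network whose later layers, or all layers, are retrained on the scalogram images before features are taken or predictions are made directly.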


Published in: DH '18: Proceedings of the 2018 International Conference on Digital Health, April 2018, 172 pages. ISBN: 9781450364935. DOI: 10.1145/3194658.

Copyright © 2018 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher: Association for Computing Machinery, New York, NY, United States

