Deep Recurrent Learning for Heart Sounds Segmentation based on Instantaneous Frequency Features
DOI:
https://doi.org/10.37537/rev.elektron.4.2.101.2020Keywords:
phonocardiogram, fourier transform, long short-term memoryAbstract
In this work, a novel stack of well-known technologies is presented to determine an automatic method to segment the heart sounds in a phonocardiogram (PCG). We will show a deep recurrent neural network (DRNN) capable of segmenting a PCG into their main components and a very specific way of extracting instantaneous frequency that will play an important role in the training and testing of the proposed model. More specifically, it involves an Long Short-Term Memory (LSTM) neural network accompanied by the Fourier Synchrosqueezed Transform (FSST) used to extract instantaneous time-frequency features from a PCG. The present approach was tested on heart sound signals longer than 5 seconds and shorter than 35 seconds from freely-available databases. This approach proved that, with a relatively small architecture, a small set of data and the right features, this method achieved an almost state-of-the-art performance, showing an average sensitivity of 89.5%, an average positive predictive value of 89.3% and an average accuracy of 91.3%.Downloads
References
H. Liang, L. Sakari, and H. Liro, “A heart sound segmentation algorithm using Wavelet Decomposition and reconstruction,” in Proceedings of the 19th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. ’Magnificent Milestones and Emerging Opportunities in Medical Engineering’ (Cat. No.97CH36136), vol. 4, Chicago, IL, USA, Oct 1997, pp. 1630–1633.
S. E Schmidt, C. Holst-Hansen, C. Graff, E. Toft and JJ. Struijk, “Segmentation of heart sound recordings by a duration-dependent hidden Markov model,” Physiological measurement, vol. 31, no. 4, pp. 513–529, Mar. 2010.
D. B. Springer, L. Tarassenko, and G. D. Clifford, “Logistic Regression-HSMM-Based Heart Sound Segmentation,” IEEE Transactions on Biomedical Engineering, vol. 63, no. 4, pp. 822–832, April 2016.
F. Renna, J. H. Oliveira, and M. T. Coimbra, “Deep Convolutional Neural Networks for Heart Sound Segmentation,” IEEE Journal of Biomedical and Health Informatics, pp. 1–1, 2019.
P. F. O. Ronneberger and T. Brox, “U-Net: Convolutional Networks for Biomedical Image Segmentation,” CoRR, vol. abs/1505.04597, 2015. [Online]. Available: http://arxiv.org/abs/1505.04597
A. L. Goldberger, L. A. N. Amaral, L. Glass, J. M. Hausdorff, P. C. Ivanov, R. G. Mark, J. E. Mietus, G. B. Moody, C.-K. Peng, and H. E. Stanley, “PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals,” Circulation, vol. 101, no. 23, pp. e215–e220, 2000 (June 13), circulation Electronic Pages: http://circ.ahajournals.org/content/101/23/e215.full PMID:1085218; doi: 10.1161/01.CIR.101.23.e215.
T. Oberlin, S. Meignen, and V. Perrier, “The Fourier-based Synchrosqueezing Transform,” in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, 05 2014, pp. 315–319.
Published
Issue
Section
License
The authors who publish in this journal agree with terms established in the license Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)