Preview

«System analysis and applied information science»

Advanced search

Methodics and tools of cough sound processing on basic of neural net

https://doi.org/10.21122/2309-4923-2023-1-35-41

Abstract

The purpose of the article is to analyze the methods and means of processing cough sounds to detect lung diseases, as well as to describe the developed system for classifying and detecting cough sounds based on a deep neural network. Four types of machine learning and the use of convolutional neural network (CNN) are considered. Hypermarkets of CNN are given. Varieties of machine learning based on the CNN are discussed. The analysis of works on the methodology and means of processing cough sounds based on the CNN with the reduction of the means used and the accuracy of recognition is carried out. Details of machine learning using the environmental sound classification 50 (ESC-50) dataset are discussed. To recognize COVID-19 cough, a classifier was analyzed using CNN as a machine learning model. The proposed CNN system is designed to classify and detect cough sounds based on ESC-50. After selecting a set of sound classification data, four stages are described: extraction of features from audio files, labeling, training, testing. The ESC-50 used for the study was downloaded from the Kaggle website. Python libraries and modules related to deep learning and data science were used to implement the project: NumPy, Librosa, Matplotlib, Hickle, Sci-Kit Learn, Keras. The implemented network used a stochastic gradient algorithm. Several volunteers recorded their voices while coughing using their smartphones and it was assured to record their voices in a public environment to introduce noise to the sounds, in addition to some audio files that were downloaded online. The results showed an average accuracy of 85.37 %, precision of 78.8 % and a recall record of 91.9 %.

About the Authors

U. A. Vishniakou
Belarusian state University of Informatics and Radioelectronics
Belarus

Vishniakou Uladzimir - doctor of technical science, professor of ICT department of Belarusian State University of Informatics and Radioelectronics. 

Minsk, Republic of Belarus



Bahaa Shaya
Belarusian state University of Informatics and Radioelectronics
Belarus

Shaya Bahaa - master of technical science, PhD-student of ICT department of Belarusian State University of Informatics and Radioelectronics.

Minsk, Republic of Belarus



References

1. Radhakrishnan P. Towards Data Science. 9 Aug 2017. [Electronic resource]. Avialable : https://towardsdatascience.com/what-are-hyperparameters-and-how-to-tune-the-hyperparameters-in-a-deep-neural-network-d0604917584a. Date of access: Aug 2022.

2. LeCun J., Boser B., Denker J., Henderson D., Howard R., Hubbard W., Jackel L. Backpropagation Applied to Handwritten Zip Code. Neural Computation, 1989. ‒ Vol. 1, No. 4. ‒ Pp. 541-551.

3. Dhillon A., Verma G.K. Convolutional neural network: A review of models, methodologies and applications to object detection. Progress in Artificial Intelligence, 2019. ‒ Vol. 9, No. 2. ‒ Pp. 85-112.

4. Kriszhevsky G.K., Sutskever G.K., Hinton G.E. ImageNet classification with deep convolutional neural networks // Proceedings of the 25th International Conference on Neural, vol. 60, No. 6, 2012. – Pp. 1097-1105.

5. Chu, S. Unstructured Audio Classification for Environment Recognition.// Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008. – Pp. 1845-1846.

6. Banchhor K.S., Khan A. Musical Instrument Recognition using Zero Crossing Rate and Short-time Energy. International Journal of Applied Information Systems. vol. 1, No 3, 2012. – Pp. 85-97.

7. Feroze K., Maud A.R. Sound Event Detection in Real Life Audio using Perceptual Linear Predictive Feature with Neural Network // Proceeding of 15th International Bhurban Conferance on Applied Sceinces & Technology, Islamabad, Pakistan, 2018.

8. Wang H.-H., Liu J.-M., You M., Li G.-Z. Audio Signals Encoding for Cough Classification Using Convolutional Neural Networks: A Comparative Study // IEEE International Conference on Bioinformatics and Biomedicine, Shanghai, 2015.

9. Chowdhury A., Ross A. Fusing MFCC and LPC Features using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals. IEEE Transactions on Information Forensics and Security, vol. 15, 2019. – Pp. 1616-1629.

10. Piczak K.J. Environmental sound classification with convolutional neural networks // IEEE International workshop on machine learning for signal processing, Boston, 2015.

11. Bales C., Nabeel M., John N.C., Masood U., Qureshi H., Farooq H. Can Machine Learning Be Used to Recognize and Diagnose Coughs? // The 8th IEEE International Conference on E-Health and Bioengineering, Romania, 2020.

12. Amoh A., Odame K. Deep Neural Networks for Identifying Cough Sounds. IEEE Transactions on Biomedical Circuits and Systems, vol. 10, No. 5, 2016. – Pp. 1003-1011.

13. Kosasih K., Abeyratne U., Swarnkar V., Triasih R. Wavelet Augmented Cough Analysis for Rapid Childhood Pneumonia Diagnosis. IEEE Transactions on biomedical engineering, vol. 62, No. 4, 2015. – Pp. 1185-1194.

14. Barata F., Kipfer K., Weber M., Tinschert P., Fleisch E., Kowatsch T. Towards Device-Agnostic Mobile Cough Detection with Convolutional Neural Networks // IEEE International Conference on Healthcare Informatics, Xi’an, 2019.

15. Khomsay S., Vanijjirattikhan R., Suwatthikul J. Cough detection using PCA and Deep Learning // Intern. Conf. on Information and Communication Technology Convergence (ICTC), Jeju, 2019.

16. Chen X., Hu X. Zhai G. Cough Detection Using Selected Informative Features from Audio Signals. Cirnell University, August, 2021.

17. Rashid H.-A., Mazumder A.N., Pan U., Niyogi K., Mohsenin T. CoughNet: A Flexible Low Power CNN-LSTM Processor for Cough Sound Detection // IEEE 3rd International Conference on Artificial Intelligence Circuits and Systems (AICAS), Washington DC, 2021.

18. Bansal V., Pahwa G., Kannan N. Cough Classification for COVID-19 based on audio mfcc features using Convolutional Neural Networks // IEEE International Conference on Computing, Power and Communication Technologies (GUCON), Greater Noida, 2020.

19. Piczak K.J. ESC: Dataset for Environmental Sound Classification // Proceedings of the 23rd Annual ACM Conference on Multimedia, Brisbane, 2015.

20. Abadi M., Barham P., Chen P., Chen Z, Davis A. TensorFlow: A System for Large-Scale Machine Learning. Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’16). November, 2016 Savannah, GA, USA.


Review

For citations:


Vishniakou U.A., Shaya B. Methodics and tools of cough sound processing on basic of neural net. «System analysis and applied information science». 2023;(1):35-41. (In Russ.) https://doi.org/10.21122/2309-4923-2023-1-35-41

Views: 310


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2309-4923 (Print)
ISSN 2414-0481 (Online)