Air pollution has been on the rise for quite a while now and with it is the increasing number of cases involving respiratory diseases. These respiratory diseases range from the mild ones to the most severe ones. Therefore, we need to detect these diseases. The most efficient way of checking the health of the lung is through the use of lung sounds. But the lung sounds carry with them the minute variations that a doctor or any human sometimes misses. To tackle the same, this paper proposes a deep learning model that detects a person’s lung disease by processing lung sounds. The ICBHI 2017 dataset comprising the lung sound audios and their corresponding labels is used to train the proposed model. The dataset consists of lung sounds and some additional background noise has been removed using the bandpass Butterworth filter, whose frequency range has been set to 250Hz - 2000Hz. Later, the amount of data is multiplied through data augmentation for audio signals. The augmented data is later converted to mel spectrograms, which are, in turn, fed into the deep learning model. The model that gives the best accuracy is EfficientNet-B0 + attention with a training accuracy of 99.72%, validation accuracy of 99.82%, precision, recall and f1-scores of 99.82% as well. Furthermore, a comparison of the pre-trained model’s training and inference times has been performed. Although the proposed model takes a significant amount of training time, it takes the least amount of inference time compared to the other pre-trained models that have been tested.
Lung sound disease detection using attention over pre-trained efficientnet architecture, 2024.
Lung sound disease detection using attention over pre-trained efficientnet architecture
Bhatt, Chintan
;Bruno, Alessandro
2024-01-01
Abstract
Air pollution has been on the rise for quite a while now and with it is the increasing number of cases involving respiratory diseases. These respiratory diseases range from the mild ones to the most severe ones. Therefore, we need to detect these diseases. The most efficient way of checking the health of the lung is through the use of lung sounds. But the lung sounds carry with them the minute variations that a doctor or any human sometimes misses. To tackle the same, this paper proposes a deep learning model that detects a person’s lung disease by processing lung sounds. The ICBHI 2017 dataset comprising the lung sound audios and their corresponding labels is used to train the proposed model. The dataset consists of lung sounds and some additional background noise has been removed using the bandpass Butterworth filter, whose frequency range has been set to 250Hz - 2000Hz. Later, the amount of data is multiplied through data augmentation for audio signals. The augmented data is later converted to mel spectrograms, which are, in turn, fed into the deep learning model. The model that gives the best accuracy is EfficientNet-B0 + attention with a training accuracy of 99.72%, validation accuracy of 99.82%, precision, recall and f1-scores of 99.82% as well. Furthermore, a comparison of the pre-trained model’s training and inference times has been performed. Although the proposed model takes a significant amount of training time, it takes the least amount of inference time compared to the other pre-trained models that have been tested.File | Dimensione | Formato | |
---|---|---|---|
s11042-024-20078-1.pdf
Non accessibile
Tipologia:
Documento in Post-print
Dimensione
2.94 MB
Formato
Adobe PDF
|
2.94 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.