Deep recurrent neural networks for audio classification in construction sites
Document Type
Conference Proceeding
Publication Date
1-24-2021
Abstract
In this paper, we propose a Deep Recurrent Neural Network (DRNN) approach based on Long-Short Term Memory (LSTM) units for the classification of audio signals recorded in construction sites. Five classes of multiple vehicles and tools, normally used in construction sites, have been considered. The input provided to the DRNN consists in the concatenation of several spectral features, like MFCCs, mel-scaled spectrogram, chroma and spectral contrast. The proposed architecture and the feature extraction have been described. Some experimental results, obtained by using real-world recordings, demonstrate the effectiveness of the proposed idea. The final overall accuracy on the test set is up to 97% and overcomes other state-of-the-art approaches.
Publication Source (Journal or Book title)
European Signal Processing Conference
First Page
810
Last Page
814
Recommended Citation
Scarpiniti, M., Comminiello, D., Uncini, A., & Lee, Y. (2021). Deep recurrent neural networks for audio classification in construction sites. European Signal Processing Conference, 2021-January, 810-814. https://doi.org/10.23919/Eusipco47968.2020.9287802