Repository logo

Moving Sound Sources Direction of Arrival Classification Using Different Deep Learning Schemes

dc.contributor.authorRusrus, Jana
dc.contributor.supervisorBouchard, Martin
dc.contributor.supervisorShirmohammadi, Shervin
dc.date.accessioned2023-04-19T19:28:19Z
dc.date.available2023-04-19T19:28:19Z
dc.date.issued2023-04-19en_US
dc.description.abstractSound source localization is an important task for several applications and the use of deep learning for this task has recently become a popular research topic. While the majority of the previous work has focused on static sound sources, in this work we evaluate the performance of a deep learning classification system for localization of high-speed moving sound sources. In particular, we systematically evaluate the effect of a wide range of parameters at three levels including: data generation (e.g., acoustic conditions), feature extraction (e.g., STFT parameters), and model training (e.g., neural network architectures). We evaluate the performance of multiple metrics in terms of precision, recall, F-score and confusion matrix in a multi-class multi-label classification framework. We used four different deep learning models: feedforward neural networks, recurrent neural network, gated recurrent networks and temporal Convolutional neural network. We showed that (1) the presence of some reverberation in the training dataset can help in achieving better detection for the direction of arrival of acoustic sources, (2) window size does not affect the performance of static sources but highly affects the performance of moving sources, (3) sequence length has a significant effect on the performance of recurrent neural network architectures, (4) temporal convolutional neural networks can outperform both recurrent and feedforward networks for moving sound sources, (5) training and testing on white noise is easier for the network than training on speech data, and (6) increasing the number of elements in the microphone array improves the performance of the direction of arrival estimation.en_US
dc.identifier.urihttp://hdl.handle.net/10393/44824
dc.identifier.urihttp://dx.doi.org/10.20381/ruor-29030
dc.language.isoenen_US
dc.publisherUniversité d'Ottawa / University of Ottawaen_US
dc.subjectDirection of arrivalen_US
dc.subjectDeep learninigen_US
dc.subjectSound source localizationen_US
dc.titleMoving Sound Sources Direction of Arrival Classification Using Different Deep Learning Schemesen_US
dc.typeThesisen_US
thesis.degree.disciplineGénie / Engineeringen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMAScen_US
uottawa.departmentScience informatique et génie électrique / Electrical Engineering and Computer Scienceen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
Rusrus_Jana_2023_thesis.pdf
Size:
4.79 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
6.65 KB
Format:
Item-specific license agreed upon to submission
Description: