A Novel Semantic Feature Fusion-based Pedestrian Detection System to Support Autonomous Vehicles

Sha, Mingzhi

A Novel Semantic Feature Fusion-based Pedestrian Detection System to Support Autonomous Vehicles

dc.contributor.author	Sha, Mingzhi
dc.contributor.supervisor	Boukerche, Azzedine
dc.date.accessioned	2021-05-27T18:23:39Z
dc.date.available	2021-05-27T18:23:39Z
dc.date.issued	2021-05-27	en_US
dc.description.abstract	Intelligent transportation systems (ITS) have become a popular method to enhance the safety and efficiency of transportation. Pedestrians, as an essential participant of ITS, are very vulnerable in a traffic collision, compared with the passengers inside the vehicle. In order to protect the safety of all traffic participants and enhance transportation efficiency, the novel autonomous vehicles are required to detect pedestrians accurately and timely. In the area of pedestrian detection, deep learning-based pedestrian detection methods have gained significant development since the appearance of powerful GPUs. A large number of researchers are paying efforts to improve the accuracy of pedestrian detection by utilizing the Convolutional Neural Network (CNN)-based detectors. In this thesis, we propose a one-stage anchor-free pedestrian detector named Bi-Center Network (BCNet), which is aided by the semantic features of pedestrians' visible parts. The framework of our BCNet has two main modules: the feature extraction module produces the concatenated feature maps that extracted from different layers of ResNet, and the four parallel branches in the detection module produce the full body center keypoint heatmap, visible part center keypoint heatmap, heights, and offsets, respectively. The final bounding boxes are converted from the high response points on the fused center keypoint heatmap and corresponding predicted heights and offsets. The fused center keypoint heatmap contains the semantic feature fusion of the full body and the visible part of each pedestrian. Thus, we conduct ablation studies and discover the efficiency of feature fusion and how visibility features benefit the detector's performance by proposing two types of approaches: introducing two weighting hyper-parameters and applying three different attention mechanisms. Our BCNet gains 9.82% MR-2 (the lower the better) on the Reasonable setup of the CityPersons dataset, compared to baseline model which gains 12.14% MR-2 . The experimental results indicate that the performance of pedestrian detection could be significantly improved because the visibility semantic could prompt stronger responses on the heatmap. We compare our BCNet with state-of-the-art models on the CityPersons dataset and ETH dataset, which shows that our detector is effective and achieves a promising performance.	en_US
dc.identifier.uri	http://hdl.handle.net/10393/42213
dc.identifier.uri	http://dx.doi.org/10.20381/ruor-26435
dc.language.iso	en	en_US
dc.publisher	Université d'Ottawa / University of Ottawa	en_US
dc.subject	Pedestrian Detection	en_US
dc.subject	Autonomous Vehicles	en_US
dc.subject	Deep Learning	en_US
dc.subject	Intelligent transportation systems	en_US
dc.title	A Novel Semantic Feature Fusion-based Pedestrian Detection System to Support Autonomous Vehicles	en_US
dc.type	Thesis	en_US
thesis.degree.discipline	Génie / Engineering	en_US
thesis.degree.level	Masters	en_US
thesis.degree.name	MSc	en_US
uottawa.department	Science informatique et génie électrique / Electrical Engineering and Computer Science	en_US

Fichiers

Trousse originale

Voici les éléments 1 - 1 sur 1

Nom:: Sha_Mingzhi_2021_thesis.pdf
Taille:: 43.41 MB
Format:: Adobe Portable Document Format
Description:

Télécharger

Trousse de licence

Voici les éléments 1 - 1 sur 1

Nom:: license.txt
Taille:: 6.65 KB
Format:: Item-specific license agreed upon to submission
Description:

Télécharger

Collections

- Thèses, 2011 - // Theses, 2011 -