Depth-Aware Deep Learning Networks for Object Detection and Image Segmentation

Dickens, James

Depth-Aware Deep Learning Networks for Object Detection and Image Segmentation

dc.contributor.author	Dickens, James
dc.contributor.supervisor	Payeur, Pierre
dc.date.accessioned	2021-09-01T20:19:26Z
dc.date.available	2021-09-01T20:19:26Z
dc.date.issued	2021-09-01	en_US
dc.description.abstract	The rise of convolutional neural networks (CNNs) in the context of computer vision has occurred in tandem with the advancement of depth sensing technology. Depth cameras are capable of yielding two-dimensional arrays storing at each pixel the distance from objects and surfaces in a scene from a given sensor, aligned with a regular color image, obtaining so-called RGBD images. Inspired by prior models in the literature, this work develops a suite of RGBD CNN models to tackle the challenging tasks of object detection, instance segmentation, and semantic segmentation. Prominent architectures for object detection and image segmentation are modified to incorporate dual backbone approaches inputting RGB and depth images, combining features from both modalities through the use of novel fusion modules. For each task, the models developed are competitive with state-of-the-art RGBD architectures. In particular, the proposed RGBD object detection approach achieves 53.5% mAP on the SUN RGBD 19-class object detection benchmark, while the proposed RGBD semantic segmentation architecture yields 69.4% accuracy with respect to the SUN RGBD 37-class semantic segmentation benchmark. An original 13-class RGBD instance segmentation benchmark is introduced for the SUN RGBD dataset, for which the proposed model achieves 38.4% mAP. Additionally, an original depth-aware panoptic segmentation model is developed, trained, and tested for new benchmarks conceived for the NYUDv2 and SUN RGBD datasets. These benchmarks offer researchers a baseline for the task of RGBD panoptic segmentation on these datasets, where the novel depth-aware model outperforms a comparable RGB counterpart.	en_US
dc.identifier.uri	http://hdl.handle.net/10393/42619
dc.identifier.uri	http://dx.doi.org/10.20381/ruor-26839
dc.language.iso	en	en_US
dc.publisher	Université d'Ottawa / University of Ottawa	en_US
dc.rights	Attribution 4.0 International	*
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	*
dc.subject	Deep learning	en_US
dc.subject	Computer vision	en_US
dc.subject	CNN	en_US
dc.subject	Object detection	en_US
dc.subject	Semantic segmentation	en_US
dc.subject	Instance segmentation	en_US
dc.subject	Multi-modal deep learning	en_US
dc.subject	Panoptic segmentation	en_US
dc.subject	Artificial intelligence	en_US
dc.subject	Convolutional neural networks	en_US
dc.subject	Neural networks	en_US
dc.subject	RGBD	en_US
dc.subject	Depth images	en_US
dc.title	Depth-Aware Deep Learning Networks for Object Detection and Image Segmentation	en_US
dc.type	Thesis	en_US
thesis.degree.discipline	Génie / Engineering	en_US
thesis.degree.level	Masters	en_US
thesis.degree.name	MCS	en_US
uottawa.department	Science informatique et génie électrique / Electrical Engineering and Computer Science	en_US

Fichiers

Trousse originale

Voici les éléments 1 - 1 sur 1

Nom:: Dickens_James_2021_thesis.pdf
Taille:: 26.49 MB
Format:: Adobe Portable Document Format
Description:

Télécharger

Trousse de licence

Voici les éléments 1 - 1 sur 1

Nom:: license.txt
Taille:: 6.65 KB
Format:: Item-specific license agreed upon to submission
Description:

Télécharger

Collections

- Thèses, 2011 - // Theses, 2011 -