Repository logo

Depth-Aware Deep Learning Networks for Object Detection and Image Segmentation

dc.contributor.authorDickens, James
dc.contributor.supervisorPayeur, Pierre
dc.date.accessioned2021-09-01T20:19:26Z
dc.date.available2021-09-01T20:19:26Z
dc.date.issued2021-09-01en_US
dc.description.abstractThe rise of convolutional neural networks (CNNs) in the context of computer vision has occurred in tandem with the advancement of depth sensing technology. Depth cameras are capable of yielding two-dimensional arrays storing at each pixel the distance from objects and surfaces in a scene from a given sensor, aligned with a regular color image, obtaining so-called RGBD images. Inspired by prior models in the literature, this work develops a suite of RGBD CNN models to tackle the challenging tasks of object detection, instance segmentation, and semantic segmentation. Prominent architectures for object detection and image segmentation are modified to incorporate dual backbone approaches inputting RGB and depth images, combining features from both modalities through the use of novel fusion modules. For each task, the models developed are competitive with state-of-the-art RGBD architectures. In particular, the proposed RGBD object detection approach achieves 53.5% mAP on the SUN RGBD 19-class object detection benchmark, while the proposed RGBD semantic segmentation architecture yields 69.4% accuracy with respect to the SUN RGBD 37-class semantic segmentation benchmark. An original 13-class RGBD instance segmentation benchmark is introduced for the SUN RGBD dataset, for which the proposed model achieves 38.4% mAP. Additionally, an original depth-aware panoptic segmentation model is developed, trained, and tested for new benchmarks conceived for the NYUDv2 and SUN RGBD datasets. These benchmarks offer researchers a baseline for the task of RGBD panoptic segmentation on these datasets, where the novel depth-aware model outperforms a comparable RGB counterpart.en_US
dc.identifier.urihttp://hdl.handle.net/10393/42619
dc.identifier.urihttp://dx.doi.org/10.20381/ruor-26839
dc.language.isoenen_US
dc.publisherUniversité d'Ottawa / University of Ottawaen_US
dc.rightsAttribution 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectDeep learningen_US
dc.subjectComputer visionen_US
dc.subjectCNNen_US
dc.subjectObject detectionen_US
dc.subjectSemantic segmentationen_US
dc.subjectInstance segmentationen_US
dc.subjectMulti-modal deep learningen_US
dc.subjectPanoptic segmentationen_US
dc.subjectArtificial intelligenceen_US
dc.subjectConvolutional neural networksen_US
dc.subjectNeural networksen_US
dc.subjectRGBDen_US
dc.subjectDepth imagesen_US
dc.titleDepth-Aware Deep Learning Networks for Object Detection and Image Segmentationen_US
dc.typeThesisen_US
thesis.degree.disciplineGénie / Engineeringen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMCSen_US
uottawa.departmentScience informatique et génie électrique / Electrical Engineering and Computer Scienceen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
Dickens_James_2021_thesis.pdf
Size:
26.49 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
6.65 KB
Format:
Item-specific license agreed upon to submission
Description: