Repository logo

Self-Organizing Neural Visual Models to Learn Feature Detectors and Motion Tracking Behaviour by Exposure to Real-World Data

dc.contributor.authorYogeswaran, Arjun
dc.contributor.supervisorPayeur, Pierre
dc.date.accessioned2018-01-08T17:21:23Z
dc.date.available2018-01-08T17:21:23Z
dc.date.issued2018
dc.description.abstractAdvances in unsupervised learning and deep neural networks have led to increased performance in a number of domains, and to the ability to draw strong comparisons between the biological method of self-organization conducted by the brain and computational mechanisms. This thesis aims to use real-world data to tackle two areas in the domain of computer vision which have biological equivalents: feature detection and motion tracking. The aforementioned advances have allowed efficient learning of feature representations directly from large sets of unlabeled data instead of using traditional handcrafted features. The first part of this thesis evaluates such representations by comparing regularization and preprocessing methods which incorporate local neighbouring information during training on a single-layer neural network. The networks are trained and tested on the Hollywood2 video dataset, as well as the static CIFAR-10, STL-10, COIL-100, and MNIST image datasets. The induction of topography or simple image blurring via Gaussian filters during training produces better discriminative features as evidenced by the consistent and notable increase in classification results that they produce. In the visual domain, invariant features are desirable such that objects can be classified despite transformations. It is found that most of the compared methods produce more invariant features, however, classification accuracy does not correlate to invariance. The second, and paramount, contribution of this thesis is a biologically-inspired model to explain the emergence of motion tracking behaviour in early development using unsupervised learning. The model’s self-organization is biased by an original concept called retinal constancy, which measures how similar visual contents are between successive frames. In the proposed two-layer deep network, when exposed to real-world video, the first layer learns to encode visual motion, and the second layer learns to relate that motion to gaze movements, which it perceives and creates through bi-directional nodes. This is unique because it uses general machine learning algorithms, and their inherent generative properties, to learn from real-world data. It also implements a biological theory and learns in a fully unsupervised manner. An analysis of its parameters and limitations is conducted, and its tracking performance is evaluated. Results show that this model is able to successfully follow targets in real-world video, despite being trained without supervision on real-world video.en
dc.identifier.urihttp://hdl.handle.net/10393/37096
dc.identifier.urihttp://dx.doi.org/10.20381/ruor-21368
dc.language.isoenen
dc.publisherUniversité d'Ottawa / University of Ottawaen
dc.subjectrestricted Boltzmann machineen
dc.subjectself-organizationen
dc.subjectdeep learningen
dc.subjectdeep belief networken
dc.subjectunsupervised learningen
dc.subjectsmooth pursuiten
dc.subjectsaccadeen
dc.subjectmotion trackingen
dc.subjectfeature learningen
dc.subjectinvarianceen
dc.subjectGaussian filteren
dc.subjectneural networken
dc.subjectHebbian learningen
dc.subjectreal-world dataen
dc.subjectbiologically-inspireden
dc.subjectimage classificationen
dc.subjectfeature extractionen
dc.subjectvisual attentionen
dc.subjectretinal slipen
dc.subjectsaliency mapen
dc.titleSelf-Organizing Neural Visual Models to Learn Feature Detectors and Motion Tracking Behaviour by Exposure to Real-World Dataen
dc.typeThesisen
thesis.degree.disciplineGénie / Engineeringen
thesis.degree.levelDoctoralen
thesis.degree.namePhDen
uottawa.departmentScience informatique et génie électrique / Electrical Engineering and Computer Scienceen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
Yogeswaran_Arjun_2018_thesis.pdf
Size:
8.69 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
6.65 KB
Format:
Item-specific license agreed upon to submission
Description: