Repository logo

Improving Classification and Attribute Clustering: An Iterative Semi-supervised Approach

dc.contributor.authorSeifi, Farid
dc.contributor.supervisorMatwin, Stan
dc.contributor.supervisorJapkowicz, Nathalie
dc.contributor.supervisorDrummond, Chris
dc.date.accessioned2015-03-18T17:17:20Z
dc.date.available2015-03-18T17:17:20Z
dc.date.created2015
dc.date.issued2015
dc.degree.disciplineGénie / Engineering
dc.degree.leveldoctorate
dc.degree.namePhD
dc.description.abstractThis thesis proposes a novel approach to attribute clustering. It exploits the strength of semi-supervised learning to improve the quality of attribute clustering particularly when labeled data is limited. The significance of this work derives in part from the broad, and increasingly important, usage of attribute clustering to address outstanding problems within the machine learning community. This form of clustering has also been shown to have strong practical applications, being usable in heavyweight industrial applications. Although researchers have focused on supervised and unsupervised attribute clustering in recent years, semi-supervised attribute clustering has not received substantial attention. In this research, we propose an innovative two step iterative semi-supervised attribute clustering framework. This new framework, in each iteration, uses the result of attribute clustering to improve a classifier. It then uses the classifier to augment the training data used by attribute clustering in next iteration. This iterative framework outputs an improved classifier and attribute clustering at the same time. It gives more accurate clusters of attributes which better fit the real relations between attributes. In this study we proposed two new usages for attribute clustering to improve classification: solving the automatic view definition problem for multi-view learning and improving missing attribute-value handling at induction and prediction time. The application of these two new usages of attribute clustering in our proposed semi-supervised attribute clustering is evaluated using real world data sets from different domains.
dc.faculty.departmentScience informatique et génie électrique / Electrical Engineering and Computer Science
dc.identifier.urihttp://hdl.handle.net/10393/32140
dc.identifier.urihttp://dx.doi.org/10.20381/ruor-2821
dc.language.isoen
dc.publisherUniversité d'Ottawa / University of Ottawa
dc.subjectAttribute Clustering
dc.subjectClassification
dc.subjectBig Data
dc.subjectSemi-supervised Learning
dc.subjectMachine Learning
dc.subjectMulti-view Learning
dc.subjectMissing Attribute-value Handling
dc.titleImproving Classification and Attribute Clustering: An Iterative Semi-supervised Approach
dc.typeThesis
thesis.degree.disciplineGénie / Engineering
thesis.degree.levelDoctoral
thesis.degree.namePhD
uottawa.departmentScience informatique et génie électrique / Electrical Engineering and Computer Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
Seifi_Farid_2015_thesis.pdf
Size:
1.11 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
4.07 KB
Format:
Item-specific license agreed upon to submission
Description: