
The role of named entities in text classification

dc.contributor.author: Armour, Quintin
dc.date.accessioned: 2013-11-07T18:11:58Z
dc.date.available: 2013-11-07T18:11:58Z
dc.date.created: 2005
dc.date.issued: 2005
dc.degree.level: Masters
dc.degree.name: M.A.Sc.
dc.description.abstract: Named entities, typically associated with names of people, places and organizations, constitute a group of textual elements present in almost any type of document. The general techniques used to extract them and their variable-length property also make them an attractive type of attribute to study in text classification. In this thesis, several datasets are characterized as being either dependent on or independent of named entities with a Naive Bayes-based ranking technique. Using this characterization, results are presented showing that named entities are in fact useful in classification tasks and that accuracy can be improved by treating them as a special type of attribute. Namely, the inclusion of regular terms, the named entity representation and the frequency with which a classifier is retrained all have an impact on the classification of documents where named entities are important.
dc.format.extent: 96 p.
dc.identifier.citation: Source: Masters Abstracts International, Volume: 44-04, page: 1919.
dc.identifier.uri: http://hdl.handle.net/10393/26840
dc.identifier.uri: http://dx.doi.org/10.20381/ruor-9815
dc.language.iso: en
dc.publisher: University of Ottawa (Canada)
dc.subject.classification: Engineering, Electronics and Electrical.
dc.title: The role of named entities in text classification
dc.type: Thesis

Files

Original bundle

Name: MR11206.PDF
Size: 3.59 MB
Format: Adobe Portable Document Format