
Compact features for sentiment analysis

dc.contributor.author: Gaudette, Lisa
dc.date.accessioned: 2013-11-07T19:04:16Z
dc.date.available: 2013-11-07T19:04:16Z
dc.date.created: 2009
dc.date.issued: 2009
dc.degree.level: Masters
dc.degree.name: M.S.C.
dc.description.abstract: This work examines a novel method of developing features to use for machine learning of sentiment analysis and related tasks. This task is frequently approached using a Bag of Words representation -- one feature for each word encountered in the training data -- which can easily number in the thousands or tens of thousands. This thesis develops a set of "numeric" features, by learning scores for words, dividing the range of possible scores into a number of bins, and then generating features based on counting how many words in each document have scores in each bin. This allows for effective learning of sentiment and related tasks with 25 features; in fact, performance was very often slightly better with these features. This reduction in the number of features allows for the processing of much larger collections of texts than previously attempted. In addition, we carefully consider the problem of evaluating ordinal problems.
dc.format.extent: 98 p.
dc.identifier.citation: Source: Masters Abstracts International, Volume: 48-06, page: 3709.
dc.identifier.uri: http://hdl.handle.net/10393/28295
dc.identifier.uri: http://dx.doi.org/10.20381/ruor-19182
dc.language.iso: en
dc.publisher: University of Ottawa (Canada)
dc.subject.classification: Computer Science.
dc.title: Compact features for sentiment analysis
dc.type: Thesis
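
The binned-feature idea described in the abstract can be illustrated with a minimal sketch. The abstract does not say how word scores are learned or where bin boundaries fall, so everything here is an assumption made for illustration: the word scores are made up, the bins are equal-width over an assumed score range of [-1, 1], and the names make_bin_edges and binned_features are hypothetical. Four bins are used only to keep the example short.

```python
from bisect import bisect_right


def make_bin_edges(lo, hi, n_bins):
    """Return the n_bins - 1 interior edges of equal-width bins over [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + width * i for i in range(1, n_bins)]


def binned_features(tokens, word_scores, edges):
    """Count how many scored tokens fall in each bin.

    Tokens with no known score are skipped, so the feature vector
    has a fixed length of len(edges) + 1 regardless of vocabulary size.
    """
    counts = [0] * (len(edges) + 1)
    for tok in tokens:
        score = word_scores.get(tok)
        if score is not None:
            counts[bisect_right(edges, score)] += 1
    return counts


# Made-up word scores in [-1, 1]; the thesis learns its scores from data.
word_scores = {"great": 0.9, "good": 0.4, "okay": 0.1, "bad": -0.4, "awful": -0.9}

edges = make_bin_edges(-1.0, 1.0, 4)  # interior edges at -0.5, 0.0, 0.5
doc = "good plot but awful acting and a great ending".split()
features = binned_features(doc, word_scores, edges)
print(features)  # [1, 0, 1, 1]
```

The point of the sketch is that the length of the feature vector depends only on the number of bins, not on the size of the vocabulary, which is the reduction the abstract contrasts with a Bag of Words representation.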

Files

Original bundle

Name: MR61163.PDF
Size: 3.96 MB
Format: Adobe Portable Document Format