Repository logo

Content-Based Geolocation Prediction of Canadian Twitter Users and Their Tweets

dc.contributor.authorMetin, Ali Mert
dc.contributor.supervisorInkpen, Diana
dc.contributor.supervisorZhu, Xiaodan
dc.date.accessioned2019-08-13T18:51:43Z
dc.date.available2019-08-13T18:51:43Z
dc.date.issued2019-08-13en_US
dc.description.abstractLast decade witnessed the rise of online social networks, especially Twitter. Today, Twitteris a giant social platform with over 250 million users |who produce massive amounts of data everyday. This creates many research opportunities, speci cally for Natural Language Processing (NLP) in which text is utilized to extract information that could be used in many applications. One problem NLP might help solving is geolocation inference or geolocation detection from online social networks. Detecting the location of Twitter users based on the text of their tweets is useful since not many users publicly declare their locations or geotag their tweets. Location information is crucial for a variety of applications such as event detection, disease and illness tracking and user pro ling. These tasks are not trivial, because online content is often noisy; it includes misspellings, incomplete words or phrases, idiomatic expressions, abbreviations, acronyms, and Twitter-speci c literature. In this work, we attempted to detect the location of Canadian users |and tweets sent from Canada |at metropolitan areas and province level; this was not done before, to the best of our knowledge. In order to do this, we collected two di erent datasets, and applied a variety of machine learning, including deep learning methods. Besides, we also attempted to geolocate users based on their social graph (i.e., user's friends and followers) as a novel approach.en_US
dc.identifier.urihttp://hdl.handle.net/10393/39508
dc.identifier.urihttp://dx.doi.org/10.20381/ruor-23751
dc.language.isoenen_US
dc.publisherUniversité d'Ottawa / University of Ottawaen_US
dc.subjectGeolocationen_US
dc.subjectCanadian Twitter usersen_US
dc.subjectTwitteren_US
dc.subjectNatural language processingen_US
dc.titleContent-Based Geolocation Prediction of Canadian Twitter Users and Their Tweetsen_US
dc.typeThesisen_US
thesis.degree.disciplineGénie / Engineeringen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMScen_US
uottawa.departmentScience informatique et génie électrique / Electrical Engineering and Computer Scienceen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
Metin_Ali_Mert_2019_thesis.pdf
Size:
2.38 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
6.65 KB
Format:
Item-specific license agreed upon to submission
Description: