Monitoring Tweets for Depression to Detect At-Risk Users

Jamil, Zunaira

Monitoring Tweets for Depression to Detect At-Risk Users

dc.contributor.author	Jamil, Zunaira
dc.contributor.supervisor	Inkpen, Diana
dc.date.accessioned	2017-05-03T15:01:37Z
dc.date.available	2017-05-03T15:01:37Z
dc.date.issued	2017
dc.description.abstract	According to the World Health Organization, mental health is an integral part of health and well-being. Mental illness can affect anyone, rich or poor, male or female. One such example of mental illness is depression. In Canada 5.3% of the population had presented a depressive episode in the past 12 months. Depression is difficult to diagnose, resulting in high under-diagnosis. Diagnosing depression is often based on self-reported experiences, behaviors reported by relatives, and a mental status examination. Currently, author- ities use surveys and questionnaires to identify individuals who may be at risk of depression. This process is time-consuming and costly. We propose an automated system that can identify at-risk users from their public social media activity. More specifically, we identify at-risk users from Twitter. To achieve this goal we trained a user-level classifier using Support Vector Machine (SVM) that can detect at-risk users with a recall of 0.8750 and a precision of 0.7778. We also trained a tweet-level classifier that predicts if a tweet indicates distress. This task was much more difficult due to the imbalanced data. In the dataset that we labeled, we came across 5% distress tweets and 95% non-distress tweets. To handle this class imbalance, we used undersampling methods. The resulting classifier uses SVM and performs with a recall of 0.8020 and a precision of 0.1237. Our system can be used by authorities to find a focused group of at-risk users. It is not a platform for labeling an individual as a patient with depres- sion, but only a platform for raising an alarm so that the relevant authorities could take necessary interventions to further analyze the predicted user to confirm his/her state of mental health. We respect the ethical boundaries relating to the use of social media data and therefore do not use any user identification information in our research.	en
dc.identifier.uri	http://hdl.handle.net/10393/36030
dc.identifier.uri	http://dx.doi.org/10.20381/ruor-20310
dc.language.iso	en	en
dc.publisher	Université d'Ottawa / University of Ottawa	en
dc.subject	NLP	en
dc.subject	Machine Learning	en
dc.subject	Tweets	en
dc.subject	text mining	en
dc.subject	social media	en
dc.subject	sentiment analysis	en
dc.title	Monitoring Tweets for Depression to Detect At-Risk Users	en
dc.type	Thesis	en
thesis.degree.discipline	Génie / Engineering	en
thesis.degree.level	Masters	en
thesis.degree.name	MCS	en
uottawa.department	Science informatique et génie électrique / Electrical Engineering and Computer Science	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Jamil_Zunaira_2017_thesis.pdf
Size:: 1.06 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 6.65 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

- Thèses, 2011 - // Theses, 2011 -