Monitoring Tweets for Depression to Detect At-Risk Users
| dc.contributor.author | Jamil, Zunaira | |
| dc.contributor.supervisor | Inkpen, Diana | |
| dc.date.accessioned | 2017-05-03T15:01:37Z | |
| dc.date.available | 2017-05-03T15:01:37Z | |
| dc.date.issued | 2017 | |
| dc.description.abstract | According to the World Health Organization, mental health is an integral part of health and well-being. Mental illness can affect anyone, rich or poor, male or female. One such example of mental illness is depression. In Canada 5.3% of the population had presented a depressive episode in the past 12 months. Depression is difficult to diagnose, resulting in high under-diagnosis. Diagnosing depression is often based on self-reported experiences, behaviors reported by relatives, and a mental status examination. Currently, author- ities use surveys and questionnaires to identify individuals who may be at risk of depression. This process is time-consuming and costly. We propose an automated system that can identify at-risk users from their public social media activity. More specifically, we identify at-risk users from Twitter. To achieve this goal we trained a user-level classifier using Support Vector Machine (SVM) that can detect at-risk users with a recall of 0.8750 and a precision of 0.7778. We also trained a tweet-level classifier that predicts if a tweet indicates distress. This task was much more difficult due to the imbalanced data. In the dataset that we labeled, we came across 5% distress tweets and 95% non-distress tweets. To handle this class imbalance, we used undersampling methods. The resulting classifier uses SVM and performs with a recall of 0.8020 and a precision of 0.1237. Our system can be used by authorities to find a focused group of at-risk users. It is not a platform for labeling an individual as a patient with depres- sion, but only a platform for raising an alarm so that the relevant authorities could take necessary interventions to further analyze the predicted user to confirm his/her state of mental health. We respect the ethical boundaries relating to the use of social media data and therefore do not use any user identification information in our research. | en |
| dc.identifier.uri | http://hdl.handle.net/10393/36030 | |
| dc.identifier.uri | http://dx.doi.org/10.20381/ruor-20310 | |
| dc.language.iso | en | en |
| dc.publisher | Université d'Ottawa / University of Ottawa | en |
| dc.subject | NLP | en |
| dc.subject | Machine Learning | en |
| dc.subject | Tweets | en |
| dc.subject | text mining | en |
| dc.subject | social media | en |
| dc.subject | sentiment analysis | en |
| dc.title | Monitoring Tweets for Depression to Detect At-Risk Users | en |
| dc.type | Thesis | en |
| thesis.degree.discipline | Génie / Engineering | en |
| thesis.degree.level | Masters | en |
| thesis.degree.name | MCS | en |
| uottawa.department | Science informatique et génie électrique / Electrical Engineering and Computer Science | en |
