Repository logo

The IntelliTweet: Unveiling Malicious Activities in Tweets Through a Multifaceted Feature Analysis

dc.contributor.authorDzeha, Eric
dc.contributor.supervisorJourdan, Guy-Vincent
dc.date.accessioned2024-05-21T22:11:04Z
dc.date.available2024-05-21T22:11:04Z
dc.date.issued2024-05-21
dc.description.abstractSocial media platforms have seamlessly integrated into our daily communication, facilitating information sharing, connections and engagement for both individuals and businesses. Among these platforms, Twitter has emerged as one of the popular platforms for its rapid information dissemination and real-time interaction capabilities. However, the widespread adoption of Twitter has also attracted malicious activities such as phishing, spam, and scams, which take advantage of the platform's extensive reach to spread rapidly. In this Thesis, we introduce "The IntelliTweet," a machine learning system designed to enhance real-time detection and classification of malicious tweets on Twitter. IntelliTweet employs a multifaceted feature approach by integrating content analysis, user profile attributes, sentiment analysis, URL analysis and term frequency-inverse document frequency (TF-IDF) techniques. This holistic methodology considers the contextual nature of tweets, as well as content-based features and user behavior patterns, to accurately distinguish malicious tweets from legitimate ones, including user-reported tweets that raise awareness about threats. Our work began with an in-depth review of existing literature and the landscape of Twitter-centric threats, identifying shortcomings in current detection methodologies ranging from traditional assessments to machine learning classifiers. We subsequently delved into the conceptualization of IntelliTweet as well as the feature design integrating tweet metadata, user profiles, and linguistic nuances within tweets. As part of this work, we created a database by collecting tweets in real-time directly from the Twitter stream. This database contains a mix of malicious tweets, legitimate tweets, and user-reported tweets, allowing us to analyze the interactions between user generated warnings and responses to malicious activities on Twitter. We conducted experiments including model selection, feature importance analysis, grid search optimization, hyperparameter tuning, and t-tests, providing a thorough evaluation of IntelliTweet's performance. Validating both binary and multiclass system configurations, IntelliTweet's precision-centric approach demonstrates reliability and significant improvements with results achieving 98.80% precision, 98.15% F1-score, and a low 0.07 false positive rate on real-world Twitter data. By prioritizing false alarm reduction and maximizing the global precision, IntelliTweet minimizes the mislabeling of legitimate users to account for the real-world implications of user misclassification such as account suspension. IntelliTweet represents a positive step towards Twitter security and positive user experiences, contributing to cybersecurity evolution and providing valuable insights for mitigating emerging threats on the platform. We also suggest in this Thesis some future research directions, including integrating user-centric features and cross-linguistic detection, and considering real-world applications and ethical considerations. It also proposes developing a global, multilingual defense mechanism against digital threats.
dc.identifier.urihttp://hdl.handle.net/10393/46262
dc.identifier.urihttps://doi.org/10.20381/ruor-30359
dc.language.isoen
dc.publisherUniversité d'Ottawa | University of Ottawa
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectTwitter
dc.subjectMalicious tweets
dc.subjectMachine learning
dc.subjectPhishing
dc.subjectScam
dc.subjectSpam
dc.subjectFeatures
dc.subjectText classification
dc.subjectSentiment analysis
dc.subjectURL Analysis
dc.subjectObfuscation techniques
dc.subjectSocial media
dc.subjectCybercrime
dc.subjectIntellelliTweet
dc.subjectPhishing Report
dc.subjectSecurity
dc.titleThe IntelliTweet: Unveiling Malicious Activities in Tweets Through a Multifaceted Feature Analysis
dc.typeThesisen
thesis.degree.disciplineGénie / Engineering
thesis.degree.levelMasters
thesis.degree.nameMCS
uottawa.departmentScience informatique et génie électrique / Electrical Engineering and Computer Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
Dzeha_Eric_2024_thesis.pdf
Size:
8.24 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
6.65 KB
Format:
Item-specific license agreed upon to submission
Description: