A deep learning based multilingual hate speech detection for resource scarce languages

Beshay, Nabil

doi:doi:10.7939/r3-msjg-2090

This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

View

Download

Communities and Collections

Concordia University of Edmonton / Master of Science in Information Technology Project Reports (Concordia University of Edmonton)

Usage

232 views
569 downloads

A deep learning based multilingual hate speech detection for resource scarce languages

Author(s) / Creator(s)
- Beshay, Nabil
Over the last decade, the increased use of social media has led to an increase in hateful activities in social networks. An international issue that weakens the cohesiveness of civil societies is hate speech on internet social networks. Due to the lack of restrictions set by these sites for its users to express their views as they like. Hate speech is one of the most dangerous of these activities, so users have to protect themselves from these activities from social media sites such as YouTube, Facebook, Twitter, etc. Large-scale social platforms are currently investing important resources into automatically detecting and classifying hateful content, without much success.this research introduces a method for using deep learning algorithms to predict hate speech from social media websites. We implement proposed algorithms to detect hate speech in five different language: Arabic, English, and Urdu. this study employs a variety of feature engineering techniques and a comparative study among different machine and deep learning algorithms to automatically detect hate speech messages on many datasets. after hate speech data is collected, as a part of the preprocessing steaming, token splitting, character removal, and inflection elimination are carried out before performing the hate speech recognition process through deep learning algorithms. In the future, we would like to deploy proposed algorithms to other low resource languages and explore more language specific features in deep learning framework.
Date created

2022
Subjects / Keywords
Type of Item

Research Material
DOI

https://doi.org/10.7939/r3-msjg-2090
License

Attribution-NonCommercial 4.0 International

Language
- English