Detection of Hate Speech in Marathi Using Language Specific Pre-Processing
DOI:
https://doi.org/10.69511/ijdsaa.v6i6.230Keywords:
Marathi language, pre-processing, social media, hate speech, detectionAbstract
The increasing accessibility of social media platforms has exponentially increased the amount of textual content on the internet. Among this textual content, there is also a rapid growth of hate speech on these social media platforms. This offensive and hateful speech are increasingly becoming a cause for self-harm, depression and even suicidal tendencies, motivating social media platforms to invest in strategies that can make social communities safer. Most of the research done on hate speech detection is done in languages like English, Spanish, French and more. There is also a good amount of work done on the Hindi language, but there is not much significant work done on the Marathi language apart from a few notable exceptions. Marathi has 83 million speakers many of whom consume Marathi social media content. This makes hate speech detection in the Marathi language a desirable opportunity to explore. How various pre-processing methods affect hate speech detection in Marathi were explored in this research. Furthermore, transfer learning was used to analyse the efficiency of multilingual hate speech detection models for the Marathi language. Finally, how hate speech detection in large texts can vary from that in shot text data was explored.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Ankur Sarode, Nailya Sultanova

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

International Journal of Data Science and Advanced Analytics (IJDSAA) is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. This license allows users to copy, distribute and transmit an article, adapt the article as long as the author is attributed and the article is not used for commercial purposes.
The author(s) confirms
- The manuscript submission has not been previously published, nor is it before another journal for consideration (or an explanation has been provided in Comments to the Editor).
- The published materials used in the manuscript were obtained permission for reproduction. (if any)