Hate Speech Detection: Leveraging Machine Learning & Deep Learning for A Safer Internet
Keywords:
Machine Learning, Ensemble classifiers, Deep Learning, BERT, Hate Speech detection, Natural Language ProcessingAbstract
This paper proposes and analyses a hate speech detection model using various Machine learning and Deep Learning Techniques for text classification. It introduces an innovative Hate Speech Detection Model designed to address the escalating challenge of hate speech on social media platforms. The surge in online hate poses a significant threat to community well-being, necessitating advanced tools for accurate identification and mitigation. Conventional methods often struggle with the dynamic nature of language, prompting the development of a model that leverages state of-the-art natural language processing techniques and machine learning algorithms. Comparative analyses against existing models underscore its effectiveness. Our Hate Speech Detection Model emerges as a robust and scalable solution to combat online hate, contributing to the creation of a safer and more inclusive digital space for all users. Hate speech detection involves discovering if communication like texts, posts, comments, replies, audio, videos etc. contains hate or if it tries to instigate violence towards a community or a person. This is usually based on prejudice against 'protected characteristics' such as their ethnicity, gender, sexual orientation, religion, age etc. Social media is a platform for communication through which users create online communities to share information, ideas and personal messages in addition to other content. Social media is an essential part of many people’s lives. It is in everything, and it is everywhere. It is omnipresent. But unfortunately, trivial communication between people has now been hijacked and replaced by the ill-humoured, and ill-intentioned culture of trolling, which has become endemic on all social media sites.
In conclusion, our Hate Speech Detection Model stands as a formidable tool in the ongoing battle against online hate. By harnessing the power of advanced natural language processing techniques, the model showcases promising results, providing a scalable solution to address the escalating challenges posed by hate speech. As digital spaces continue to evolve, our model serves as a beacon, promoting inclusivity and fostering a safer online community for all.
References
Aluru, Sai Saketh, et al. "Deep learning models for multilingual hate speech detection." arXiv preprint arXiv:2004.06465 (2020).
Aljero, Mona Khalifa A., and Nazife Dimililer. "A novel stacked ensemble for hate speech recognition." Applied Sciences 11.24 (2021): 11684.
Mosca, Edoardo, Maximilian Wich, and Georg Groh. "Understanding and interpreting the impact of user context in hate speech detection." Proceedings of the Ninth International Workshop on Natural Language Processing for Social Media. 2021.
Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems 30 (2017).
Abro, Sindhu, et al. "Automatic hate speech detection using machine learning: A comparative study." International Journal of Advanced Computer Science and Applications 11.8 (2020).
Huang, Xiaolei, et al. "Multilingual twitter corpus and baselines for evaluating demographic bias in hate speech recognition." arXiv preprint arXiv:2002.10361 (2020).
Alfina, I., Mulia, R., Fanany, M.I., Ekanata, Y.: Hate speech detection in the indonesian language: A dataset and preliminary study. In: 2017 International Conference on Advanced
Computer Science and Information Systems (ICACSIS). pp. 233–238. IEEE (2017).
Nitish Srivastava, Geoffrey E Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. Dropout: a simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15(1):1929–1958, 2014.
Artetxe, M., Schwenk, H.: Massively multilingual sentence embeddings for zeroshot crosslingual transfer and beyond. Transactions of the Association for Computational Linguistics 7, 597–610 (2019)
Burnap, P., Williams, M.L.: Cyber hate speech on Twitter: an application of machine classification and statistical modeling for policy and decision making. Policy Internet 7(2), 223– 242 (2015)
Davidson, T., Warmsley, D., Macy, M., Weber, I.: Automated hate speech detection and the problem of offensive language. In: Proceedings of the 11th Conference on Web and Social Media. AAAI (2017)
Founta, A.M., Djouvas, C., Chatzakou, D., Leontiadis, I., Blackburn, J., Stringhini, G., Vakali, A., Sirivianos, M., Kourtellis, N.: Large scale crowdsourcing and characterization of twitter abusive behavior. In: Twelfth International AAAI Conference on Web and Social Media (2018)
Rosa, J., and Y. Bonilla, Deprovincializing Trump, decolonizing diversity, and unsettling anthropology. American Ethnologist, 2017. 44(2): p. 201-208.
Zhouhan Lin, Minwei Feng, Cicero Nogueira dos Santos, Mo Yu, Bing Xiang, Bowen Zhou, and Yoshua Bengio. A structured self-attentive sentence embedding. arXiv preprint arXiv:1703.03130, 2017.
Sainbayar Sukhbaatar, arthur szlam, Jason Weston, and Rob Fergus. End-to-end memory networks. In C. Cortes, N. D. Lawrence, D. D. Lee, M. Sugiyama, and R. Garnett, editors, Advances in Neural Information Processing Systems 28, pages 2440–2448. Curran Associates, Inc., 2015.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Journal of Applied Optics
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
The CC Attribution-NonCommercial 4.0 License allows sharing and adapting the work, provided the creator is credited and the work is not used commercially. Modifications must be indicated, and derivative works under the same license are allowed.