Multi-Class Detection of Abusive Language Using Automated Machine Learning

Jorgensen, Mackenzie; Choi, Minho; Niemann, Marco; Brunk, Jens; Becker, Jörg

Multi-Class Detection of Abusive Language Using Automated Machine Learning

Jorgensen Mackenzie, Choi Minho, Niemann Marco, Brunk Jens, Becker Jörg

Zusammenfassung

Abusive language detection online is a daunting task for moderators. We propose Automated Machine Learning (Auto-ML) to semi-automate abusive language detection and to assist moderators. In this paper, we show that multi-class classification powered by Auto-ML is successful in detecting abusive language in English and German as well as and better than the state-ofthe- art machine learning models. We also highlight how we combatted the imbalanced data problem in our data-sets through feature selection and undersampling methods. We propose Auto-ML as a promising approach to the field of abusive language detection, especially for small companies who may have little machine learning knowledge and computing resources.

Schlüsselwörter

Abusive Language Detection, Automated-Machine Learning, Multi-Class Classification

Zitieren als

Jorgensen, M., Choi, M., Niemann, M., Brunk, J., & Becker, J. (2020). Multi-Class Detection of Abusive Language Using Automated Machine Learning.

Details

Publikationstyp

Forschungsartikel in Online-Sammlung (Konferenz)

Begutachtet

Ja

Publikationsstatus

Veröffentlicht

Jahr

2020

Konferenz

15. Internationale Tagung Wirtschaftsinformatik (WI 2020)

Konferenzort

Potsdam

Sprache

Englisch

DOI

https://doi.org/10.30844/wi_2020_r7-jorgensen

Gesamter Text

R7_Jorgensen-Multi-Class_Detection_of_Abusive_Language_Using_Automated_Machine_Learning-248_c.pdf