Incremental Machine Learning for Text Classification in Comment Moderation Systems

Wolters; Anna;, Müller; Kilian;, Riehle; Dennis, Maximilian

Incremental Machine Learning for Text Classification in Comment Moderation Systems

Wolters, Anna; Müller, Kilian; Riehle, Dennis Maximilian

Abstract

Over the last decade, researchers presented (semi-)automated comment moderation systems (CMS) based on machine learning (ML) and natural language processing (NLP) techniques to support the identification of hateful and offensive comments in online discussion forums. A common challenge in providing and operating comment moderation systems is the dynamic nature of language. As language evolves over time, continuous performance evaluations and resource-inefficient model retraining are applied to ensure high-quality identification of hate speech in the long-term use of comment moderation systems. To study the potentials of adaptable machine learning models embedded in comment moderation systems, we present an incremental machine learning approach for semi-automated comment moderation systems. This study shows a comparison of incrementally-trained ML models and batch-trained ML models used in comment moderation systems.

Keywords
Incremental Learning; Text Classification; Comment Moderation Systems

Publication type

Research article in proceedings (conference)

Peer reviewed

Yes

Publication status

Published

Year

2022

Conference

4th Multidisciplinary International Symposium on Disinformation in Open Online Media

Venue

Boise, ID

Book title

Disinformation in Open Online Media - 4th Multidisciplinary International Symposium, MISDOOM 2022, Boise, ID, USA, October 11–12, 2022, Proceedings

Editor

Spezzano, Francesca; Amaral, Adriana; Ceolin, Davide; Fazio, Lisa; Serra, Edoardo

Start page

138

End page

153

Volume

13545

Title of series

Lecture Notes in Computer Science

Publisher

Springer Nature

Place

Cham

Language

English

ISSN

0302-9743

ISBN

978-3-031-18252-5

DOI

https://doi.org/10.1007/978-3-031-18253-2_10