Feature Explorations for Hate Speech Classification

    Tatjana Scheffler, Erik Haegert, Santichai Pornavalai, Mino Lee Sasse

KONVENS 2018 - GermEval Proceedings, pp. 51-57, 2018/10/02

14th Conference on Natural Language Processing - KONVENS 2018


Abstract

In this work, we present a hate speech classifier for German tweets, developed for the GermEval 2018 Shared Task. Our best models are linear SVM classifiers that use character n-grams together with additional textual features. We achieve a macro F1-score of 0.77 (95% confidence interval: ±0.04) in cross-validation. We also present an ensemble classifier based on majority voting over three component models.
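
The abstract names three building blocks: character n-gram features, majority voting over three component models, and macro F1 as the evaluation metric. The following is a minimal standard-library sketch of these three ideas, not the authors' implementation. The function names, the n-gram range of 2 to 4, and the label strings in the usage note are all assumptions made for illustration; the SVM training step itself is omitted.

```python
from collections import Counter


def char_ngrams(text, n_min=2, n_max=4):
    """Count character n-grams of length n_min..n_max.

    The range 2..4 is an assumed default, not taken from the paper.
    """
    counts = Counter()
    for n in range(n_min, n_max + 1):
        for i in range(len(text) - n + 1):
            counts[text[i:i + n]] += 1
    return counts


def majority_vote(predictions):
    """Combine per-model label lists into one label per example.

    `predictions` holds one list of labels per component model, all of
    the same length. With three models and two labels a tie cannot
    occur; otherwise the label seen first among the tied ones wins.
    """
    combined = []
    for labels in zip(*predictions):
        combined.append(Counter(labels).most_common(1)[0][0])
    return combined


def macro_f1(gold, predicted):
    """Unweighted mean of the per-label F1 scores."""
    labels = sorted(set(gold) | set(predicted))
    pairs = list(zip(gold, predicted))
    scores = []
    for label in labels:
        tp = sum(1 for g, p in pairs if g == label and p == label)
        fp = sum(1 for g, p in pairs if g != label and p == label)
        fn = sum(1 for g, p in pairs if g == label and p != label)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        if precision + recall:
            scores.append(2 * precision * recall / (precision + recall))
        else:
            scores.append(0.0)
    return sum(scores) / len(scores)
```

As a usage example, given three hypothetical prediction lists `a`, `b`, and `c` over labels such as "OFFENSE" and "OTHER", `macro_f1(gold, majority_vote([a, b, c]))` scores the ensemble. Because macro averaging weights every label equally, the minority class counts as much as the majority class regardless of how many tweets carry each label.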