Josef Ruppendorfer - Melanie Siegel - Michael Wiegand (Hrsg.)

Proceedings of the GermEval 2018 Workshop

14th Conference on Natural Language Processing - KONVENS 2018

Verlag der Österreichischen Akademie der Wissenschaften
Austrian Academy of Sciences Press

A-1011 Wien, Dr. Ignaz Seipel-Platz 2
Tel. +43-1-515 81/DW 3420, Fax +43-1-515 81/DW 3400
https://verlag.oeaw.ac.at, e-mail: verlag@oeaw.ac.at

Die Konferenz zur Verarbeitung Natürlicher Sprache (KONVENS) soll den Erfahrungsaustausch auf hohem Niveau durch die Vorstellung computerlinguistischer Grundlagenforschung und ausgewählte Praxisvorträge von Experten befördern.

Copyright Cover: Melanie Siegel

Bestellung/Order

Proceedings of the GermEval 2018 Workshop

ISBN 978-3-7001-8435-5
Online Edition

Send or fax to your local bookseller or to:

Verlag der Österreichischen Akademie der Wissenschaften
Austrian Academy of Sciences Press

A-1011 Wien, Dr. Ignaz Seipel-Platz 2,
Tel. +43-1-515 81/DW 3420, Fax +43-1-515 81/DW 3400
https://verlag.oeaw.ac.at, e-mail: bestellung.verlag@oeaw.ac.at
UID-Nr.: ATU 16251605, FN 71839x Handelsgericht Wien, DVR: 0096385

Bitte senden Sie mir
Please send me

Exemplar(e) der genannten Publikation
copy(ies) of the publication overleaf

NAME

ADRESSE / ADDRESS

ORT / CITY

LAND / COUNTRY

ZAHLUNGSMETHODE / METHOD OF PAYMENT

Visa Euro / Master American Express

NUMMER

Ablaufdatum / Expiry date:

I will send a cheque Vorausrechnung / Send me a proforma invoice

DATUM, UNTERSCHRIFT / DATE, SIGNATURE

BANK AUSTRIA CREDITANSTALT, WIEN (IBAN AT04 1100 0006 2280 0100, BIC BKAUATWW), DEUTSCHE BANK MÜNCHEN (IBAN DE16 7007 0024 0238 8270 00, BIC DEUTDEDBMUC)

Challenges of Automatically Detecting Offensive Language Online: Participation Paper for the Germeval Shared Task 2018 ( H a UA )

Tom De Smedt,

Sylvia Jaki

konvens 2018 - GermEval Proceedings, pp. 27-32, 2018/10/02

14th Conference on Natural Language Processing - KONVENS 2018

PDF

Cite

X
BibTEX-Export:

X
EndNote/Zotero-Export:

X
RIS-Export:

Researchgate-Export (COinS)

Permanent QR-Code

Abstract

This paper presents our submission (HaUA) for Germeval Shared Task 1 (Binary Classification) on the identification of offensive language. With feature selection and features such as character ngrams, offensive word lexicons, and sentiment polarity, our SVM classifier is able to distinguish between offensive and nonoffensive Germanlanguage tweets with an indomain F1 score of 88.9%. In this paper, we report our methodology and discuss machine learning problems such as imbalance, overfitting, and the interpretability of machine learning algorithms. In the discussion section, we also briefly go beyond the technical perspectives and argue for a thorough discussion of the dilemma between internet security and freedom of speech, and what kind of language we are actually predicting with such algorithms.

Online Edition Table of Contents

Published Online: 2018/10/02 12:00:00

Object Identifier: 0xc1aa5572 0x003a10da

Document viewed: Calculating...

Josef Ruppendorfer - Melanie Siegel - Michael Wiegand (Hrsg.)

Proceedings of the GermEval 2018 Workshop

14th Conference on Natural Language Processing - KONVENS 2018

Challenges of Automatically Detecting Offensive Language Online: Participation Paper for the Germeval Shared Task 2018 ( H a UA )

Abstract