UB Paderborn / Katalog / Suche / Details

Ergebnis 14 von 2421876

Proceedings of the 11th ACM Workshop on Artificial Intelligence and Security, 2018, p.2-12

2018

Volltextzugriff (PDF)

Autor(en) / Beteiligte

Titel

All You Need is "Love": Evading Hate Speech Detection

Ist Teil von

Proceedings of the 11th ACM Workshop on Artificial Intelligence and Security, 2018, p.2-12

Ort / Verlag

New York, NY, USA: ACM

Erscheinungsjahr

2018

Quelle

ACM Digital Library Complete

Beschreibungen/Notizen

With the spread of social networks and their unfortunate use for hate speech, automatic detection of the latter has become a pressing problem. In this paper, we reproduce seven state-of-the-art hate speech detection models from prior work, and show that they perform well only when tested on the same type of data they were trained on. Based on these results, we argue that for successful hate speech detection, model architecture is less important than the type of data and labeling criteria. We further show that all proposed detection techniques are brittle against adversaries who can (automatically) insert typos, change word boundaries or add innocuous words to the original hate speech. A combination of these methods is also effective against Google Perspective - a cutting-edge solution from industry. Our experiments demonstrate that adversarial training does not completely mitigate the attacks, and using character-level features makes the models systematically more attack-resistant than using word-level features.

Sprache: Englisch
Identifikatoren: ISBN: 9781450360043, 1450360041
DOI: 10.1145/3270101.3270103
Titel-ID: cdi_acm_books_10_1145_3270101_3270103_brief

Format: –
Schlagworte: Computing methodologies -- Artificial intelligence -- Natural language processing, Computing methodologies -- Machine learning -- Learning paradigms -- Multi-task learning -- Transfer learning, Computing methodologies -- Machine learning -- Learning paradigms -- Supervised learning, Computing methodologies -- Machine learning -- Learning paradigms -- Supervised learning -- Supervised learning by classification, Computing methodologies -- Machine learning -- Machine learning approaches -- Neural networks, Social and professional topics -- Computing -- technology policy -- Censorship -- Hate speech, Social and professional topics -- Computing -- technology policy -- Censorship -- Political speech, Social and professional topics -- Computing -- technology policy -- Censorship -- Technology and censorship

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX