JCTDHS at SemEval-2019 task 5: Detection of hate speech in tweets using deep learning methods, character N-gram features, and preprocessing methods

Yaakov HaCohen-Kerner, Elyashiv Shayovitz, Shalom Rochman, Eli Cahn, Gal Didi, Ziv Ben-David

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

In this paper, we describe our submissions to SemEval-2019 contest. We tackled subtask A - “a binary classification where systems have to predict whether a tweet with a given target (women or immigrants) is hateful or not hateful”, a part of task 5 “Multilingual detection of hate speech against immigrants and women in Twitter (HatEval)”. Our system JCTDHS (Jerusalem College of Technology Detects Hate Speech) was developed for tweets written in English. We applied various supervised ML methods, various combinations of n-gram features using the TF-IDF scheme. In addition, we applied various combinations of eight basic preprocessing methods. Our best submission was a special bidirectional RNN, which was ranked at the 11th position out of 68 submissions.

Original languageEnglish
Title of host publicationNAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop
PublisherAssociation for Computational Linguistics (ACL)
Pages426-430
Number of pages5
ISBN (Electronic)9781950737062
StatePublished - 2019
Externally publishedYes
Event13th International Workshop on Semantic Evaluation, SemEval 2019, co-located with the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2019 - Minneapolis, United States
Duration: 6 Jun 20197 Jun 2019

Publication series

NameNAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop

Conference

Conference13th International Workshop on Semantic Evaluation, SemEval 2019, co-located with the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2019
Country/TerritoryUnited States
CityMinneapolis
Period6/06/197/06/19

Bibliographical note

Publisher Copyright:
© 2019 Association for Computational Linguistics

Fingerprint

Dive into the research topics of 'JCTDHS at SemEval-2019 task 5: Detection of hate speech in tweets using deep learning methods, character N-gram features, and preprocessing methods'. Together they form a unique fingerprint.

Cite this