Abstract
In this paper, we describe our submissions for PAN at CLEF 2021 contest. We tackled the subtask “Profiling Hate Speech Spreaders on Twitter”. We developed different models for English and Spanish languages, using classic machine learning methods like Support Vector Classifier, Multi-Layer Perceptron, Logistic Regression, Random Forest, Ada-Boost Classifier and K-Neighbors Classifier to more recent deep learning methods like BERT and Bidirectional LSTM.
| Original language | English |
|---|---|
| Pages (from-to) | 2178-2185 |
| Number of pages | 8 |
| Journal | CEUR Workshop Proceedings |
| Volume | 2936 |
| State | Published - 2021 |
| Event | 22nd Working Notes of CLEF - Conference and Labs of the Evaluation Forum, CLEF-WN 2021 - Virtual, Online, Romania Duration: 21 Sep 2021 → 24 Sep 2021 |
Bibliographical note
Publisher Copyright:© 2021 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
Funding
We thank the Bar-Ilan Data Science Institute for kindly providing server for training our models. Without their support, this research would not have been possible. We are also grateful to the organizers and reviewers who gave us the opportunity to do this research.
| Funders |
|---|
| Bar-Ilan data science institute |
Keywords
- Ada-boost classifier
- Author profiling
- BERT
- English
- Hate speech
- LSTM
- Logistic regression
- MLP
- Random forest
- SVM
- Spanish
Fingerprint
Dive into the research topics of 'Detecting Hate Speech Spreaders on Twitter using LSTM and BERT in English and Spanish'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver