Abstract
In this paper, we describe our submissions to SemEval-2022 contest. We tackled subtask 6-A - 'iSarcasmEval: Intended Sarcasm Detection In English and Arabic - Binary Classification". We developed different models for two languages: English and Arabic. We applied 4 supervised machine learning methods, 6 preprocessing methods for English and 3 for Arabic, and 3 oversampling methods. Our best submitted model for the English test dataset was an SVC model that balanced the dataset using SMOTE and removed stop words. For the Arabic test dataset, our best submitted model was an SVC model that preprocessed removed longation.
Original language | English |
---|---|
Title of host publication | SemEval 2022 - 16th International Workshop on Semantic Evaluation, Proceedings of the Workshop |
Editors | Guy Emerson, Natalie Schluter, Gabriel Stanovsky, Ritesh Kumar, Alexis Palmer, Nathan Schneider, Siddharth Singh, Shyam Ratan |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 1031-1038 |
Number of pages | 8 |
ISBN (Electronic) | 9781955917803 |
State | Published - 2022 |
Externally published | Yes |
Event | 16th International Workshop on Semantic Evaluation, SemEval 2022 - Seattle, United States Duration: 14 Jul 2022 → 15 Jul 2022 |
Publication series
Name | SemEval 2022 - 16th International Workshop on Semantic Evaluation, Proceedings of the Workshop |
---|
Conference
Conference | 16th International Workshop on Semantic Evaluation, SemEval 2022 |
---|---|
Country/Territory | United States |
City | Seattle |
Period | 14/07/22 → 15/07/22 |
Bibliographical note
Publisher Copyright:© 2022 Association for Computational Linguistics.