Abstract
In this paper, we describe our submissions to the SemEval-2023 contest. We tackled subtask 12 - “AfriSenti-SemEval: Sentiment Analysis for Low-resource African Languages using Twitter Dataset". We developed different models for 12 African languages and a 13th model for a multilingual dataset built from these 12 languages. We applied a wide variety of word and char n-grams based on their tf-idf values, 4 classical machine learning methods, 2 deep learning methods, and 3 oversampling methods. We used 12 sentiment lexicons and applied extensive hyperparameter tuning.
Original language | English |
---|---|
Title of host publication | 17th International Workshop on Semantic Evaluation, SemEval 2023 - Proceedings of the Workshop |
Editors | Atul Kr. Ojha, A. Seza Dogruoz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori |
Publisher | Association for Computational Linguistics |
Pages | 365-378 |
Number of pages | 14 |
ISBN (Electronic) | 9781959429999 |
DOIs | |
State | Published - 2023 |
Externally published | Yes |
Event | 17th International Workshop on Semantic Evaluation, SemEval 2023, co-located with the 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023 - Hybrid, Toronto, Canada Duration: 13 Jul 2023 → 14 Jul 2023 |
Publication series
Name | 17th International Workshop on Semantic Evaluation, SemEval 2023 - Proceedings of the Workshop |
---|
Conference
Conference | 17th International Workshop on Semantic Evaluation, SemEval 2023, co-located with the 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023 |
---|---|
Country/Territory | Canada |
City | Hybrid, Toronto |
Period | 13/07/23 → 14/07/23 |
Bibliographical note
Publisher Copyright:© 2023 Association for Computational Linguistics.