Embeddia at SemEval-2019 task 6: Detecting hate with neural network and transfer learning approaches

Andraž Pelicon, Matej Martinc, Petra Kralj Novak

Research output: Contribution to Book/Report typesConference contributionpeer-review

Abstract (may include machine translation)

SemEval-2019 Task 6 was OffensEval: Identifying and Categorizing Offensive Language in Social Media. The task was further divided into three sub-tasks: offensive language identification, automatic categorization of offense types, and offense target identification. In this paper, we present the approaches used by the Embeddia team, who qualified as fourth, eighteenth and fifth on the three sub-tasks. A different model was trained for each sub-task. For the first sub-task, we used a BERT model fine-tuned on the provided dataset, while for the second and third tasks we developed a custom neural network architecture which combines bag-of-words features and automatically generated sequence-based features. Our results show that combining automatically and manually crafted features fed into a neural architecture outperform transfer learning approach on more unbalanced datasets.

Original languageEnglish
Title of host publicationNAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop
PublisherAssociation for Computational Linguistics (ACL)
Pages604-610
Number of pages7
ISBN (Electronic)9781950737062
StatePublished - 2019
Externally publishedYes
Event13th International Workshop on Semantic Evaluation, SemEval 2019, co-located with the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2019 - Minneapolis, United States
Duration: 6 Jun 20197 Jun 2019

Publication series

NameNAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop

Conference

Conference13th International Workshop on Semantic Evaluation, SemEval 2019, co-located with the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2019
Country/TerritoryUnited States
CityMinneapolis
Period6/06/197/06/19

Fingerprint

Dive into the research topics of 'Embeddia at SemEval-2019 task 6: Detecting hate with neural network and transfer learning approaches'. Together they form a unique fingerprint.

Cite this