Target-Based Offensive Language Identification

Zampieri, Marcos and Morgan, Skye and North, Kai and Simmons, Austin and Khandelwal, Paridhi and Ranasinghe, Tharindu and Rosenthal, Sara and Nakov, Preslav (2023) Target-Based Offensive Language Identification. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics :. Association for Computational Linguistics (ACL Anthology), Stroudsberg, Pa., pp. 762-770. ISBN 9781959429715

Full text not available from this repository.

Abstract

We present TBO, a new dataset for Target-based Offensive language identification. TBO contains post-level annotations regarding the harmfulness of an offensive post and token-level annotations comprising of the target and the offensive argument expression. Popular offensive language identification datasets for social media focus on annotation taxonomies only at the post level and more recently, some datasets have been released that feature only token-level annotations. TBO is an important resource that bridges the gap between post-level and token-level annotation datasets by introducing a single comprehensive unified annotation taxonomy. We use the TBO taxonomy to annotate post-level and token-level offensive language on English Twitter posts. We release an initial dataset of over 4,500 instances collected from Twitter and we carry out multiple experiments to compare the performance of different models trained and tested on TBO.

Item Type:
Contribution in Book/Report/Proceedings
ID Code:
222058
Deposited By:
Deposited On:
13 Aug 2024 09:25
Refereed?:
Yes
Published?:
Published
Last Modified:
13 Aug 2024 09:25