Large-Scale Hate Speech Detection with Cross-Domain Transfer

Link to publication: https://aclanthology.org/2022.lrec-1.238/

Link to data: https://github.com/avaapm/hatespeech

Task description: Three-class (Hate speech, Offensive language, None)

Details of task: Hate speech detection on social media (Twitter) including 5 target groups (gender, race, religion, politics, sports)

Size of dataset: 100k English (27593 hate, 30747 offensive, 41660 none)

Percentage abusive: 58.3

Language: English

Level of annotation: Posts

Platform: Twitter

Medium: Text, Image

Reference: Cagri Toraman, Furkan Şahinuç, Eyup Yilmaz. 2022. Large-Scale Hate Speech Detection with Cross-Domain Transfer. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 2215–2225, Marseille, France. European Language Resources Association.