Version: 1.0
Release date: September 24, 2019
If you use this dataset for your own work, please cite the above paper. The BibTeX citation is:
@inproceedings{karan2019preemptive,
title={Preemptive Toxic Language Detection in Wikipedia Comments Using Thread-Level Context},
author={Karan, Mladen and {\v{S}}najder, Jan},
booktitle={Proceedings of the Third Workshop on Abusive Language Online},
pages={129--134},
year={2019},
address = {Florence},
publisher = {Association for Computational Linguistics}
}
The dataset is available from here: pretox-wiki-data.tar.gz.

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.