Version: 1.0
Release date: September 24, 2019
If you use this dataset for your own work, please cite the above paper. The BibTeX citation is:
@inproceedings{karan2019preemptive,
  title = {Preemptive Toxic Language Detection in Wikipedia Comments Using Thread-Level Context},
  author = {Karan, Mladen and {\v{S}}najder, Jan},
  booktitle = {Proceedings of the Third Workshop on Abusive Language Online},
  pages = {129--134},
  year = {2019},
  address = {Florence},
  publisher = {Association for Computational Linguistics}
}
The dataset is available for download as pretox-wiki-data.tar.gz.
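Once the archive has been downloaded, it can be unpacked with any tar tool. The following is a minimal Python sketch, assuming the file has been saved as pretox-wiki-data.tar.gz in the current directory. The helper name `extract_archive` and the output directory `pretox-wiki-data` are illustrative choices, and the archive's internal layout is not documented here, so the script only lists whatever it finds.

```python
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir):
    """Extract a .tar.gz archive into dest_dir and return its member names.

    Hypothetical helper for illustration; not part of the dataset release.
    """
    dest = Path(dest_dir).resolve()
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, "r:gz") as tar:
        members = tar.getmembers()
        # Refuse members that would be written outside the destination.
        for member in members:
            target = (dest / member.name).resolve()
            if target != dest and dest not in target.parents:
                raise ValueError(f"unsafe path in archive: {member.name}")
        tar.extractall(dest, members=members)
    return [member.name for member in members]


if __name__ == "__main__":
    # Assumed local file name and output directory.
    for name in extract_archive("pretox-wiki-data.tar.gz", "pretox-wiki-data"):
        print(name)
```

The path check is a precaution that applies to any downloaded archive, not a statement about this one.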
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.