CroWSI - Graph-Based Induction of Word Senses in Croatian
November 1, 2015
CroWSI is a set of word sense induction (WSI) experiments for Croatian based on co-occurrence graphs.
The dataset is available for download as TakeLab-CroWSI.zip.
The archive contains three files:
- a gold-standard evaluation dataset (evaluation_dataset.json),
- a fine-grained induced sense inventory for the 10,000 most frequent Croatian words (fine_grained_sense_inventory.json),
- a coarse-grained induced sense inventory for the 10,000 most frequent Croatian words (coarse_grained_sense_inventory.json).
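The three files can be read directly from the archive without unpacking it. The sketch below is only an illustration: the internal JSON layout is not documented here, so the code makes no assumption about it and simply reports the top-level type and size of each file. The function names load_crowsi and summarize are hypothetical helpers, and the code assumes the files may sit either at the archive root or inside a subfolder.

```python
import json
import zipfile
from pathlib import PurePosixPath

# File names as listed above; their location inside the archive is an assumption,
# so members are matched by base name only.
EXPECTED_FILES = (
    "evaluation_dataset.json",
    "fine_grained_sense_inventory.json",
    "coarse_grained_sense_inventory.json",
)


def load_crowsi(archive_path):
    """Return {file name: parsed JSON} for each expected file found in the archive."""
    loaded = {}
    with zipfile.ZipFile(archive_path) as archive:
        for member in archive.namelist():
            name = PurePosixPath(member).name
            if name in EXPECTED_FILES:
                with archive.open(member) as handle:
                    # json.load accepts a binary file object and detects the encoding.
                    loaded[name] = json.load(handle)
    return loaded


def summarize(loaded):
    """Map each file name to (top-level type name, length or None)."""
    summary = {}
    for name, data in loaded.items():
        size = len(data) if isinstance(data, (list, dict)) else None
        summary[name] = (type(data).__name__, size)
    return summary


# Example use, assuming the archive has been downloaded to the working directory:
#     data = load_crowsi("TakeLab-CroWSI.zip")
#     for name, info in summarize(data).items():
#         print(name, info)
```

Inspecting the summary first is a reasonable way to find out whether each file is a list of records or a dictionary keyed by word before writing any further processing code.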
This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License.