HeidelTime.Hr - A Temporal Expression Tagger With Resources For Croatian

Version: 1.0
Release date: October 9, 2014

1 Description

HeidelTime is a rule-based, multilingual, cross-domain, open-source temporal expression tagger. Available here is the 1.6 version of HeidelTime with prepared resources for tagging of Croatian texts, as well as resources for pre-processing required for Croatian.

For details, please check the following paper:

Skukan, L.,Glavaš, G.,Šnajder, J.(2014). HeidelTime.Hr: Extracting and Normalizing Temporal Expressions in Croatian. In Proceedings of the Ninth Language Technologies Conference, Ljubljana. Information Society, 99-103. [paper]

If you use this dataset for your own work, please cite the above paper. The BibTeX citation is:

@inproceedings{skukan2014heideltimehr,
  title={HeidelTime.Hr: Extracting and Normalizing Temporal Expressions in Croatian},
  author={Skuka, Luka and Glava\v{s}, Goran and {\v{S}}najder, Jan},
  booktitle={Proceedings of the Nineth Language Technologies Conference},
  pages={99-103},
  year={2014},
  organization={Information Society}
}

2 Dataset

The dataset is available from here: TakeLab-HeidelTimeHr.tar.gz.

The archive contains two files and two directories. Details on setting up the tool for tagging of Croatian texts are given in the file USER-GUIDE.txt. The resources required for pre-processing are given in the src/ directory, while the HeidelTimeExecutable/ directory contains an executable[1] version of HeidelTime with added resources for Croatian.

[1] - Provided the steps detailed in User-GUIDE.txt are executed.

3 License

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.