Turn "cevapcici" into "ćevapčići" with DIACRO, a robust system for automatic diacritics restoration in Croatian texts.
Tools and Utilities
Looking to create parallel corpora for machine translation or other purposes? CORAL (CORpus ALigner) can facilitate the task for you.
If you are looking for a tool to extract domain-specific terminology from a collection of documents in that domain, TermeX might be the solution.