Publications

2017

  • Štajner, S., & Glavaš, G. (2017). Leveraging Event-Based Semantics for Automated Text Simplification. Expert Systems with Applications, 82(1), 383–395.
  • Štajner, S., Glavaš, G., Ponzetto, S. P., & Stuckenschmidt, H. (2017, January). Domain adaptation for automatic detection of speculative sentences. In  11th IEEE International Conference on Semantic Computing (ICSC), 2017, pp. 164–171.
  • Klein, P., Ponzetto, S. P., & Glavaš, G. (2017). Improving Neural Knowledge Base Completion with Cross-Lingual Projections. In Proceedings of EACL 2017, 516–522.
  • Glavaš, G., Nanni, F., & Ponzetto, S. P. (2017). Unsupervised cross-lingual scaling of political texts. In Proceedings of EACL 2017, 688–693.
  • di Buono M.P., 2017, Endpoint for Semantic Knowledge (ESK). In Barone L., Silberztein M., Monteleone M. (eds.) Automatic Processing of Natural-Language Electronic Texts with NooJ. 10th International Conference, NooJ 2016, České Budějovice, Czech Republic, June 9-11, 2016, Revised Selected Papers. CCIS Springer. ISBN 978-3-319-55001-5 [paper]
  • Gudivada V., Bhulai N. D., di Buono M.P. (eds.), 2017, Proceedings of ALLDATA 2017, The Third International Conference on Big Data, Small Data, Linked Data and Open Data. IARIA 2017. ISBN: 978-1-61208-552-4.
  • Monti J., Sangati F., de Santis A., di Buono M.P., Caruso V., , 2017, Corpus dell’Italiano annotato con ca. 3000 polirematiche verbali per PARSEME shared task on automatic identification of verbal MWEs (Italian corpus for PARSEME Shared Task on Verbal Multi-Word Expressions Identification). CC-BY-NC-SA 4.0. [data]

2016

  • Nanni, F., Dietz, L., Faralli, S., Glavaš, G., & Ponzetto, S. P. (2016). Capturing interdisciplinarity in academic abstracts. D-lib magazine22(9/10).
  • Nanni, F., Zirn, C., Glavaš, G., Eichorst, J., & Ponzetto, S. P. (2016). TopFish: topic-based analysis of political position in US electoral campaigns. In Proceedings of the International Conference on the Advances in Computational Analysis of Political Texts (PolText), 61–67.
  • Zirn, C., Glavaš, G., Nanni, F., Eichorts, J., & Stuckenschmidt, H. (2016). Classifying topics and detecting topic shifts in political manifestos. In Proceedings of the International Conference on the Advances in Computational Analysis of Political Texts (PolText), 88–93.
  • Copara, J., Ochoa, J., Thorne, C., & Glavaš, G. (2016). Spanish NER with Word Representations and Conditional Random Fields. In Proceedings of the Sixth Named Entity Workshop, 34–40.
  • Glavaš, G., Nanni, F., & Ponzetto, S. P. (2016). Unsupervised text segmentation using semantic relatedness graphs. In Proceedings of the 5th Joint Conference on Lexical and Computational Semantics (*SEM), 125–130.
  • Mauša, Goran; Galinac Grbac, Tihana; Dalbelo Bašić, Bojana
    A Systematic Data Collection Procedure for Software Defect Prediction // Computer Science and Information Systems,13 (2016), 1; 173-197. doi:10.2298/CSIS141228061M (članak, scientific)

  • Sebastian Padó, Aurelie Herbelot, Max Kisselew, Jan Šnajder. Predictability of Distributional Semantics in Derivational Word Formation. Proceedings of the 26th International Conference on Computational Linguistics (COLING 2016), Osaka. To appear. [paper] [data]
  • Sebastian Padó, Jan Šnajder, Jason Utt, Britta Zeller. Smoothing Syntax-Based Semantic Spaces: Let The Winner Take It All. Proceedings of the 13th Conference on Natural Language Processing (KONVENS 2016), Bochum. 186-191. [paper]
  • Federico Cerutti, Alexis Palmer, Ariel Rosenfeld, Jan Šnajder, Francesca Toni. A Pilot Study in Using Argumentation Frameworks for Online Debates. The First International Workshop on Systems and Algorithms for Formal Argumentation (SAFA 2016). 63-74. [paper]
  • Martin Tutek, Goran Glavaš, Jan Šnajder, Nataša Milic-Frayling, Bojana Dalbelo Bašić. Detecting and Ranking Conceptual Links between Texts. Proceedings of the 25th ACM International Conference on Information and Knowledge Management (CIKM 2016), Indianapolis. 2077-2080. [paper]
  • Filip Boltužić and Jan Šnajder (2016). Fill the Gap! Analyzing Implicit Premises between Claims from Online Debates. Proceedings of the 3rd Workshop on Argumentation Mining (ArgMining 2016), ACL 2016, Berlin. 124-133. [paper]
  • Damir Korenčić Marijana Grbešića-Zenzerović, Jan Šnajder. Topics and their Salience in the 2015 Parliamentary Election in Croatia: A Topic Model based Analysis of the Media Agenda. Proceedings of the International Conference on the Advances in Computational Analysis of Political Text (PolText 2016), Dubrovnik. To appear.
  • Mladen Karan, Jan Šnajder, Daniela Širinić, Goran Glavaš (2016). Analysis of Policy Agendas: Lessons Learned from Automatic Topic Classification of Croatian Political Texts. Proceedings of the 10th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH 2016), ACL 2016, Berlin. 12-21. [paper]
  • Martin Tutek, Ivan Sekulić, Paula Gombar, Ivan Paljak, Filip Čulinović, Filip Boltužić, Mladen Karan, Domagoj Alagić and Jan Šnajder (2016). TakeLab at SemEval-2016 Task 6: Stance Classification in Tweets Using a Genetic Algorithm Based Ensemble. Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval 2016), NAACL 2016, San Diego. 476-480. [paper]
  • Mladen Karan, Jan Šnajder (2016). FAQIR: A Frequently Asked Questions Retrieval Test Collection. Proceedings of the 19th International Conference on Text, Speech and Dialogue (TSD 2016), Brno. 74-81. [paper]
  • Domagoj Alagić and Jan Šnajder (2016). Cro36WSD: A Lexical Sample for Croatian Word Sense Disambiguation. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Portorož. 1689-1694. [paper]
  • Marko Bekavac and Jan Šnajder (2016). Graph-Based Induction of Word Senses in Croatian. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Portorož. 3014-3018. [paper]
  • Ivan Sekulić and Jan Šnajder (2016). VerbCROcean: A Repository of Fine-Grained Semantic Verb Relations for Croatian. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Portorož. 2676-2681. [paper]

2015

  • Glavaš, G. (2015). TAKELAB: Medical Information Extraction and Linking with MINERAL. In Proceedings of SemEval-2015, 389–393.
  • Mauša, Goran; Bogunović, Nikola; Galinac Grbac, Tihana; Dalbelo Bašić, Bojana
    Rotation Forest in Software Defect Prediction // Proceedings of SQAMIA 2015 / Budimac, Zoran ; Heričko, Marjan (ur.).
    Maribor, Slovenija, 2015. str. 35-43
  • Mauša, Goran; Galinac Grbac, Tihana; Dalbelo Bašić, Bojana Data Collection for Software Defect Prediction – an Exploratory Case Study of Open Source Software Projects // Proceedings of MIPRO CTI 2015 / Biljanović, Petar (ur.). Rijeka: Croatian Society for Information and Communication Technology, Electronics and Microelectronics – MIPRO, 2015. str. 513-519
  • Jan Šnajder and Petra Almić (2015). Modeling Semantic Compositionality of Croatian Multiword Expressions. Informatica, 39 (3). 301-309. [paper] [data]
  • Damir Korenčić, Strahil Ristov, Jan Šnajder. Getting the Agenda Right: Measuring Media Agenda using Topic Models. Workshop on Topic Models: Post-Processing and Applications, CIKM 2015, Melbourne. (In press.) [data]
  • Domagoj Alagić and Jan Šnajder (2015). Experiments on Active Learning for Croatian Word Sense Disambiguation. Proceedings of the 5th Workshop on Balto-Slavic Natural Language Processing, Hissar. 49-58. [paper] [slides] [data]
  • Goran Glavaš and Jan Šnajder (2015). Resolving Entity Coreference in Croatian with a Constrained Mention-Pair Model. Proceedings of the 5th Workshop on Balto-Slavic Natural Language Processing, Hissar. 17-23. [paper] [slides] [data]
  • Mladen Karan and Jan Šnajder (2015). Evaluation of Manual Query Expansion Rules on a Domain Specific FAQ Collection. In Experimental IR Meets Multilinguality, Multimodality, and Interaction (pp. 248-253). Springer International Publishing. [paper]
  • Filip Boltužić and Jan Šnajder (2015). Identifying Prominent Arguments in Online Debates Using Semantic Textual Similarity. Proceedings of the 2nd Workshop on Argumentation Mining (ArgMining 2015), NAACL 2015, Denver. 110-115. [paper] [slides]
  • Sebastian Padó, Alexis Palmer, Max Kisselew, Jan Šnajder (2015). Measuring Semantic Content To Assess Asymmetry in Derivation. Proceedings of the IWCS 2015 Workshop on Advances in Distributional Semantics. London. [paper]
  • Sebastian Padó, Britta Zeller, Jan Šnajder (2015). Morphological Priming in German: The Word is Not Enough (Or Is It?). Proceedings of NetWordS 2015, Pisa. 42-45. [paper]
  • Max Kisselew, Sebastian Padó, Alexis Palmer, Jan Šnajder (2015). Obtaining a Better Understanding of Distributional Models of German Derivational Morphology. Proceedings of the 11th International Conference on Computational Semantics (IWCS 2015), London. 58-63.[paper]
  • Mladen Karan, Goran Glavaš, Jan Šnajder, Bojana Dalbelo Bašić, Ivan Vulić, Marie-Francine Moens (2015). TKLBLIIR: Detecting Twitter Paraphrases with TweetingJay. Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), Denver. 70-74.[paper] [data]

2014

  • Mauša, Goran; Galinac Grbac, Tihana; Dalbelo Bašić, Bojana Software Defect Prediction with Bug-Code Analyzer – a Data Collection Tool Demo // Proceedings of SoftCOM 2014, Split, Hrvatska, 2014. (poster, međunarodna recenzija, sažetak, ostalo)
  • Alajković, Adrian; Banek, Marko; Dalbelo Bašić, Bojana; Humski, Luka; Kalpić, Damir; Karan, Mladen; Milojević, Jelena; Pintar, Damir; Pleša, Marina; Skočir, Zoran et al.Model predviđanja akcijskog udjela prodajnih mjesta u promocijama, 2014.
  • Goran Glavaš and Jan Šnajder (2014). Constructing Coherent Event Hierarchies from News Stories. Proceedings of the Workshop on Graph-based Methods for Natural Language Processing (TextGraphs-9) at 19th Conference on Empirical Methods in Natural Language Processing (EMNLP’14), Doha. 1-5. [paper]
  • Sebastian Padó, Britta Zeller, Jan Šnajder (2014). Towards Semantic Validation of a Derivational Lexicon. Proceedings the 25th International Conference on Computational Linguistics (COLING 2014), Dublin. 1728-1739. [paper]
  • Filip Boltužić and Jan Šnajder (2014). Back up your Stance: Recognizing Arguments in Online Discussions. Proceedings of the First Workshop on Argumentation Mining (ArgMining 2014), Association for Computational Linguistics, ACL 2014, Baltimore. 49-58. [paper] [slides][data]
  • Goran Glavaš and Jan Šnajder (2014). Construction and Evaluation of Event Graphs. Natural Language Engineering (to appear). [paper] [data]
  • Goran Glavaš and Jan Šnajder (2014). Event Graphs for Information Retrieval and Multi-Document Summarization. Expert Systems with Applications, Volume 41, Issue 15. 6904-6916. [paper]
  • Jan Šnajder (2014). DerivBase.hr: A High-Coverage Derivational Morphology Resource for Croatian. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), Reykjavik. 3371-3377. [paper] [poster] [data]
  • Goran Glavaš, Jan Šnajder, Marie-Francine Moens, Parisa Kordjamshidi (2014). HiEve: A Corpus for Extracting Event Hierarchies from News Stories. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), Reykjavik. 3678-3683.[paper]
  • Frane Šarić, Bojana Dalbelo Bašić Marie-Francine Moens, Jan Šnajder (2014). Multi-Label Classification of Croatian Legal Documents using EuroVoc Thesaurus. Proceedings of SPLeT-Semantic Processing of Legal Texts: Legal Resources and Access to Law workshop, LREC’14, Reykjavik. 7-12. [paper] [data]
  • Petra Almić and Jan Šnajder (2014). Determining the Semantic Compositionality of Croatian Multi-Word Expressions. Proceedings of the Ninth Language Technologies Conference, Information Society (IS-JT 2014), Ljubljana. 32-37. [paper] [slides] [data]
  • Krešimir, Baksa, Dino, Dolović, Goran, Glavaš, Jan Šnajder (2014). Named Entity Recognition in Croatian Tweets. Proceedings of the Ninth Language Technologies Conference, Information Society (IS-JT 2014), Ljubljana. 85-89. [paper]
  • Siniša Biđin, Jan Šnajder, Goran Glavaš (2014). Predicting Croatian Phrase Sentiment Using a Deep Matrix-Vector Model. Proceedings of the Ninth Language Technologies Conference, Information Society (IS-JT 2014), Ljubljana. 95-98. [paper]
  • Luka Skukan, Goran Glavaš, Jan Šnajder (2014). HeidelTime.Hr: Extracting and Normalizing Temporal Expressions in Croatian. Proceedings of the Ninth Language Technologies Conference, Information Society (IS-JT 2014), Ljubljana. 99-103. [paper] [data]
  • Leo Zuanović, Mladen Karan, Jan Šnajder (2014). Experiments with Neural Word Embeddings for Croatian. Proceedings of the Ninth Language Technologies Conference, Information Society (IS-JT 2014), Ljubljana. 69-72. [paper]

2013

  • Ivanac, Vedran; Dalbelo Bašić, Bojana; Vanjak, Zvonimir, Construction and Evaluation of Cellular Automata Lattice Based on the Semantics of an Urban Traffic Network // Journal of cellular automata, 8 (2013), 5/6; 417-428 (članak, scientific)
  • Galinac Grbac, Tihana; Mauša, Goran; Dalbelo Bašić, Bojana
    Stability of Software Defect Prediction in Relation to Levels of Data Imbalance // Proceedings of SQAMIA 2013 / Budimac, Zoran (ur.).
    Novi Sad, 2013. str. 1-10
  • Mauša, Goran; Galinac Grbac, Tihana; Dalbelo Bašić, Bojana; Pavčević, Mario-Osvin
    Hill Climbing and Simulated Annealing in Large Scale Next Release Problem // Proceedings of EuroCon 2013 / Kuzle, I. ; Capuder, T. ; Pandžić H. (ur.).
    Zagreb
  • Jan Šnajder (2013). Models for Predicting the Inflectional Paradigm of Unknown Croatian Words. Slovenščina 2.0, 1 (2): 1-34. [paper]
  • Goran Glavaš and Jan Šnajder (2013). Event-Centered Information Retrieval Using Kernels on Event Graphs. Proceedings of the 8th Workshop on Graph-Based Methods in Natural Language Processing (TextGraphs-8) at 18th International Conference on Empirical Methods in Natural Language Processing (EMNLP 2013). 1-5. [paper] [data]
  • Britta Zeller, Jan Šnajder, Sebastian Padó (2013). DErivBase: Inducing and Evaluating a Derivational Morphology Resource for German. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Sofia: Association for Computational Linguistics. 1201-1211. [paper] [slides] [data]
  • Sebastian Padó, Jan Šnajder, Britta Zeller (2013). Derivational Smoothing for Syntactic Distributional Semantics. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Sofia: Association for Computational Linguistics. 731-735. [paper] [slides]
  • Jan Šnajder, Sebastian Padó, Željko Agić (2013). Building and Evaluating a Distributional Memory for Croatian. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Sofia: Association for Computational Linguistics. 784-789. [paper] [slides] [data]
  • Goran Glavaš and Jan Šnajder (2013). Recognizing Identical Events with Graph Kernels. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Sofia: Association for Computational Linguistics. 797-803. [paper] [data]
  • Mladen Karan, Goran Glavaš, Frane Šarić, Jan Šnajder, Jure Mijić, Artur Šilić, Bojana Dalbelo Bašić (2013). CroNER: Recognizing Named Entities in Croatian Using Conditional Random Fields. Informatica 37. 165-172. [paper]
  • Mladen Karan, Lovro Žmak, Jan Šnajder (2013). Frequently Asked Questions Retrieval for Croatian Based on Semantic Textual Similarity. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, Sofia: Association for Computational Linguistics. 24-33. [paper] [slides] [data]
  • Goran Glavaš, Damir Korenčić, Jan Šnajder (2013). Aspect-Oriented Opinion Mining from User Reviews in Croatian. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, Sofia, Association for Computational Linguistics. 18-23.[paper] [slides] [data]
  • Marko Bekavac and Jan Šnajder (2013). GPKEX: Genetically Programmed Keyphrase Extraction from Croatian Texts. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, Sofia, Association for Computational Linguistics. 43-47. [paper] [slides] [data]
  • Goran Glavaš and Jan Šnajder (2013). Exploring Coreference Uncertainty of Generically Extracted Event Mentions. Lecture Notes in Computer Science; Computational Linguistics and Intelligent Text Processing. 7816, 408-422. [data]

2012

  • Mauša, Goran; Galinac Grbac, Tihana; Dalbelo Bašić, Bojana, Multivariate Logistic Regression Prediction of Fault-Proneness in Software Modules // Proceedings of the 35th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2012 / Golubić, Stjepan ; Mikac, Branko ; Hudek, Vlasta (ur.). Opatija, Croatia, 2012. str. 813-818
  • Mauša, Goran; Galinac Grbac, Tihana; Dalbelo Bašić, Bojana, Overview of search-based optimization algorithms used in software engineering // Proceedings of IN-TECH 2012 / Car, Zlatan ; Kudláček, Jan ; Pepelnjak, Tomaž (ur.). Rijeka, Croatia, 2012. str. 409-412.
  • Ivanac, Vedran; Dalbelo Bašić, Bojana; Vanjak, Zvonimir, Construction of Cellular Automata Lattice Based on the Semantics of an Urban Traffic Network // 10th International Conference on Cellular Automata for Research and Industry, ACRI 2012, Santorini Island, Greece, September 24-27, 2012. Proceedings / Ch. Sirakoulis, Georgios ; Bandini, Stefania (ur.). Heidelberg, Germany: Springer Berlin Heidelberg, 2012. str. 795-806
  • Štajduhar, Ivan; Dalbelo-Bašić, Bojana Uncensoring censored data for machine learning: A likelihood-based approach // Expert systems with applications, 39 (2012), 8; 7226-7234.
  • Šilić, Artur; Morin, Annie; Chauchat, Jean-Hugues; Dalbelo Bašić, Bojana
    Visualization of temporal text collections based on Correspondence Analysis // Expert systems with applications, 39 (2012), 15; 12143-12157. doi:10.1016/j.eswa.2012.04.040
  • Šilić, Artur; Dalbelo Bašić, Bojana
    Exploring Classification Concept Drift on a Large News Text Corpus // Springer Lecture Notes in Computer Science,7181 (2012), 1; 428-437. doi:10.1007/978-3-642-28604-9
  • Frane Šarić, Goran Glavaš, Mladen Karan, Jan Šnajder, Bojana Dalbelo Bašić (2012). TakeLab: Systems for Measuring Semantic Text Similarity. Proceedings of *SEM 2012: The First Joint Conference on Lexical and Computational Semantics, Montreal, Canada. Association for Computational Linguistics. 441-448. [paper] [data]
  • Jan Šnajder (2012). Guessing the Correct Inflectional Paradigm of Unknown Croatian Words. Proceedings of the Eighth Language Technologies Conference, Ljubljana. Information Society. 185-190. [paper] [slides]
  • Mladen Karan, Jan Šnajder, Bojana Dalbelo Bašić (2012). Distributional Semantics Approach to Detecting Synonyms in Croatian Language. Proceedings of the Eighth Language Technologies Conference, Ljubljana. Information Society. 111-116. [paper]
  • Tin Franović and Jan Šnajder (2012). Speech Act Based Classification of Email Messages in Croatian Language. In Proceedings of the Eighth Language Technologies Conference, Ljubljana. Information Society. 69-72. [paper] [slides]
  • Goran Glavaš, Mladen Karan, Frane Šarić, Jan Šnajder, Jure Mijić, Artur Šilić, Bojana Dalbelo Bašić (2012). CroNER: A State-of-the-Art Named Entity Recognition and Classification for Croatian. In Proceedings of the Eighth Language Technologies Conference, Ljubljana. Information Society. 73-78. [paper] [slides]
  • Mladen Marović, Jan Šnajder, Goran Glavaš (2012). Event and Temporal Relation Extraction from Croatian Newspaper Texts. In Proceedings of the Eighth Language Technologies Conference, Ljubljana. Information Society. 141-146. [paper]
  • Goran Glavaš, Jan Šnajder, Bojana Dalbelo Bašić (2012). Are You for Real? Learning Event Factuality in Croatian Texts. In Proceedings of the Conference on Data Mining and Data Warehouses (SiKDD 2012), Ljubljana. [paper] [slides]
  • Goran Glavaš, Jan Šnajder, Bojana Dalbelo Bašić (2012). Semi-Supervised Acquisition of Croatian Sentiment Lexicon. Lecture notes in Artificial Intelligence (Text, Speech and Dialogue, 15th International Conference, TSD 2012, Brno, Czech Republic, September 2012). 7499, 166-173. [data]
  • Hrvoje Peradin, Jan Šnajder, Bojana Dalbelo Bašić (2012). Towards a Constraint Grammar Based Morphological Tagger for Croatian. Lecture notes in Artificial Intelligence (Text, Speech and Dialogue, 15th International Conference, TSD 2012, Brno, Czech Republic, September 2012). 7499, 174-182.
  • Frane Šarić, Jan Šnajder, Bojana Dalbelo Bašić (2012). Optimizing Sentence Boundary Detection for Croatian. Lecture notes in Artificial Intelligence (Text, Speech and Dialogue, 15th International Conference, TSD 2012, Brno, Czech Republic, September 2012). 7499, 105-111. [data]
  • Goran Glavaš, Krešimir Fertalj, Jan Šnajder (2012). Syntax-based Requirements Analysis for Data-Driven Application Development. In Natural Language Processing and Information Systems, pp. 339-344. Springer Berlin Heidelberg.
  • Goran Glavaš, Jan Šnajder, Bojana Dalbelo Bašić (2012). Experiments on Hybrid Corpus-Based Sentiment Lexicon Acquisition. 13th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2012, 1-9. [paper] [data]
  • Mladen Karan, Jan Šnajder, Bojana Dalbelo Bašić Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian. Proceedings of Eighth International Conference on Language Resources and Evaluation, LREC 2012, 657-662. [paper]

2011

  • Domijan, Dražen; Dalbelo Bašić, Bojana Pattern recognition // International Encyclopedia of Statistical Science / Lovric, Miodrag (ur.).
    Berlin Heidelberg: Springer Verlag, 2011. str. 1056-1058
  • Dalbelo Bašić, Bojana Distance mesures // International Encyclopedia of Statistical Science / Lovric, Miodrag (ur.).
    Berlin Heidelberg: Springer Verlag, 2011. str. 397-398
  • Tomislav Lombarović, Jan Šnajder, Bojana Dalbelo Bašić (2011). Question Classification for a Croatian QA System. (2011) Lecture Notes in Artificial Intelligence (Third Int. Workshop on Balto-Slavonic Natural Language Processing), 6836, 403–410. [slides]
  • Vedrana Janković, Jan Šnajder, Bojana Dalbelo Bašić (2011). Random Indexing Distributional Semantic Models for Croatian Language. Lecture Notes in Artificial Intelligence (Third Int. Workshop on Balto-Slavonic Natural Language Processing), 6836, 411–418. [data]
  • Josip Saratlija, Jan Šnajder, Bojana Dalbelo Bašić (2011). Unsupervised Topic-Oriented Keyphrase Extraction and its Application to Croatian. Lecture Notes in Artificial Intelligence (14 th International Conference on Text, Speech and Dialogue), 6836, 340–347.

2010

  • Jan Šnajder and Bojana Dalbelo Bašić (2010). A Computational Model of Croatian Derivational Morphology. Proceedings of the Seventh International Conference on Formal Approaches to South Slavic and Balkan Languages, Zagreb, Croatian Language Technologies Society, 109–117. [paper]
  • Jure Mijić, Jan Šnajder, Bojana Dalbelo Bašić (2010). Robust Keyphrase Extraction for a Large-Scale Croatian News Production System. Proceedings of the Seventh International Conference on Formal Approaches to South Slavic and Balkan Languages, Zagreb, Croatian Language Technologies Society, 59–66. [paper]
  • Mladen Mikša, Jan Šnajder, Bojana Dalbelo Bašić (2010). Correcting Word Merge Errors in Croatian Texts. Proceedings of Seventh International Conference on Formal Approaches to South Slavic and Balkan Languages, Zagreb, Croatian Language Technologies Society, 67-75. [paper]
  • Mladen Marović, Mladen Mikša, Jan Šnajder, Bojana Dalbelo Bašić (2010). Croatian OCR Error Correction Using Character Confusions and Language Modelling. Proceedings of the 21st Central European Conference on Information and Intelligent Systems, 281–288. [paper]
  • Sanja Seljan, Marko Tadić, Željko Agić, Jan Šnajder, Bojana Dalbelo Bašić, Vjekoslav Osmann (2010). Corpus Aligner (CorAl) Evaluation on English- Croatian Parallel Corpora. Proceedings of the Seventh International Conference on Language Resources and Evaluation, Valletta: European Language Resources Association, 3481-3484. [paper]
  • Saša Petrović, Jan Šnajder, Bojana Dalbelo Bašić (2010). Extending Lexical Association Measures for Collocation Extraction. Computer Speech and Language, 24 (2), 383–394.

2009

  • Delač, D., Krleža, Z., Dalbelo Bašić, B., Šnajder, J., Šarić, F. (2009). TermeX: A Tool for Collocation Extraction. Lecture Notes in Computer Science (Computational Linguistics and Intelligent Text Processing), 5449, 149–157.
  • Šnajder, J., Dalbelo Bašić, B. (2009). String Distance-Based Stemming of the Highly Inflected Croatian Language. Proceedings of Recent Advances in Natural Language Processing (RANLP-2009), 411–415. [paper]
  • Šantić, N., Šnajder, J., Dalbelo Bašić, B. (2009). Automatic Diacritics Restoration in Croatian Texts. In The Future of Information Sciences, Digital Resources and Knowledge Sharing, 309–318. [paper]
  • Ahel, R., Dalbelo Bašić, B., Šnajder, J. (2009). Automatic Keyphrase Extraction from Croatian Newspaper Articles. Proceesings of The Future of Information Sciences, Digital Resources and Knowledge Sharing, 207-218. [paper]
  • Seljan, S; Dalbelo Bašić, B., Šnajder, J., Delač, D., Šamec-Gjurin, M., Crnec, D. (2009). Comparative Analysis of Automatic Term and Collocation Extraction. Proceedings of the 2nd international conference The future of information sciences (INFuture 2009). Digital resources and knowledge sharing, 219-228. [paper]
  • Čupić, M., Šnajder, J., Dalbelo Bašić, B. (2009). Post-test analysis of automatically generated multiple choice exams: a case study. Proceedings of ICL 2009, Vienna: International Association of Online Engineering (published electronically). [paper]
  • Šnajder, J., Dalbelo Bašić, B.; Tadić, M. (2009). Lexicon-Based Morphological Normalisation and its Aplication to Croatian Language. In Technologies for the Processing and Retrieval of Semi-Structured Documents: Experience from the CADIAL Project. Zagreb: Croatian Language Technologies Society, 23-80.

2008

  • Šnajder, J., Dalbelo Bašić, B. (2008). Higher-order Functional Representation of Croatian Inflectional Morphology. Proceedings of the Sixth International Conference on Formal Approaches to South Slavic and Balkan Languages, 121–130. [paper]
  • Šnajder, J., Dalbelo Bašić, B., Petrović, S., Sikirić, I. (2008). Evolving New Lexical Association Measures Using Genetic Programming. Proceedings of ACL-08: HLT, Short Papers, 181–184. [paper]
  • Mijić, J., Dalbelo Bašić, B., Šnajder, J. (2008). Building a Search Engine Model with Morphological Normalization Support. Proceedings of the ITI 2008 30th Int. Conf. on Information Technology Interfaces, Zagreb: SRCE, 619-624. [paper]
  • Šnajder, J., Čupić, M., Dalbelo Bašić, B., Petrović, S. (2008.) Enthusiast: An authoring tool for automatic generation of paper-and-pencil multiple-choice tests. In Proceedings of ICL 2008, Villach (published electronically).
  • Šnajder, J., Dalbelo Bašić, B., Tadić, M. (2008). Automatic Acquisition of Inflectional Lexica for Morphological Normalisation. Information Processing and Management, 44 (5), 1720–1731.
  • Malenica, M., Šmuc, T., Šnajder, J., Dalbelo Bašić, B. (2008). Language Morphology Offset: Text Classification on a Croatian-English Parallel Corpus. Information Processing and Management, 44 (1), 325–339.

2007 –

  • Petrović, S., Šnajder, J., Dalbelo Bašić, B., Kolar, M. (2006). Comparison of Collocation Extraction Measures for Document Indexing. Journal of Computing and Information Technology, 14 (4), 321–327. [paper]
  • Kolar, M., Vukmirović, I., Dalbelo Bašić, B., Šnajder, J. (2005). Computer-Aided Document Indexing System. Journal of Computing and Information Technology, 13 (4), 299–305. [paper]
  • Šilić, A., Šarić, F., Dalbelo Bašić, B., Šnajder, J. (2007). TMT: Object-Oriented Text Classification Library. Proceedings of the 29th International Conference on Information Technology Interfaces, Zagreb: SRCE, 559-566. [paper]
  • Šarić, F., Šnajder, J., Dalbelo Bašić, B., Eklić, H. (2005). Enhanced Thesaurus Terms Extraction for Document Indexing. Proceedings of the 27th International Conference on Information Technology Interfaces: ITI 2005, Zagreb, SRCE University Computing Centre, 227-232.
  • Ribarić, S., Šnajder, J. (2005). Mapping Petri Net-Based Temporal Knowledge Representation Scheme into CP-Net Model. Proceedins of 28th international convention MIPRO 2005, Rijeka, MIPRO, 134-139. [ps]
  • Čupić, M., Šnajder, J., Dalbelo Bašić, B. (2003). Educational Interactive Software as a Support to the Teaching of Artifical Neural Network Methodology Applied to a Classification Problem. In Proceedings of the 2nd International Conference on Multimedia and Information & Communication Technologies in Education (m-ICTE2003), Badajoz, 2003. 1975-1979.
  • Šnajder, J., Kovač, M., Dalbelo Bašić, B. (2001). Analiza vremenske redundancije i kompenzacija pokreta kod digitalnog videa korištenjem programskog sustava Mathematica. Prvi znanstveno-stručni skup Programski sustav Mathematica u znanosti, tehnologiji i obrazovanju: PrimMath 2001. Prirodoslovno-matematički fakultet, Matematički odjel, 2001. 267-285.