EN BG

Assist. prof. Dr Tsvetana Dimitrova



Senior assistant professor
Department of Computational Linguistics

Phone: 02 9792971

Email: cvetana@dcl.bas.bg

Address:
Institute for Bulgarian Language,
52 Shipchenski Prohod St., Sofia 1113
Room 602

Interests

  • Theoretical linguistics,
  • corpus linguistics,
  • syntax,
  • diachronic syntax,
  • history of Bulgarian language,
  • language change,
  • Old Church Slavonic,
  • computer dictionaries,
  • lexical-semantic networks,
  • linguistic annotation

Achievements

Award “Prof. Marin Drinov” for Young Scholar in the Humanities (2011)

Education and experience

2010 – ongoing: Assistant Professor, Department of Computational Linguistics, Institute for Bulgarian Language, Bulgarian Academy of Sciences

2008 – 2010: Researcher, Department of Computational Linguistics, Institute for Bulgarian Language, Bulgarian Academy of Sciences

2002 – 2008: Ph.D. in Linguistics, Norwegian University of Science and Technololgy (NTNU)

1996 – 2001: М.А. in Bulgarian Philology, Faculty of Slavic Studies, Sofia University

Research projects

Enriched Databases for Bulgarian and Romanian (2015 – 2017), bilateral research project, the Institute for Bulgarian Language of the Bulgarian Academy of Sciences, and the Research Institute for Artificial Intelligence of the Romaian Academy, member of the research team

Bulgarian National Corpus (2010 – ongoing), Institute for Bulgarian Language, Bulgarian Academy of Sciences, member of the research team

Bulgarian Wordnet (BulNet): Lexical-Semantic Network of Bulgarian Language (2010 – ongoing), Institute for Bulgarian Language, Bulgarian Academy of Sciences, member of the research team

Named Entity Recognition for Bulgarian and Czech (2014 – 2016), , bilateral research project, the Institute for Bulgarian Language of the Bulgarian Academy of Sciences, and the Institute for Czech Language of the Czech Academy of Sciences, head of the Bulgarian team

Computer-Аssisted Description of the Old Bulgarian Lexica for an e-Based Derivational Dictionary of Old Bulgarian (2012 – 2013), Sofia University, funded by the National Science Fund (ДМУ 0313/16.12.2011), head of the project

Parsing and multi-word expressions. Towards linguistic precision and computational efficiency in natural language processing (PARSEME) (2014 – 2017), Information and Communication Technologies COST Action IC1207, member of the research team

From Lexical-Semantic Networks to Knowledge Bases: Enrichment of the Bulgarian WordNet and the Romanian WordNet with morpho-semantic information (2012 – 2014), bilateral research project, the Institute for Bulgarian Language of the Bulgarian Academy of Sciences, and the Research Institute for

Artificial Intelligence of the Romaian Academy, member of the research team

CESAR: CEntral and South-east europeAn Resources (2011 – 2013), member of the research team

The Tenth-Century Cyrillic Manuscript Codex Suprasliensis: the creation of an electronic corpus. UNESCO project (2010–2011), project of the Institute for Literature of the Bulgarian Academy of Sciences, funded by UNESCO, member of the research team

ICT Tools for Diachronic Linguistic Studies (2009-2011), Sofia University, member of the research team

Pragmatic Resources in Old Indo-European Languages (PROIEL), University of Oslo (2008 – 2012), annotator

Bulgarian Sense-Annotated Corpus (2008-2010), Institute for Bulgarian Language, Bulgarian Academy of Sciences, member of the research team

Membership in scientific organizations and research groups

International Committee of Slavists, Commission on Computer Supported Processing of Mediaeval Slavonic Manuscripts and Early Printed Books, member (2010 – ongoing)

Membership in orgnization and programme committees

The Annual Meeting of the Association for Computational Linguistics 2013, Sofia, Bulgaria, August 4 – 9, 2013, member of the organizing committee

The First International Conference on Computational Linguistics in Bulgaria (2014), Sofia, Bulgaria, September 4, 2014, member of the organizing committee

Computing in Humanities Workshop, Sofia, Bulgaria, April 8 – 9, 2015, member of the organizing committee

Global Wordnet Conference 2016), Bucharest, Romania, January 27 – 30, 2016, member of the programme committee

The Second International Conference on Computational Linguistics in Bulgaria (2016), Sofia, Bulgarian, September 9, 2016, member of the organizing committee

Paisii 2016: Scientific Session “Current Trends in Linguistics”, dedicated to the 85-anniversary of Prof. Iordan Penchev, Plovdiv, Bulgaria, November 10 – 11, 2016, member of the organizing committee

Annual International Conference of the Institute for Bulgarian Language, Sofia, Bulgaria, May 15 – 16, 2017, member of the organizing committee

Monographs

Dimitrova, T. The Old Bulgarian Noun Phrase: Towards an Annotation Specification. Doktoravhandlinger ved NTNU: 2008:99. Trondheim: Norwegian University of Science and Technology, 2008, 270 p. ISBN: 978-824-717-989-5. [С корекции: Dimitrova, T. The Old Bulgarian Noun Phrase. Saarbruecken: VDM Verlag, 2011, 316 p. ISBN: 978-363-934-362-5.]

Chapters in edited volumes

Търпоманова, Ек., Цв. Димитрова. Анотиране на паралелни многоезикови корпуси: Българско-английският паралелел корпус със съотнесени (прости) изречения (BulEnAC). [Annotation of Parallel Multilingual Corpora: Bulgarian-English Sentence- and Clause-Aligned Corpus] – В: Езикови ресурси и технологии за български език. София: Академично издателство „Проф. Марин Дринов“, 2014, с. 105 – 126. ISBN: 978-954-322-797-6.

Димитрова, Цв. Лингвистични конвенции при анотация на затворените класове. [Linguistic Conventions in Annotation of Closed Parts-of-Speech] – В: Българският семантично анотиран корпус. София: Институт за български език „Проф. Любомир Андрейчин“, 2010, с. 141 – 156. ISBN: 978-954-779-124-4.

Articles in journals

Димитрова, Цв. Наблюдения върху местоименните клитики в историята на българския език. [Observations on the Pronominal Clitics in the History of Bulgarian Language] – Известия на Института за български език, XXIX, Издателство на БАН „Проф. Марин Дринов“, 2016, с. 90 – 106. ISSN: 0323-9934.

Стефанова, В., Цв. Димитрова. Прилагателното име в Българския уърднет. [Adjectives in the Bulgarian WordNet] – Български език, кн. 4, 2016, с. 1 – 15. ISSN: 0005-4283.

Коева, Св., Д. Благоева, С. Колковска, Цв. Димитрова, Ив. Стоянова, Св. Лесева. Българският национален корпус в контекста на съвременната лингвистика. [The Bulgarian National Corpus in the Context of Contemporary Linguistics] – Български език, кн. 3, 2015, с. 102 – 119. ISSN: 0005-4283.

Krapova, I., T. Dimitrova. The Genitive-Dative syncretism in the history of Bulgarian. Towards an analysis. – Studi Slavistici, XII, 2015, pp. 181 – 208. ISSN:1824-761X (print); 1824-7601 (online). SJR:0.127.

Коева, Св., Ив. Стоянова, Цв. Димитрова, Св. Лесева. Традиции и новаторство в корпусната лингвистика: Българският национален корпус. [Tradition and Innovation in Corpus Linguistics: the Bulgarian National Corpus] – Списание на Българската академия на науките, кн. 6, 2012, с. 34 – 39. ISSN: 0007-3989.

Dimitrova, T. Computer-Аssisted Description of the Old Bulgarian Lexica Computer-Аssisted Description of the Old Bulgarian Lexica for an e-Based Derivational Dictionary of Old Bulgarian. – Littera et Lingua, Autumn 2012, p. 1 – 10. ISSN: 1312-6172.

Koeva, S., I. Stoyanova, S. Leseva, T. Dimitrova, R. Dekova, E. Tarpomanova. The Bulgarian National Corpus: Theory and Practice in Corpus Design. – Journal of Language Modelling, 1, 2012, pp. 65 – 110. ISSN: 2299-8470, DOI://dx.doi.org/10.15398/jlm.v0i1.33.

Димитрова, Цв. Диахронните корпуси: Подготвителна фаза. [Diachronic Corpora: Preliminary Stage] – Български език, кн. 3, 2011, с. 119 – 130. ISSN:0005-4283.

Димитрова, Цв. Проблеми на лингвистичната анотация в диахронните корпуси. [Issues in the Linguistic Annotation of Diachronic Corpora] – Български език, кн. 1, 2010, с. 23 – 36. ISSN: 0005-4283.

Димитрова, Цв. Двуличното като.[Double-Faced kato] – Български език, кн. 3, 2009, с. 149 – 151. ISSN:0005-4283.

Bojadžiev, А., T. Dimitrova. Linguistic Information in the Electronic Corpus of Old Slavic Texts. – Scripta & e-Scripta, 6, 2008, pp. 105 – 151. ISSN: 1312-238X.

Conference papers

Dimitrova, T., V. Stefanova. Adjectives in WordNet: Semantic Issues. – In: Proceedings of the 12th International Conference Linguistics Resources and Tools for Processing the Romanian Language (ConsILR-2016). Iași: Faculty for Computer Science, Alexandru Ioan Cuza University, 2016, pp. 131 – 141. ISSN:1843-911X.

Koeva, S., I. Stoyanova, M. Todorova, S. Leseva, T. Dimitrova. Metadata Extraction, Representation and Management within the Bulgarian National Corpus. – In: 4th Workshop on Challenges in the Management of Large Corpora Workshop Programme. ELDA, 2016, pp. 33 – 39.

Babru Mititelu, V., B. Rizov, E. Tarpomanova, S. Leseva, T. Dimitrova. Noun-Verb Derivation in the Bulgarian, Romanian and English Wordnets – a Comparative Approach. – In: Proceedings of the 11th International Conference Linguistics Resources and Tools for Processing the Romanian Language (ConsILR-2015). Iași: Faculty for Computer Science, Alexandru Ioan Cuza University, 2015, pp. 53 – 64. ISSN: 1843-911X.

Dimitrova, T., A. Bojadziev. Historical Corpora of Bulgarian Language and Second Position Markers. – In: Proceedings of the First International Conference Computational Linguistics in Bulgaria. Sofia: Institute for Bulgarian Language, 2014, pp. 55 – 63. ISSN: 2367-5578.

Dimitrova, T., E. Tarpomanova, B. Rizov. Coping with Derivation in the Bulgarian Wordnet.. – In: Proceedings of the Seventh Global WordNet Conference. Tartu: University of Tartu Press, 2014, pp. 109 – 117. ISBN: 978-994-932-492-7.

Koeva, S., B. Rizov, E. Tarpomanova, T. Dimitrova, R. Dekova, I. Stoyanova, S. Leseva, H. Kukova, A. Genov. Bulgarian-English Sentence- and Clause-Aligned Corpus. – In: Proceedings of the Second Workshop on Annotation of Corpora for Research in the Humanities (ACRH-2), Lisbon: Edicoes Colibri, 2012, ISBN:978-989-689-273-9.

Eckhoff, H. M., D. J. Birnbaum, A. Miltenova, T. Dimitrova. The Tenth- Century Cyrillic Manuscript Codex Suprasliensis: the creation of an electronic corpus UNESCO project (2010–2011). – In: Proceedings of the Workshop on Language Technologies for Digital Humanities and Cultural Heritage associated with The 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 2011, pp. 57 – 61. ISBN:978-954-452-019-9.

Dimitrova-Vulchanova, M., V. Vulchanov, T. Dimitrova. Issues of Pos-annotation of Old Bulgarian Texts. – In: Computer Applications in Slavic Studies. Proceedings of Azbuky.Net, International Conference and Workshop, 24-27 October 2005, Sofia, Bulgaria. Sofia: Boyan Penev Publishing Center, 2006, pp. 245 – 262. ISBN: 978-954-871-241-5.

Koeva, S., S. Leseva, B. Rizov, E. Tarpomanova, T. Dimitrova, H. Kukova, M. Todorova. Design and development of the Bulgarian Sense-Annotated Corpus. – In: Proceedings of the Third International Corpus Linguistics Conference (CILC), 7-9 April 2011, Valencia, Spain. Valencia: Universitat Politecnica de Valencia, 2011, pp. 143 – 150. ISBN: 978-846-946-225-6.

Koeva, S., S. Leseva, E. Tarpomanova, B. Rizov, T. Dimitrova, H. Kukova. Bulgarian Sense-annotated Corpus – Results and Achievements. – In: Proceedings of the 7th International Conference of Formal Approaches to South Slavic and Balkan Languages (FASSBL-7), 4-6 October 2010, Dubrovnik, Croatia 2010, pp. 41 – 48. ISBN: 978-953-553-752-6.

Koeva, S., S. Leseva, I. Stoyanova, R. Dekova, A. Genov, B. Rizov, T. Dimitrova, E. Tarpomanova, H. Kukova. Application of Clause Alignment for Statistical Machine Translation. – In: Proceedings of SSST-6: Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL 2012 / SIGMT / SIGLEX Workshop, Jeju, Korea. Association of Computational Linguistics, 2012, pp. 102 – 111. ISBN: 978-193-728-438-1.

Koeva, S., S. Leseva, I. Stoyanova, T. Dimitrova, M. Todorova. Automatic Prediction of Morphosemantic Relations. – In: Proceedings of the Eighth Global Wordnet Conference. Bucharest: Research Institute for Artificial Intelligence, Romanian Academy, 2016, pp. 168 – 176. ISBN: 978-973-020-728-6.

Koeva, S., T. Dimitrova. Rule-based Person Named Entity Recognition for Bulgarian. – In: Slavic Languages in the Perspective of Formal Grammar (Proceedings of FDSL 10.5, Brno 2014), Series Linguistik International, vol. 37, Peter Lang, 2015, pp. 121 – 139. ISBN: 978-363-166-251-9.

Leseva, S., M. Todorova, T. Dimitrova, B. Rizov, I. Stoyanova, S. Koeva. Automatic Classification of Wordnet Morphosemantic Relations. – In: Proceedings of the 5th Workshop on Balto-Slavic Natural Language Processing, The International Conference Recent Advances in Natural Language Processing (RANLP) 2015, 2015, pp. 59 – 64. ISBN: 978-954-452-033-5.

Rizov, B., T. Dimitrova, V. Barbu Mititelu. Hydra for Web: A Multilingual Wordnet Viewer. – In: Proceedings of the 11th International Conference Linguistics Resources and Tools for Processing the Romanian Language (ConsILR-2015). Iași: Faculty for Computer Science, Alexandru Ioan Cuza University, 2015, pp. 19 – 30. ISSN:1843-911X.

Rizov, B., T. Dimitrova. Hydra for Web: A Browser for Easy Access to Wordnets. – In: Proceedings of the Eighth Global Wordnet Conference. Bucharest: Research Institute for Artificial Intelligence, Romanian Academy, 2016, pp. 339 – 343. ISBN: 978-973-020-728-6.

Tarpomanova, E., S. Leseva, М. Todorova, T. Dimitrova, B. Rizov, V. Barbu Mititelu, E. Irimia. Noun-Verb Derivation in the Bulgarian and the Romanian WordNet – A Comparative Approach. – In: Proceedings of the First International Conference Computational Linguistics in Bulgaria. Sofia: Institute for Bulgarian Language, 2014, pp. 23 – 31. ISSN: 2367-5578.

Димитрова, Цв., А. Бояджиев. Сегментация на диахронните корпуси. [Segmentation of Diachronic Corpora] – В: Сборник доклади от заключителната конференция на проект „Компютърни и интерактивни средства за исторически езиковедски изследвания“. София: ГРАФИС – Ал. Жеков, 2011, с. 96 – 106. ISBN: 978-954-914-773-5.

Copyright © 2015 Department of computational linguistics. All rights reserved.