Bulgarian National Corpus « Секция по компютърна лингвистика

The Bulgarian National corpus is created at the Institute for Bulgarian Language „Prof. L. Andreychin” by research associates from the Department of Computational Linguistics and the Department of Bulgarian Lexicology and Lexicography. It incorporates several individual electronic corpora, developed in the period 2001-2009 for the purposes of the two departments. The corpus is constantly enlarged with new texts.

The Bulgarian National corpus consists of a monolingual (Bulgarian) part and 47 parallel corpora. The Bulgarian part includes about 1.2 billion words in over 240 000 text samples. The materials in the Corpus reflect the state of the Bulgarian language (mainly in its written form) from the middle of 20th century (1945) until present.

HTML Meta Tag

Bulgarian National Corpus

Bulgarian WordNet

Multilingual Image Corpus

Bulgarian National Corpus

Dictionary of Bulgarian Language, online implementation by DCL

META-SHARE – network of repositories of language data, tools and related web services

System for business intelligence, language resources provided by DCL.