|
General
The Bulgarian wordnet database has 21 444 synonym sets, distributed into four parts of speech - nouns, verbs, adjectives and adverbs. Each synonym set is supplied with an explanatory definition which represents the common referential meaning of all its members. The synonym sets are linked to each other by means of a number of semantic, morpho-semantic and extralinguistic relations that hold between the words in a language. The Bulgarian database has been integrated into the network of the Balkan languages BalkaNet and the previously developed network of the European languages EuroWordNet through unique interlingual indexes (ILIs) marking unambiguously the counterparts in the different languages.
The main goals of the project are:
The further expansion of the Bulgarian WordNet - an electronic system for representation of the semantic relations in the language with new 15 000 synsets;
The elaboration of word-sense annotated training corpus of Bulgarian
Design and implementation of a computer system for automatic word sense disambiguation (WSD)
Dissemination of the project's results
|