A parallel WordNet for English, Swedish and Bulgarian
Paper in proceedings, 2020

We present the parallel creation of a WordNet resource for Swedish and Bulgarian which is tightly aligned with the Princeton WordNet. The alignment holds not only at the synset level but also at the word level, matching words with their closest translations in each language. We argue that this tighter alignment is essential for machine translation and natural language generation. About one-fifth of the lexical entries are also linked to the corresponding Wikipedia articles. In addition to the traditional semantic relations in WordNet, we also integrate morphological and morpho-syntactic information. The resource comes with a corpus in which examples from the Princeton WordNet are translated into Swedish and Bulgarian. The examples are aligned at the word and phrase level. The new resource is open-source, and in its development we used only existing open-source resources.

WordNet

Translation

Wikipedia

Morphology

Sense-annotated Corpora

Author

Krasimir Angelov

University of Gothenburg

Chalmers, Computer Science and Engineering (Chalmers), Functional Programming

LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings

3008-3015
9791095546344 (ISBN)

12th International Conference on Language Resources and Evaluation, LREC 2020
Marseille, France

Subject Categories (SSIF 2025)

Natural Language Processing

Comparative Language Studies and Linguistics

Studies of Specific Languages

More information

Latest update

9/12/2025