A parallel WordNet for English, Swedish and Bulgarian
Paper in proceedings, 2020

We present the parallel creation of a WordNet resource for Swedish and Bulgarian which is tightly aligned with the Princeton WordNet. The alignment is not only on the synset level but also on the word level, matching words with their closest translations in each language. We argue that this tighter alignment is essential for machine translation and natural language generation. About one-fifth of the lexical entries are also linked to the corresponding Wikipedia articles. In addition to the traditional semantic relations in WordNet, we also integrate morphological and morpho-syntactic information. The resource comes with a corpus in which examples from Princeton WordNet are translated into Swedish and Bulgarian. The examples are aligned at the word and phrase level. The new resource is open-source, and in its development we used only existing open-source resources.

WordNet

Translation

Wikipedia

Morphology

Sense-annotated Corpora

Authors

Krasimir Angelov

University of Gothenburg

Chalmers, Computer Science and Engineering, Functional Programming

LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings

3008-3015
9791095546344 (ISBN)

12th International Conference on Language Resources and Evaluation, LREC 2020
Marseille, France

Subject categories (SSIF 2025)

Language Processing and Computational Linguistics

Comparative Language Studies and General Linguistics

Studies of Specific Languages

More information

Last updated

2025-09-12