Multilingual Text Generation for Abstract Wikipedia in Grammatical Framework: Prospects and Challenges
Paper i proceeding, 2023

Abstract Wikipedia is an initiative to produce Wikipedia articles from abstract knowledge representations with multilingual natural language generation (NLG) algorithms. Its goal is to make encyclopaedic content available with equal coverage in the languages of the world. This paper discusses the issues related to the project in terms of an experimental implementation in Grammatical Framework (GF). It shows how multilingual NLG can be organized into different abstraction levels that enable the sharing of code across languages and the division of labour between programmers and authors with different skill requirements. The plan is to start with a simple but functional multilingual NLG system and to proceed towards more and more sophisticated language and wider coverage of topics, also allowing a human in the loop to create content via a Controlled Natural Language (CNL).

Natural language generation

Grammatical framework

Text robots

Wikipedia

Wikidata

Abstract wikipedia

Controlled natural language

Författare

Aarne Ranta

Göteborgs universitet

Chalmers, Data- och informationsteknik, Computing Science

Studies in Computational Intelligence

1860-949X (ISSN) 1860-9503 (eISSN)

Vol. 1081 125-149
9783031217791 (ISBN)

Symposium on Logic and Algorithms in Computational Linguistics, LACompLing 2021
Virtual, Online, ,

Ämneskategorier (SSIF 2025)

Språkbehandling och datorlingvistik

Datavetenskap (datalogi)

DOI

10.1007/978-3-031-21780-7_6

Mer information

Senast uppdaterat

2025-11-26