Abstract syntax as interlingua: Scaling up the grammatical framework from controlled languages to robust pipelines
Artikel i vetenskaplig tidskrift, 2020

Syntax is an interlingual representation used in compilers. Grammatical Framework (GF) applies the abstract syntax idea to natural languages. The development of GF started in 1998, first as a tool for controlled language implementations, where it has gained an established position in both academic and commercial projects. GF provides grammar resources for over 40 languages, enabling accurate generation and translation, as well as grammar engineering tools and components for mobile and Web applications. On the research side, the focus in the last ten years has been on scaling up GF to wide-coverage language processing. The concept of abstract syntax offers a unified view on many other approaches: Universal Dependencies, WordNets, FrameNets, Construction Grammars, and Abstract Meaning Representations. This makes it possible for GF to utilize data from the other approaches and to build robust pipelines. In return, GF can contribute to data-driven approaches by methods to transfer resources from one language to others, to augment data by rule-based generation, to check the consistency of hand-annotated corpora, and to pipe analyses into high-precision semantic back ends. This article gives an overview of the use of abstract syntax as interlingua through both established and emerging NLP applications involving GF.

Författare

Aarne Ranta

Göteborgs universitet

Krasimir Angelov

Göteborgs universitet

N. Gruzitis

Latvijas Universitate

Prasanth Kolachina

Göteborgs universitet

Computational Linguistics

0891-2017 (ISSN) 1530-9312 (eISSN)

Vol. 46 2 425-486

Ämneskategorier (SSIF 2011)

Språkteknologi (språkvetenskaplig databehandling)

Jämförande språkvetenskap och allmän lingvistik

Datorsystem

DOI

10.1162/COLI_a_00378

Mer information

Senast uppdaterat

2021-09-16