Abstract syntax as interlingua: Scaling up the grammatical framework from controlled languages to robust pipelines
Journal article, 2020

Syntax is an interlingual representation used in compilers. Grammatical Framework (GF) applies the abstract syntax idea to natural languages. The development of GF started in 1998, first as a tool for controlled language implementations, where it has gained an established position in both academic and commercial projects. GF provides grammar resources for over 40 languages, enabling accurate generation and translation, as well as grammar engineering tools and components for mobile and Web applications. On the research side, the focus in the last ten years has been on scaling up GF to wide-coverage language processing. The concept of abstract syntax offers a unified view on many other approaches: Universal Dependencies, WordNets, FrameNets, Construction Grammars, and Abstract Meaning Representations. This makes it possible for GF to utilize data from the other approaches and to build robust pipelines. In return, GF can contribute to data-driven approaches by methods to transfer resources from one language to others, to augment data by rule-based generation, to check the consistency of hand-annotated corpora, and to pipe analyses into high-precision semantic back ends. This article gives an overview of the use of abstract syntax as interlingua through both established and emerging NLP applications involving GF.

Author

Aarne Ranta

University of Gothenburg

Krasimir Angelov

University of Gothenburg

N. Gruzitis

University of Latvia

Prasanth Kolachina

University of Gothenburg

Computational Linguistics

0891-2017 (ISSN) 1530-9312 (eISSN)

Vol. 46 2 425-486

Subject Categories

Language Technology (Computational Linguistics)

General Language Studies and Linguistics

Computer Systems

DOI

10.1162/COLI_a_00378

More information

Latest update

9/16/2021