Graph Databases for Fast Queries in UD Treebanks
Paper in proceeding, 2025

We investigate if labeled property graphs, and graph databases, can be an useful and efficient way of encoding UD treebanks, to facilitate searching for complex syntactic phenomena. We give two alternative encodings of UD treebanks into the off-the-shelf graph database Neo4j, and show how to translate syntactic queries into the graph query language Cypher. Our evaluation shows that graph databases can improve query times by several orders of magnitude, compared to existing approaches.

Author

Niklas Deworetzki

University of Gothenburg

Chalmers, Computer Science and Engineering (Chalmers), Functional Programming

Peter Ljunglöf

Chalmers, Computer Science and Engineering (Chalmers), Functional Programming

University of Gothenburg

Proceedings of the 23rd International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2025)

32-43
979-8-89176-291-6 (ISBN)

23rd International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2025)
Ljubljana, Slovenia,

Subject Categories (SSIF 2025)

Natural Language Processing

Computer Sciences

More information

Latest update

11/12/2025