Graph Databases for Fast Queries in UD Treebanks
Paper in proceedings, 2025

We investigate whether labeled property graphs and graph databases can be a useful and efficient way of encoding UD treebanks to facilitate searching for complex syntactic phenomena. We give two alternative encodings of UD treebanks in the off-the-shelf graph database Neo4j, and show how to translate syntactic queries into the graph query language Cypher. Our evaluation shows that graph databases can improve query times by several orders of magnitude compared to existing approaches.

Authors

Niklas Deworetzki

University of Gothenburg

Chalmers, Computer Science and Engineering, Functional Programming

Peter Ljunglöf

Chalmers, Computer Science and Engineering, Functional Programming

University of Gothenburg

Proceedings of the 23rd International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2025)

32-43
979-8-89176-291-6 (ISBN)

23rd International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2025)
Ljubljana, Slovenia

Subject categories (SSIF 2025)

Language Processing and Computational Linguistics

Computer Science

More information

Last updated

2025-11-12