Towards an Algebraic Approach for Corpus Queries
Övrigt konferensbidrag, 2024

Analysis of text corpora involves the use of specialised corpus search tools, capable of handling huge amounts of annotated text. The extent to which these tools apply optimisations to reduce query execution times is as diverse as the tools themselves. We argue that the development of a corpus algebra, similar to relational algebra in relational database systems, is a valuable foundation to improve corpus query optimisation. We demonstrate a query optimisation approach based on algebraic transformations, which vastly reduces query execution times.

Författare

Niklas Deworetzki

Chalmers, Data- och informationsteknik, Funktionell programmering

Göteborgs universitet

Peter Ljunglöf

Chalmers, Data- och informationsteknik, Funktionell programmering

Göteborgs universitet

Nicholas Smallbone

Chalmers, Data- och informationsteknik, Funktionell programmering

Göteborgs universitet

Swedish Language Technology Conference
, ,

Ämneskategorier (SSIF 2025)

Språkbehandling och datorlingvistik

Datavetenskap (datalogi)

Mer information

Senast uppdaterat

2025-07-02