Efficient corpus search using unary and binary indexes
Paper in proceeding, 2022

We investigate how disk-based inverted indexes can be used for efficient searching in large annotated corpora. We give a formal semantics for simple corpus queries, and show how they can be translated into lookups in unary and binary indexes.

Author

Peter Ljunglöf

Chalmers, Computer Science and Engineering (Chalmers), Functional Programming

University of Gothenburg

Nicholas Smallbone

University of Gothenburg

Chalmers, Computer Science and Engineering (Chalmers), Functional Programming

Mijo Thoresson

Student at Chalmers

Victor Salomonsson

Student at Chalmers

20th Conference on Natural Language Processing, KONVENS 2024 - Proceedings of the Conference

149-158

20th Conference on Natural Language Processing (KONVENS 2024)
Wien, Austria,

Subject Categories (SSIF 2025)

Natural Language Processing

More information

Latest update

6/27/2025