An optimization problem related to Bloom filters with bit patterns
Paper in proceeding, 2018

Bloom filters are hash-based data structures for membership queries without false negatives widely used across many application domains.They also have become a central data structure in bioinformatics. In genomics applications and DNA sequencing the number of items and number of queries are frequently measured in the hundreds of billions. Consequently, issues of cache behavior and hash function overhead become a pressing issue. Blocked Bloom filters with bit patterns offer a variant that can better cope with cache misses and reduce the amount of hashing. In this work we state an optimization problem concerning the minimum false positive rate for given numbers of memory bits, stored elements, and patterns. The aim is to initiate the study of pattern designs best suited for the use in Bloom filters. We provide partial results about the structure of optimal solutions and a link to two-stage group testing.

Bloom filter

group testing

genomics

antichain

almost disjunct matrix

Author

Peter Damaschke

Chalmers, Computer Science and Engineering (Chalmers), Data Science

Alexander Schliep

University of Gothenburg

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

03029743 (ISSN) 16113349 (eISSN)

Vol. 10706 LNCS 525-538
978-3-319-73116-2 (ISBN)

44th International Conference on Current Trends in Theory and Practice of Computer Science SOFSEM 2018
Krems an der Donau, Austria,

Areas of Advance

Information and Communication Technology

Roots

Basic sciences

Subject Categories

Computer Science

DOI

10.1007/978-3-319-73117-9_37

More information

Latest update

10/7/2024