Space/time trade-offs in hash coding with allowable errors

Article Properties
Abstract
Cite
Bloom, Burton H. “Space/Time/Trade-Offs/in/Hash/Coding/With/Allowable/Errors”. Communications of the ACM, vol. 13, no. 7, 1970, pp. 422-6, https://doi.org/10.1145/362686.362692.
Bloom, B. H. (1970). Space/time trade-offs in hash coding with allowable errors. Communications of the ACM, 13(7), 422-426. https://doi.org/10.1145/362686.362692
Bloom BH. Space/time trade-offs in hash coding with allowable errors. Communications of the ACM. 1970;13(7):422-6.
Journal Categories
Science
Mathematics
Instruments and machines
Electronic computers
Computer science
Science
Mathematics
Instruments and machines
Electronic computers
Computer science
Computer software
Technology
Electrical engineering
Electronics
Nuclear engineering
Electronics
Computer engineering
Computer hardware
Description

Can sacrificing some accuracy improve hash coding efficiency? This paper explores the space/time trade-offs in hash coding, specifically addressing the problem of testing messages for membership in a given set. It introduces two new hash-coding methods and compares them with a conventional method, focusing on hash area size (space), reject time, and allowable error frequency. The new methods aim to reduce the space needed for hash-coded information by tolerating a small fraction of errors. This approach is particularly relevant in applications with large datasets where a core-resident hash area isn't feasible using traditional methods. Performance can be improved by using a smaller core-resident hash area combined with a secondary test to catch errors. Analysis reveals that allowing a small number of test messages to be falsely identified as members of the given set permits a much smaller hash area to be used without increasing reject time. This trade-off can significantly improve overall performance in space-constrained applications, opening avenues for more efficient data management.

Published in Communications of the ACM, this paper aligns with the journal's focus on innovative approaches to computer science problems. By exploring space/time trade-offs in hash coding and introducing new methods to improve efficiency, the study contributes to the advancement of data management techniques relevant to the journal's readership.

Refrences
Citations
Citations Analysis
The first research to cite this article was titled Adaptive correction of program statements and was published in 1973. The most recent citation comes from a 2024 study titled Adaptive correction of program statements . This article reached its peak citation in 2022 , with 163 citations.It has been cited in 483 different journals, 16% of which are open access. Among related journals, the IEEE/ACM Transactions on Networking cited this research the most, with 71 citations. The chart below illustrates the annual citation trends for this article.
Citations used this article by year