Space/time trade-offs in hash coding with allowable errors

Article Properties

Language

English
DOI (url)

10.1145/362686.362692
Publication Date

1970/07/01
Journal

Communications of the ACM
Indian UGC (Journal)
Refrences

3
Citations

2,025
Burton H. Bloom Computer Usage Company, Newton Upper Falls, MA

Abstract

Cite

Bloom, Burton H. “Space/Time/Trade-Offs/in/Hash/Coding/With/Allowable/Errors”. Communications of the ACM, vol. 13, no. 7, 1970, pp. 422-6, https://doi.org/10.1145/362686.362692.

Bloom, B. H. (1970). Space/time trade-offs in hash coding with allowable errors. Communications of the ACM, 13(7), 422-426. https://doi.org/10.1145/362686.362692

Bloom BH. Space/time trade-offs in hash coding with allowable errors. Communications of the ACM. 1970;13(7):422-6.

Journal Categories

Science

Mathematics

Instruments and machines

Electronic computers

Computer science

Science

Mathematics

Instruments and machines

Electronic computers

Computer science

Computer software

Technology

Electrical engineering

Electronics

Nuclear engineering

Electronics

Computer engineering

Computer hardware

Description

Can sacrificing some accuracy improve hash coding efficiency? This paper explores the space/time trade-offs in hash coding, specifically addressing the problem of testing messages for membership in a given set. It introduces two new hash-coding methods and compares them with a conventional method, focusing on hash area size (space), reject time, and allowable error frequency. The new methods aim to reduce the space needed for hash-coded information by tolerating a small fraction of errors. This approach is particularly relevant in applications with large datasets where a core-resident hash area isn't feasible using traditional methods. Performance can be improved by using a smaller core-resident hash area combined with a secondary test to catch errors. Analysis reveals that allowing a small number of test messages to be falsely identified as members of the given set permits a much smaller hash area to be used without increasing reject time. This trade-off can significantly improve overall performance in space-constrained applications, opening avenues for more efficient data management.

Published in Communications of the ACM, this paper aligns with the journal's focus on innovative approaches to computer science problems. By exploring space/time trade-offs in hash coding and introducing new methods to improve efficiency, the study contributes to the advancement of data management techniques relevant to the journal's readership.

Category	Category Repetition
Science: Mathematics: Instruments and machines: Electronic computers. Computer science	1,110
Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware	667
Science: Science (General): Cybernetics: Information theory	651
Technology: Electrical engineering. Electronics. Nuclear engineering: Telecommunication	546
Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software	527

Space/time trade-offs in hash coding with allowable errors

Article Properties

Abstract

Cite

Journal Categories

You May Also Like

Description

Refrences

Citations

Citations Analysis

Citations used this article by year

Database	Last update
UGC	December 2024
DOAJ	December 2024
Crossref	May 2024