Programming Techniques: Regular expression search algorithm

Article Properties
Abstract
Cite
Thompson, Ken. “Programming Techniques: Regular Expression Search Algorithm”. Communications of the ACM, vol. 11, no. 6, 1968, pp. 419-22, https://doi.org/10.1145/363347.363387.
Thompson, K. (1968). Programming Techniques: Regular expression search algorithm. Communications of the ACM, 11(6), 419-422. https://doi.org/10.1145/363347.363387
Thompson K. Programming Techniques: Regular expression search algorithm. Communications of the ACM. 1968;11(6):419-22.
Journal Categories
Science
Mathematics
Instruments and machines
Electronic computers
Computer science
Science
Mathematics
Instruments and machines
Electronic computers
Computer science
Computer software
Technology
Electrical engineering
Electronics
Nuclear engineering
Electronics
Computer engineering
Computer hardware
Description

How can computers efficiently locate specific patterns within vast amounts of text? This classic paper describes a method for locating character strings embedded in text using regular expressions and discusses an implementation of this method in the form of a compiler. It tackles the fundamental challenge of pattern matching in computer science. The compiler accepts a regular expression as source language and produces an IBM 7094 program as object language. The resulting object program then accepts the text to be searched as input and generates a signal every time an embedded string in the text matches the regular expression. The paper presents examples, problems, and solutions, making it a foundational resource for understanding regular expression search algorithms. This work has had a lasting impact on computer science, influencing the development of numerous tools and techniques for text processing and data analysis.

This article, published in Communications of the ACM, directly aligns with the journal's mission to present innovative techniques and algorithms for computer programming. The description of a regular expression search algorithm and its implementation as a compiler is relevant to the journal's audience of computer scientists and software developers.

Refrences
Citations
Citations Analysis
The first research to cite this article was titled SCRIPT, An On-Line Manuscript Processing System and was published in 1968. The most recent citation comes from a 2024 study titled SCRIPT, An On-Line Manuscript Processing System . This article reached its peak citation in 2021 , with 29 citations.It has been cited in 162 different journals, 14% of which are open access. Among related journals, the Theoretical Computer Science cited this research the most, with 22 citations. The chart below illustrates the annual citation trends for this article.
Citations used this article by year