Efficient householder QR factorization for superscalar processors

Article Properties

Language

English
DOI (url)

10.1145/275323.275326
Publication Date

1997/09/01
Journal

ACM Transactions on Mathematical Software
Indian UGC (Journal)
Refrences

14
James J. Carrig Johns Hopkins Univ., Baltimore, MD
Gerard G. L. Meyer Johns Hopkins Univ., Baltimore, MD

Abstract

Cite

Carrig, James J., and Gerard G. L. Meyer. “Efficient Householder QR Factorization for Superscalar Processors”. ACM Transactions on Mathematical Software, vol. 23, no. 3, 1997, pp. 362-78, https://doi.org/10.1145/275323.275326.

Carrig, J. J., & Meyer, G. G. L. (1997). Efficient householder QR factorization for superscalar processors. ACM Transactions on Mathematical Software, 23(3), 362-378. https://doi.org/10.1145/275323.275326

Carrig JJ, Meyer GGL. Efficient householder QR factorization for superscalar processors. ACM Transactions on Mathematical Software. 1997;23(3):362-78.

Journal Categories

Science

Mathematics

Instruments and machines

Electronic computers

Computer science

Science

Mathematics

Instruments and machines

Electronic computers

Computer science

Computer software

Technology

Electrical engineering

Electronics

Nuclear engineering

Electronics

Computer engineering

Computer hardware

Technology

Technology (General)

Industrial engineering

Management engineering

Applied mathematics

Quantitative methods

Description

Can algorithm design unlock the full potential of superscalar processors? This paper presents two parameterized Householder QR factorization algorithms tailored for the cache and register architectures common in superscalar processors. Algorithm designers must streamline memory references and allow for efficient data reuse throughout the memory hierarchy. Guidelines are developed for selecting parameter values that optimize cache and register utilization. The new algorithms are implemented and performance-tuned on diverse systems, including an Intel Pentium Pro, an IBM SP2 node, and a Silicon Graphics POWER Challenge XL processor. The results demonstrate the effectiveness of these algorithms in maximizing processor performance, offering valuable insights for numerical computation and linear algebra on modern computing platforms. By addressing the specific challenges of superscalar architectures, this research contributes to high-performance computing.

Published in ACM Transactions on Mathematical Software, this research directly aligns with the journal's focus on efficient and reliable mathematical algorithms. By presenting optimized algorithms for Householder QR factorization, the paper contributes to the advancement of numerical computation and linear algebra software.

Database	Last update
UGC	December 2024
DOAJ	December 2024
Crossref	May 2024