|
ABSTRACT
A two-stage blocked algorithm for reduction of a regular matrix pair (A , B ) to upper Hessenberg-triangular form is presented. In stage 1 (A, B is reduced to block upper Hessenberg-triangular form using mainly level 3 (matrix-matrix) operations that permit data reuse in the higher levels of a memory hierarchy. In the second stage all but one of the r subdiagonals of the block Hessenberg A-part are set to zero using Givens rotations. The algorithm proceeds in a sequence of supersweeps, each reducing m columns. The updates with respect to row and column rotations are organized to reference consecutive columns of A and B. To further improve the data locality, all rotations produced in a supersweep are stored to enable a left-looking reference pattern, i.e., all updates are delayed until they are required for the continuation of the supersweep. Moreover, we present a blocked variant of the single-diagonal double-shift QZ method for computing the generalized Schur form of (A, B in upper Hessenberg-triangular form. The blocking for improved data locality is done similarly, now by restructuring the reference pattern of the updates associated with the bulge chasing in the QZ iteration. Timing results show that our new blocked variants outperform the current LAPACK routines, including drivers for the generalized eigenvalue problem, by a factor 2-5 for sufficiently large problems.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
E. Anderson , Z. Bai , C. Bischof , J. Demmel , J. Dongarra , J. Du Croz , A. Greenbaum , S. Hammarling , A. McKenney , S. Ostrouchov , D. Sorensen, LAPACK's user's guide, Society for Industrial and Applied Mathematics, Philadelphia, PA, 1992
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
| |
5
|
|
| |
6
|
BISCHOF, C. H., LANG, B., AND SUN, X. 1996. A framework for symmetric band reduction. Tech. Rep., Preprint BUGHW-SC-96/3. Fachbereich Mathematik, Bergische Universit t, Wuppertal, Germany.
|
| |
7
|
Jack J. Dongarra , L. S. Blackford , J. Choi , A. Cleary , E. D'Azeuedo , J. Demmel , I. Dhillon , S. Hammarling , G. Henry , A. Petitet , K. Stanley , D. Walker , R. C. Whaley, ScaLAPACK user's guide, Society for Industrial and Applied Mathematics, Philadelphia, PA, 1997
|
| |
8
|
DACKLAND, K. 1998. Parallel reduction of a regular matrix pair to block Hessenbergtriangular form--Algorithm design and performance modeling. Tech. Rep. UMINF-98.09. Department of Computing Science, Ume University, Ume , Sweden.
|
| |
9
|
|
| |
10
|
|
| |
11
|
|
 |
12
|
|
| |
13
|
DUBRULLE, A.A. 1991. The multishift QR algorithm: Is it worth the trouble?. Tech. Rep. G320-3558+. Palo Alto Scientific Center, IBM Corp., Palo Alto, CA.
|
| |
14
|
DUBRULLE, A. A. AND GOLUB, G. H. 1994. A multishift QR algorithm without computation of the shifts. Num. Alg. 7, 173-181.
|
| |
15
|
ENRIGHT, W. AND SERBIN, S. 1978. A note on the efficient solution of matrix pencil systems. BIT 18, 276-281.
|
| |
16
|
|
| |
17
|
|
 |
18
|
|
 |
19
|
|
| |
20
|
|
| |
21
|
LANG, B. 1998a. Efficient eigenvalue and singular value computations on shared memory machines. Tech. Rep. BUGHW-SC 98/4. Fachbereich Mathematik, Bergische Universit t, Wuppertal, Germany.
|
| |
22
|
|
| |
23
|
MOLER, C. B. AND STEWART, G.W. 1973. An algorithm for generalized matrix eigenvalue problems. SIAM J. Numer. Anal. 10, 2, 241-256.
|
| |
24
|
|
| |
25
|
|
| |
26
|
|
| |
27
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE conference on Design automation
Gwo-Dong Chen
, Daniel D. Gajski
|