ACM Home Page
Please provide us with feedback. Feedback
Blocked algorithms and software for reduction of a regular matrix pair to generalized Schur form
Full text PdfPdf (391 KB)
Source ACM Transactions on Mathematical Software (TOMS) archive
Volume 25 ,  Issue 4  (December 1999) table of contents
Pages: 425 - 454  
Year of Publication: 1999
ISSN:0098-3500
Authors
Krister Dackland  Umeå Univ., Umeå Sweden
Bo Kågström  Umeå Univ., Umeå Sweden
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 1,   Downloads (12 Months): 49,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/332242.332244
What is a DOI?

ABSTRACT

A two-stage blocked algorithm for reduction of a regular matrix pair (A , B ) to upper Hessenberg-triangular form is presented. In stage 1 (A, B is reduced to block upper Hessenberg-triangular form using mainly level 3 (matrix-matrix) operations that permit data reuse in the higher levels of a memory hierarchy. In the second stage all but one of the r subdiagonals of the block Hessenberg A-part are set to zero using Givens rotations. The algorithm proceeds in a sequence of supersweeps, each reducing m columns. The updates with respect to row and column rotations are organized to reference consecutive columns of A and B. To further improve the data locality, all rotations produced in a supersweep are stored to enable a left-looking reference pattern, i.e., all updates are delayed until they are required for the continuation of the supersweep. Moreover, we present a blocked variant of the single-diagonal double-shift QZ method for computing the generalized Schur form of (A, B in upper Hessenberg-triangular form. The blocking for improved data locality is done similarly, now by restructuring the reference pattern of the updates associated with the bulge chasing in the QZ iteration. Timing results show that our new blocked variants outperform the current LAPACK routines, including drivers for the generalized eigenvalue problem, by a factor 2-5 for sufficiently large problems.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
 
5
 
6
BISCHOF, C. H., LANG, B., AND SUN, X. 1996. A framework for symmetric band reduction. Tech. Rep., Preprint BUGHW-SC-96/3. Fachbereich Mathematik, Bergische Universit t, Wuppertal, Germany.
 
7
 
8
DACKLAND, K. 1998. Parallel reduction of a regular matrix pair to block Hessenbergtriangular form--Algorithm design and performance modeling. Tech. Rep. UMINF-98.09. Department of Computing Science, Ume University, Ume , Sweden.
 
9
 
10
 
11
12
 
13
DUBRULLE, A.A. 1991. The multishift QR algorithm: Is it worth the trouble?. Tech. Rep. G320-3558+. Palo Alto Scientific Center, IBM Corp., Palo Alto, CA.
 
14
DUBRULLE, A. A. AND GOLUB, G. H. 1994. A multishift QR algorithm without computation of the shifts. Num. Alg. 7, 173-181.
 
15
ENRIGHT, W. AND SERBIN, S. 1978. A note on the efficient solution of matrix pencil systems. BIT 18, 276-281.
 
16
 
17
18
19
 
20
 
21
LANG, B. 1998a. Efficient eigenvalue and singular value computations on shared memory machines. Tech. Rep. BUGHW-SC 98/4. Fachbereich Mathematik, Bergische Universit t, Wuppertal, Germany.
 
22
 
23
MOLER, C. B. AND STEWART, G.W. 1973. An algorithm for generalized matrix eigenvalue problems. SIAM J. Numer. Anal. 10, 2, 241-256.
 
24
 
25
 
26
 
27


Collaborative Colleagues:
Krister Dackland: colleagues
Bo Kågström: colleagues

Peer to Peer - Readers of this Article have also read: