SuperLU is a general purpose library for the direct solution of large, sparse, nonsymmetric systems of linear equations on high performance machines. The library is written in C and is callable from either C or Fortran. The library routines will perform an LU decomposition with partial pivoting and triangular system solves through forward and back substitution. The LU factorization routines can handle non-square matrices but the triangular solves are performed only for square matrices. The matrix columns may be preordered (before factorization) either through library or user supplied routines. This preordering for sparsity is completely separate from the factorization. Working precision iterative refinement subroutines are provided for improved backward stability. Routines are also provided to equilibrate the system, estimate the condition number, calculate the relative backward error, and estimate error bounds for the refined solutions.
SuperLU package comes in three different flavors:
Referencing SuperLU in a publication.
The User Mailing List is used to announce changes, new releases, etc.
Please send me email if you have used any versions of the library. Also use this mailing address to report problems.
This is my survey article about sparse direct solvers of various flavours.
This project has been funded by DOE, NSF and DARPA.
Developers:
     Sherry Li
    
Jim Demmel
    
John Gilbert
    
Laura Grigori
Other Contributors:
     Pietro Cicotti, UCSD
     Daniel Schreiber
     Jinqchong Teo
     Yu Wang
     Eric Zhang, Albany High
SuperLU Version 3.1
SuperLU has achieved up to 40% of the theoretical floating-point rate
on a number of workstations, such as MIPS R8000 and IBM RS/6000.
The megaflop rate usually increases with increasing ratio of
floating-point operations count over the number of nonzeros in the
L and U factors.
SuperLU_MT Version 2.0
SuperLU_MT demonstrated 5--10 fold speedups on a range of commercially
popular SMPs, and up to 2.5 Gigaflops factorization rate.
(Last update: 01/15/08)
SuperLU_DIST Version 2.2
MPI is used for interprocess communication.
GESP stands for Gaussian Elimination with "Static Pivoting". Static pivoting
is a technique that combines the numerical stability
of partial pivoting with the scalability of no pivoting,
to run accurately and efficiently on large numbers of processors.
SuperLU_DIST demonstrated up to 100 fold speedup on the 512-PE Cray T3E
at NERSC, and 10.2 Gigaflops factorization rate, using MPI.
The scientific result was reported earlier in the cover article
of Science, Dec 24, 1999.