ATLAS
(Automatically Tuned Linear Algebra Subroutines) is a package that
automatically generates efficient BLAS libraries. Clint Whaley's papers
on ATLAS have some pretty good information about tuning.
FFTW (Fastest Fourier Transform
in the West) is an automatically-generated cache-oblivious
high-performance Fourier transform code.