<head> <title>Clint Whaley's Papers and Publications</title> <head> <center> <h1>Papers and Publications for R. Clint Whaley</h1> </center> <hr> <h1>Journal Publications</h1> <ol> <li> <A HREF="http://www.cs.utsa.edu/~whaley/papers/SCE001156.pdf"> "Reducing Floating Point Error in Dot Product using the Superblock Family of Algorithms"</A>, by Anthony M. Castaldo, R. Clint Whaley and Anthony T. Chronopoulos. <i>SIAM Journal on Scientific Computing (SISC)</i>, Volume 31, Number 2, pp 1156-1174, 2008. <p> <li> <A HREF="http://www.cs.utsa.edu/~whaley/papers/timing_SPE08.pdf"> "Achieving accurate and context-sensitive timing for code optimization"</A>, by R. Clint Whaley and Anthony M. Castaldo. <i>Software: Practice & Experience</i>, Volume 38, Number 15, pp 1621-1642, April, 2008. <p> <li> <a HREF="http://www.cs.utsa.edu/~whaley/papers/spercw04.ps"> "Minimizing Development and Maintenance Costs in Supporting Persistently Optimized BLAS"</a>, by R. Clint Whaley and Antoine Petitet. <i>Software: Practice & Experience</i>, Volume 35, Number 2, pp 101-121, February, 2005. <p> <li> "Self-Adapting Linear Algebra Algorithms and Software", by J. Demmel, J. Dongarra, V. Eijkhout, E. Fuentes, A. Petitet, R. Vuduc, R. C. Whaley and K. Yelick. <i>Proceedings of the IEEE</i>, Volume 93, Number 2, pp 293-312, February, 2005. <p> <li> <a HREF="http://www.cs.utsa.edu/~whaley/papers/blast-toms.pdf"> "An Updated Set of Basic Linear Algebra Subprograms (BLAS)"</a>, by L. Susan Blackford, James Demmel, Jack Dongarra, Iain Duff, Sven Hammarling, Greg Henry, Micheal Heroux, Linda Kaufman, Andrew Lumsdain, Antoine Petitet, Roldan Pozo, Karin Remington, and R. Clint Whaley. <i>ACM Transactions on Mathematical Software</i>, 28(2):135-151, June 2002. <p> <li> <a HREF="http://www.cs.utsa.edu/~whaley/papers/lawn147.ps"> "Automated Empirical Optimization of Software and the ATLAS project"</a> by R. Clint Whaley, Antoine Petitet and Jack Dongarra. <i>Parallel Computing</i>, 27(1-2):3-35, 2001. <p> <li> <a HREF="http://www.netlib.org/lapack/lawns/lawn112.ps"> "Practical Experience in the Numerical Dangers of Heterogeneous Computing",</a> <i>ACM Transactions on Mathematical Software</i> Volume 23, Number 2, pages 133-147, June 1997. <p> <li><a HREF="http://www.netlib.org/lapack/lawns/lawn95.ps"> "ScaLAPACK: A Portable Linear Algebra Library for Distributed Memory Computers - Design Issues and Performance",</a> by J. Choi, J. Demmel, I. Dhillon, J. Dongarra, S. Ostrouchov, A. Petitet, K. Stanley, D. Walker, and R. C. Whaley. <i>Computer Physics Communications</i> Volume 97, pages 1-15, 1996. <p> <li><a HREF="http://www.netlib.org/lapack/lawns/lawn80.ps"> "The Design and Implementation of ScaLAPACK LU, QR, and Cholesky",</a> by J. Choi, J. Dongarra, S. Ostrouchov, A. Petitet, D. Walker, and R. C. Whaley. <i>Scientific Programming</i> Volume 5, pages 173-184, 1996. <p> </ol> <hr> <h1>Refereed Conference Publications</h1> <ol> <LI><A HREF="http://www.cs.utsa.edu/~whaley/papers/ettIEEE.pdf"> "Minimizing Startup Costs for Performance-Critical Threading"</A> by Anthony M. Castaldo and R. Clint Whaley. Accepted for publication in the <A HREF=http://ipdps.org/"><i>23rd IEEE International Parallel and Distributed Processing Symposium</i></A>, Rome, Italy, May 25-29, 2009. <p> <li><A HREF="http://www.cs.utsa.edu/~whaley/papers/lanb.pdf"> "Empirically Tuning LAPACK's Blocking Factor for Increased Performance"</A>, by R. Clint Whaley. <A HREF="http://www.imcsit.org/?cont=97&type=page&page=78"> <i>International Multiconference on Computer Science and Information Technology</i></A>, Wisla, Poland, October 20-22, 2008. <p> <li><A HREF="http://www.cs.utsa.edu/~qingyi/papers/gemm07.pdf"> "Automated Transformation for Performance-Critical Kernels"</A>, by Qing Yi and R. Clint Whaley. <i>ACM SIGPLAN Symposium on Library-Centric Software Design</i>, Montreal, Canada. Oct, 2007. <p> <li><A HREF="http://www.cs.utsa.edu/~whaley/papers/icpp05_8.ps"> "Tuning High Performance Kernels through Empirical Compilation"</A> by R. Clint Whaley and David B. Whalley. <i>The 2005 International Conference on Parallel Processing</i> (ICPP-05), June 14-17, 2005.<p> <p> <li> "Automatically Tuned Linear Algebra Software" by R. Clint Whaley and Jack Dongarra. <i>Ninth SIAM Conference on Parallel Processing for Scientific Computing</i>, March 22-24, 1999, CD-ROM Proceedings. <p> <li><a HREF="http://www.netlib.org/lapack/lawns/lawn139.ps"> "Numerical Linear Algebra Problem Solving Environment Designer's Perspective"</a>, <i>Society for Industrial and Applied Mathematics</i>, Philadelphia, PA, 1999. <p> <li> <a HREF="http://www.cs.utsa.edu/~whaley/papers/atlas_sc98.ps"> "Automatically Tuned Linear Algebra Software"</a> by R. Clint Whaley and Jack Dongarra. <b>Winner, best paper in systems catagory</b>, SuperComputing 1998: High Performance Networking and Computing. <p> <li><a HREF="http://www.netlib.org/utk/papers/siam397-scalapack/siam397-scalapack.ps"> "ScaLAPACK: A Linear Algebra Library for Message-passing Computers",<a> by S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, and R. Whaley. <i>Proceedings of 1997 SIAM Conference on Parallel Processing</i>, May 1997. <p> <li><a HREF="http://www.netlib.org/lapack/lawns/lawn100.ps"> "A Proposal for a Set of Parallel Basic Linear Algebra Subprograms"</a>, by Jaeyoung Choi, J. Dongarra, S. Ostrouchov, A. Petitet, D. Walker and R. C. Whaley. Second International Workshop, PARA'95, Lyngby, Denmark, August 1995. Proceedings in <i>Lecture Notes in Computer Science</i>, Number 1041, pages 107-114, Springer-Verlag, Berlin - Heidenberg - New York, 1996. <p> <li>"Two Dimensional Basic Linear Algebra Communications Subprograms", by R. Clint Whaley and Jack Dongarra. <i>Proceedings of the sixth SIAM Conference on Parallel Processing for Scientific Computing</i>, SIAM Publications, pages 347-352, Philadelphia, 1993. <p> </ol> <hr> <h1>Books</h1> <ol> <li> <a HREF="http://www.netlib.org/scalapack/slug/"> <u>ScaLAPACK Users' Guide</u></a> by L.S. Blackford, J. Choi, A. Cleary, E. D'Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, R. C. Whaley. SIAM Publications, Philadelphia, 1997, ISBN 0-89871-397-8. <li><u>Handbook on Parallel and Distributed Processing</u>, <i>editors:</i> J. Blazewicz, K. Ecker, B. Plateau, D. Trystram. Springer-Verlag Berlin Headelberg, 2000, ISBN: 3-540-6641-6. <p> </ol> <hr> <h1>Doctoral Dissertation</h1> <ol> <li><a HREF="http://www.cs.utsa.edu/~whaley/papers/diss.ps"> "Automated Empirical Optimization of High Performance Floating Point Kernels" </a> by R. Clint Whaley. Defended November 2, 2004. <br> Advisor: <A HREF="http://www.cs.fsu.edu/~whalley">David Whalley</A><br> </ol> <hr> <h1>Master's Thesis.</h1> <ol> <li><a HREF="http://www.netlib.org/lapack/lawns/lawn73.ps"> "Basic Linear Algebra Communication Subprograms: Analysis and Implementation Across Multiple Parallel Architectures"</a> by R. Clint Whaley. May, 1994. <br> Advisor: <A HREF="http://www.netlib.org/utk/people/JackDongarra/"> Jack Dongarra</A><br> </ol> <hr> <h1>Selected Workshops and Presentations</h1> <ol> <li><A HREF="http://www.cs.utsa.edu/~whaley/papers/atlas07iWAPT.pdf"> "ATLAS Version 3.8 : Overview and Status"</A> by R. Clint Whaley. <A HREF="http://www.na.cse.nagoya-u.ac.jp/~yamamoto/iWAPT2007.html"> <i>International Workshop on Automatic Performance Tuning (iWAPT07)</i></A>, Tokyo, Japan, September 20-21, 2007. Invited speaker with paper and talk. Proceedings available</a>. <li><A HREF="http://www.cs.utsa.edu/~whaley/talks/autotune07.pdf"> "Automatically Tuned Linear Algebra Software"<A> by R. Clint Whaley, <A HREF="http://cscads.rice.edu/workshops/july2007/autotune-workshop-07"> Workshop on Automatic Tuning for Petascale Systems</A>, Snowbird, Utah, July 9-12 2007. <li><A HREF="http://www.cs.utsa.edu/~whaley/talks/cri07.pdf"> "NSF CRI CNS-0551504, ATLAS Support and Development"</A>, by R. Clint Whaley, <A HREF="http://www.cs.bu.edu/NSF-CRI07/">2007 NSF/CISE CRI PI Meeting</A>, Boston, MA, June 4-5, 2007. </ol> <hr> <h1>User's Guides, HOWTOs, and miscellaneous.</h1> <ol> <li> <A HREF="http://www.cs.utsa.edu/~whaley/papers/atlas_install.pdf"> ATLAS Installation Guide</A>. <li> <a HREF="http://math-atlas.sourceforge.net/devel/atlas_contrib/assembly.html"> "Some notes on using assembly"</a>, by R. Clint Whaley. <li> <a HREF="http://www.cs.utsa.edu/~whaley/papers/atlas_contrib.ps"> "A Guide to User Contribution to ATLAS"</a>, by R. Clint Whaley. Also available online as <a HREF="http://math-atlas.sourceforge.net/devel/atlas_contrib/"> html</a>. <li> <a HREF="http://www.cs.utsa.edu/~whaley/papers/atlas_devel.ps"> "A Collaborative Guide to ATLAS Development"</a>, by R. Clint Whaley and Peter Soendergaard. Also available online as <a HREF="http://math-atlas.sourceforge.net/devel/atlas_devel/"> html</a>. <li> <a HREF="http://www.cs.utsa.edu/~whaley/extract/Extract400.ps"> "A User's Guide to Extract"</a>, by R. Clint Whaley. Also available online as <a HREF="http://www.cs.utsa.edu/~whaley/EXTRACT/UG4/Extract.html"> html</a>. <li> <a HREF="http://www.netlib.org/lapack/lawns/lawn137.ps"> "Installation Guide and Design of the HPF 1.1 interface to ScaLAPACK, SLHPF"</a> by L. S. Blackford, J. J. Dongarra, C. A. Papadopoulos, and R. C. Whaley. August, 1998. <li><A HREF="http://www.netlib.org/lapack/lawns/lawn136.ps"> "ScaLAPACK Evaluation and Performance at the DoD MSRCs"</a> by L. S. Blackford and R. C. Whaley. UT-CS-98.388, April 1998. <li> <a HREF="http://www.netlib.org/blacs/blacs_install.ps"> "Installation Guide for the BLACS and its Test Suite"</a> by R. Clint Whaley. <li> <a HREF="http://www.netlib.org/lapack/lawns/lawn94.ps"> "A User's Guide to the BLACS v1.1"</a>, by J. Dongarra and R. C. Whaley". March, 1995 (last updated, May 5, 1997). <li> <a HREF="http://www.netlib.org/lapack/lawns/lawn93.ps"> "Installation Guide for ScaLAPACK"</a> by J. Choi, J. Demmel, I. Dhillon, J. Dongarra, S. Ostrouchov, A. Petitet, K. Stanley, D. Walker, and R. C. Whaley. March, 1995. <li> <a HREF="http://www.cs.utsa.edu/~whaley/papers/BlacsMpiInSL.ps"> "Using BLACS and MPI in ScaLAPACK"</a> by R. Clint Whaley. <li> <a HREF="http://www.cs.utsa.edu/~whaley/papers/MpiBlacsIss.ps"> "Outstanding Issues in the MPIBLACS"</a> by R. Clint Whaley. <li> <a HREF="http://www.cs.utsa.edu/~whaley/papers/MpiProp.ps"> "Some Plebian Extensions to MPI"</a> by R. Clint Whaley. </ol> <hr> <center> <a HREF="http://www.cs.utsa.edu/~whaley/">Back to homepage</a> </center>