RC24704 (W0812-047) December 8, 2008 Computer Science IBM Research Report Optimizing Sparse Matrix-Vector Multiplication on GPUs Using Compile-time and Run-time Strategies Muthu Manikandan Baskaran Department of Computer Science and Engineering The Ohio State University Columbus, OH USA Rajesh Bordawekar IBM Research Division Thomas J. Watson Research Center P.O. Box 704 Yorktown Heights, NY 10598 USA Research Division Almaden - Austin - Beijing - Cambridge - Haifa - India - T. J. Watson - Tokyo - Zurich LIMITED DISTRIBUTION NOTICE: This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g. , payment of royalties). Copies may be requested from IBM T. J. Watson Research Center , P. O. Box 218, Yorktown Heights, NY 10598 USA (email: reports@us.ibm.com). Some reports are available on the internet at http://domino.watson.ibm.com/library/CyberDig.nsf/home .