Rapid Development of High-Performance Linear Algebra Libraries.

PARA, volume 3732 of Lecture Notes in Computer Science, pages 376-384. Springer, (2004)

Other publications of authors with the same name

Efficient high-precision matrix algebra on parallel architectures for nonlinear combinatorial optimization. Math. Program. Comput., 2 (2): 103-124 (2010)

Is Cache-Oblivious DGEMM Viable? PARA, volume 4699 of Lecture Notes in Computer Science, pages 919-928. Springer, (2006)

Parallel deep neural network training for LVCSR tasks using Blue Gene/Q. INTERSPEECH, pages 1048-1052. ISCA, (2014)

Scalable Community Detection with the Louvain Algorithm. IPDPS, pages 28-37. IEEE Computer Society, (2015)

An Early Performance Study of Large-Scale POWER8 SMP Systems. IPDPS, pages 263-272. IEEE Computer Society, (2016)

Blue Gene/L performance tools. IBM Journal of Research and Development, 49 (2-3): 407-424 (2005)

Extending stability beyond CPU millennium: a micron-scale atomistic simulation of Kelvin-Helmholtz instability. SC, page 58. ACM Press, (2007)

Gordon Bell finalists I - Large scale drop impact analysis of mobile phone using ADVC on Blue Gene/L. SC, page 46. ACM Press, (2006)

Gordon Bell finalists I - Large-scale electronic structure calculations of high-Z metals on the BlueGene/L platform. SC, page 45. ACM Press, (2006)

Massively parallel models of the human circulatory system. SC, pages 1:1-1:11. ACM, (2015)