Author of the publication

LU Factorization of Small Matrices: Accelerating Batched DGETRF on the GPU.

, , , , , and . HPCC/CSS/ICESS, page 157-160. IEEE, (2014)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Step towards Energy Efficient Computing: Redesigning a Hydrodynamic Application on CPU-GPU., , , , , and . IPDPS, page 972-981. IEEE Computer Society, (2014)Tridiagonalization of a dense symmetric matrix on multiple GPUs and its application to symmetric eigenvalue problems., , , , , and . Concurrency and Computation: Practice and Experience, 26 (16): 2652-2666 (2014)Tridiagonalization of a Symmetric Dense Matrix on a GPU Cluster., , , and . IPDPS Workshops, page 1070-1079. IEEE, (2013)Mixed-Precision Orthogonalization Scheme and Adaptive Step Size for Improving the Stability and Performance of CA-GMRES on GPUs., , , and . VECPAR, volume 8969 of Lecture Notes in Computer Science, page 17-30. Springer, (2014)LU Factorization of Small Matrices: Accelerating Batched DGETRF on the GPU., , , , , and . HPCC/CSS/ICESS, page 157-160. IEEE, (2014)Towards batched linear solvers on accelerated hardware platforms., , , , and . PPOPP, page 261-262. ACM, (2015)A Fast Batched Cholesky Factorization on a GPU., , , and . ICPP, page 432-440. IEEE Computer Society, (2014)Poster: Acceleration of the BLAST Hydro Code on GPU., , , and . SC Companion, page 1337. IEEE Computer Society, (2012)Optimization for performance and energy for batched matrix computations on GPUs., , , , and . GPGPU@PPoPP, page 59-69. ACM, (2015)Batched matrix computations on hardware accelerators based on GPUs., , , , and . IJHPCA, 29 (2): 193-208 (2015)