Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Experiences in autotuning matrix multiplication for energy minimization on GPUs., , , , and . Concurrency and Computation: Practice and Experience, 27 (17): 5096-5113 (2015)Implementation and Tuning of Batched Cholesky Factorization and Solve for NVIDIA GPUs., , , and . IEEE Trans. Parallel Distrib. Syst., 27 (7): 2036-2048 (2016)Accelerating the LOBPCG method on GPUs using a blocked sparse matrix vector product., , and . SpringSim (HPS), page 75-82. SCS/ACM, (2015)Residual Replacement in Mixed-Precision Iterative Refinement for Sparse Linear Systems., , , , and . ISC Workshops, volume 11203 of Lecture Notes in Computer Science, page 554-561. Springer, (2018)Iterative Sparse Triangular Solves for Preconditioning., , and . Euro-Par, volume 9233 of Lecture Notes in Computer Science, page 650-661. Springer, (2015)A Block-Asynchronous Relaxation Method for Graphics Processing Units., , , and . IPDPS Workshops, page 113-124. IEEE Computer Society, (2012)ParILUT - A New Parallel Threshold ILU Factorization., , and . SIAM J. Scientific Computing, 40 (4): C503-C519 (2018)Updating incomplete factorization preconditioners for model order reduction., , , and . Numerical Algorithms, 73 (3): 611-630 (2016)Toward a modular precision ecosystem for high-performance computing., , , and . IJHPCA, (2019)Parallel selection on GPUs., and . Parallel Computing, (2020)