Author of the publication

Re-Introduction of communication-avoiding FMM-accelerated FFTs with GPU acceleration.

, , , , and . HPEC, page 1-6. IEEE, (2013)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Precise Data Locality Optimization of Nested Loops., , and . The Journal of Supercomputing, 21 (1): 37-76 (2002)The Open Community Runtime: A runtime system for extreme scale computing., , , , , , , , , and 7 other author(s). HPEC, page 1-7. IEEE, (2016)Automatic memory layout transformations to optimize spatial locality in parameterized loop nests., and . SIGARCH Computer Architecture News, 28 (1): 11-19 (2000)Periodic Polyhedra.. CC, volume 2985 of Lecture Notes in Computer Science, page 134-149. Springer, (2004)PUMA-V: Optimizing Parallel Code Performance Through Interactive Visualization., , , , and . IEEE Computer Graphics and Applications, 39 (1): 84-99 (2019)Memory reuse optimizations in the R-Stream compiler., , , and . GPGPU@ASPLOS, page 42-53. ACM, (2013)Re-Introduction of communication-avoiding FMM-accelerated FFTs with GPU acceleration., , , , and . HPEC, page 1-6. IEEE, (2013)Automatic cluster parallelization and minimizing communication via selective data replication., , , , , , , and . HPEC, page 1-7. IEEE, (2015)Polyhedral user mapping and assistant visualizer tool for the r-stream auto-parallelizing compiler., , , , , , , , , and 2 other author(s). VISSOFT, page 180-184. IEEE Computer Society, (2015)Polyhedral compilation for energy efficiency., , , , , and . HPEC, page 1-7. IEEE, (2016)