Author of the publication

Re-Introduction of communication-avoiding FMM-accelerated FFTs with GPU acceleration.

HPEC, page 1-6. IEEE, (2013)

Other publications of authors with the same name

Iterative Optimization in the Polyhedral Model: Part I, One-Dimensional Time. CGO, page 144-156. IEEE Computer Society, (2007)
Diagonal Rescaling For Neural Networks. CoRR, (2017)
Learning Visual Features from Large Weakly Supervised Data. ECCV (7), volume 9911 of Lecture Notes in Computer Science, page 67-84. Springer, (2016)
Fast Convolutional Nets With fbfft: A GPU Performance Evaluation. ICLR, (2015)
Memory reuse optimizations in the R-Stream compiler. GPGPU@ASPLOS, page 42-53. ACM, (2013)
The Next 700 Accelerated Layers: From Mathematical Expressions of Network Computation Graphs to Accelerated GPU Kernels, Automatically. ACM Trans. Archit. Code Optim., 16 (4): 38:1-38:26 (2020)
Polyhedral Code Generation in the Real World. CC, volume 3923 of Lecture Notes in Computer Science, page 185-201. Springer, (2006)
Automatic Correction of Loop Transformations. PACT, page 292-304. IEEE Computer Society, (2007)
Violated dependence analysis. ICS, page 335-344. ACM, (2006)
A mapping path for multi-GPGPU accelerated computers from a portable high level programming abstraction. GPGPU, volume 425 of ACM International Conference Proceeding Series, page 51-61. ACM, (2010)