Author of the publication

Automatic Performance Modeling of HPC Applications.

, , , , , , , , , and 1 other author(s). Software for Exascale Computing, volume 113 of Lecture Notes in Computational Science and Engineering, Springer, (2016)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Slim Fly: A Cost Effective Low-Diameter Network Topology., and . SC, page 348-359. IEEE, (2014)Group Operation Assembly Language - A Flexible Way to Express Collective Communication., , and . ICPP, page 574-581. IEEE Computer Society, (2009)On the Effects of CPU Caches on MPI Point-to-Point Communications., , and . CLUSTER, page 495-503. IEEE Computer Society, (2012)Application-oriented ping-pong benchmarking: how to assess the real communication overheads., , and . Computing, 96 (4): 279-292 (2014)A practically constant-time MPI Broadcast Algorithm for large-scale InfiniBand Clusters with Multicast., , and . IPDPS, page 1-8. IEEE, (2007)The impact of network noise at large-scale communication performance., , and . IPDPS, page 1-8. IEEE, (2009)dCUDA: hardware supported overlap of computation and communication., , and . SC, page 52. ACM, (2016)Exact Dependence Analysis for Increased Communication Overlap., , and . EuroMPI, volume 7490 of Lecture Notes in Computer Science, page 89-99. Springer, (2012)Leveraging MPI's One-Sided Communication Interface for Shared-Memory Programming., , , , , , , , and . EuroMPI, volume 7490 of Lecture Notes in Computer Science, page 132-141. Springer, (2012)Modeling and analysis of remote memory access programming., , , and . OOPSLA, page 129-144. ACM, (2016)