Author of the publication

Catwalk: A Quick Development Path for Performance Models.

, , , , , , , , and . Euro-Par Workshops (2), volume 8806 of Lecture Notes in Computer Science, page 589-600. Springer, (2014)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Group Operation Assembly Language - A Flexible Way to Express Collective Communication., , and . ICPP, page 574-581. IEEE Computer Society, (2009)Slim Fly: A Cost Effective Low-Diameter Network Topology., and . SC, page 348-359. IEEE, (2014)On the Effects of CPU Caches on MPI Point-to-Point Communications., , and . CLUSTER, page 495-503. IEEE Computer Society, (2012)dCUDA: hardware supported overlap of computation and communication., , and . SC, page 52. ACM, (2016)Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis., and . CoRR, (2018)Taming Unbalanced Training Workloads in Deep Learning with Partial Collective Operations., , , , and . CoRR, (2019)The impact of network noise at large-scale communication performance., , and . IPDPS, page 1-8. IEEE, (2009)A practically constant-time MPI Broadcast Algorithm for large-scale InfiniBand Clusters with Multicast., , and . IPDPS, page 1-8. IEEE, (2007)Using Simulation to Evaluate the Performance of Resilience Strategies at Scale., , , , , and . PMBS@SC, volume 8551 of Lecture Notes in Computer Science, page 91-114. Springer, (2013)To Push or To Pull: On Reducing Communication and Synchronization in Graph Computations., , , , and . HPDC, page 93-104. ACM, (2017)