Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Group Operation Assembly Language - A Flexible Way to Express Collective Communication., , and . ICPP, page 574-581. IEEE Computer Society, (2009)Slim Fly: A Cost Effective Low-Diameter Network Topology., and . SC, page 348-359. IEEE, (2014)On the Effects of CPU Caches on MPI Point-to-Point Communications., , and . CLUSTER, page 495-503. IEEE Computer Society, (2012)Application-oriented ping-pong benchmarking: how to assess the real communication overheads., , and . Computing, 96 (4): 279-292 (2014)The impact of network noise at large-scale communication performance., , and . IPDPS, page 1-8. IEEE, (2009)A practically constant-time MPI Broadcast Algorithm for large-scale InfiniBand Clusters with Multicast., , and . IPDPS, page 1-8. IEEE, (2007)dCUDA: hardware supported overlap of computation and communication., , and . SC, page 52. ACM, (2016)Engineering Algorithms for Scalability through Continuous Validation of Performance Expectations., , , , , and . IEEE Trans. Parallel Distrib. Syst., 30 (8): 1768-1785 (2019)Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis., and . CoRR, (2018)Taming Unbalanced Training Workloads in Deep Learning with Partial Collective Operations., , , , and . CoRR, (2019)