Author of the publication

High performance OpenSHMEM for Xeon Phi clusters: Extensions, runtime designs and application co-design.

, , , , , , and . CLUSTER, page 10-18. IEEE Computer Society, (2014)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Designing Topology-Aware Communication Schedules for Alltoall Operations in Large InfiniBand Clusters., , , , , , and . ICPP, page 231-240. IEEE Computer Society, (2014)OpenSHMEM Non-blocking Data Movement Operations with MVAPICH2-X: Early Experiences., , , and . PAW@SC, page 9-16. IEEE Computer Society, (2016)Can Network-Offload Based Non-blocking Neighborhood MPI Collectives Improve Communication Overheads of Irregular Graph Algorithms?, , , , , , and . CLUSTER Workshops, page 222-230. IEEE Computer Society, (2012)Designing Non-blocking Broadcast with Collective Offload on InfiniBand Clusters: A Case Study with HPL., , , , , , and . Hot Interconnects, page 27-34. IEEE Computer Society, (2011)Codesign for InfiniBand Clusters., , , , , and . IEEE Computer, 44 (11): 31-36 (2011)A scalable and portable approach to accelerate hybrid HPL on heterogeneous CPU-GPU clusters., , , , , and . CLUSTER, page 1-8. IEEE Computer Society, (2013)Designing Scalable Graph500 Benchmark with Hybrid MPI+OpenSHMEM Programming Models., , , and . ISC, volume 7905 of Lecture Notes in Computer Science, page 109-124. Springer, (2013)High performance OpenSHMEM for Xeon Phi clusters: Extensions, runtime designs and application co-design., , , , , , and . CLUSTER, page 10-18. IEEE Computer Society, (2014)Scalable Graph500 design with MPI-3 RMA., , , , , , and . CLUSTER, page 230-238. IEEE Computer Society, (2014)A Novel Functional Partitioning Approach to Design High-Performance MPI-3 Non-blocking Alltoallv Collective on Multi-core Systems., , , , and . ICPP, page 611-620. IEEE Computer Society, (2013)