Author of the publication

Modeling Large Compute Nodes with Heterogeneous Memories with Cache-Aware Roofline Model.

, , , , and . PMBS@SC, volume 10724 of Lecture Notes in Computer Science, page 91-113. Springer, (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

NIC-assisted cache-efficient receive stack for message passing over Ethernet.. Concurrency and Computation: Practice and Experience, 23 (2): 199-210 (2011)Exposing the Locality of Heterogeneous Memory Architectures to HPC Applications.. MEMSYS, page 30-39. ACM, (2016)Dodging Non-uniform I/O Access in Hierarchical Collective Operations for Multicore Clusters., and . IPDPS Workshops, page 788-794. IEEE, (2011)Optimizing MPI communication within large multicore nodes with kernel assistance., , , and . IPDPS Workshops, page 1-7. IEEE, (2010)Finding a tradeoff between host interrupt load and MPI latency over Ethernet., and . CLUSTER, page 1-9. IEEE Computer Society, (2009)Enabling high-performance memory migration for multithreaded applications on LINUX., and . IPDPS, page 1-9. IEEE, (2009)An Efficient OpenMP Runtime System for Hierarchical Architectures., , , , and . IWOMP, volume 4935 of Lecture Notes in Computer Science, page 161-172. Springer, (2007)NIC-Assisted Cache-Efficient Receive Stack for Message Passing over Ethernet.. Euro-Par, volume 5704 of Lecture Notes in Computer Science, page 1065-1077. Springer, (2009)ForestGOMP: An Efficient OpenMP Environment for NUMA Architectures., , , , and . International Journal of Parallel Programming, 38 (5-6): 418-439 (2010)Co-Scheduling HPC Workloads on Cache-Partitioned CMP Platforms., , , , and . CLUSTER, page 348-358. IEEE Computer Society, (2018)