Author of the publication

Implementation and Evaluation of OpenSHMEM Contexts Using OFI Libfabric.

OpenSHMEM, volume 10679 of Lecture Notes in Computer Science, pages 19-34. Springer (2017)

Other publications of authors with the same name

Dynamic Task Parallelism with a GPU Work-Stealing Runtime System. LCPC, volume 7146 of Lecture Notes in Computer Science, pages 203-217. Springer (2011)

Speculative Execution of Parallel Programs with Precise Exception Semantics on GPUs. LCPC, volume 8664 of Lecture Notes in Computer Science, pages 342-356. Springer (2013)

CnC-CUDA: Declarative Programming for GPUs. LCPC, volume 6548 of Lecture Notes in Computer Science, pages 230-245. Springer (2010)

A Pluggable Framework for Composable HPC Scheduling Libraries. IPDPS Workshops, pages 723-732. IEEE Computer Society (2017)

S2FA: an accelerator automation framework for heterogeneous computing in datacenters. DAC, pages 153:1-153:6. ACM (2018)

Accelerating Habanero-Java programs with OpenCL generation. PPPJ, pages 124-134. ACM (2013)

Auto-grading for parallel programs. EduHPC@SC, pages 3:1-3:8. ACM (2015)

OpenMP as a High-Level Specification Language for Parallelism - And its use in Evaluating Parallel Programming Systems. IWOMP, volume 9903 of Lecture Notes in Computer Science, pages 141-155 (2016)

Efficient Checkpointing of Multi-threaded Applications as a Tool for Debugging, Performance Tuning, and Resiliency. IPDPS, pages 232-241. IEEE Computer Society (2016)

Data-parallel distributed training of very large models beyond GPU capacity. CoRR (2018)