Author of the publication

Speculative Execution of Parallel Programs with Precise Exception Semantics on GPUs.

, , , , and . LCPC, volume 8664 of Lecture Notes in Computer Science, page 342-356. Springer, (2013)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Pluggable Framework for Composable HPC Scheduling Libraries., , , , and . IPDPS Workshops, page 723-732. IEEE Computer Society, (2017)S2FA: an accelerator automation framework for heterogeneous computing in datacenters., , , , , and . DAC, page 153:1-153:6. ACM, (2018)Dynamic Task Parallelism with a GPU Work-Stealing Runtime System., , , and . LCPC, volume 7146 of Lecture Notes in Computer Science, page 203-217. Springer, (2011)Speculative Execution of Parallel Programs with Precise Exception Semantics on GPUs., , , , and . LCPC, volume 8664 of Lecture Notes in Computer Science, page 342-356. Springer, (2013)CnC-CUDA: Declarative Programming for GPUs., , , and . LCPC, volume 6548 of Lecture Notes in Computer Science, page 230-245. Springer, (2010)Accelerating Habanero-Java programs with OpenCL generation., , , , and . PPPJ, page 124-134. ACM, (2013)Auto-grading for parallel programs., , , , and . EduHPC@SC, page 3:1-3:8. ACM, (2015)Efficient Checkpointing of Multi-threaded Applications as a Tool for Debugging, Performance Tuning, and Resiliency., and . IPDPS, page 232-241. IEEE Computer Society, (2016)A survey of sparse matrix-vector multiplication performance on large matrices., , , , , and . CoRR, (2016)Data-parallel distributed training of very large models beyond GPU capacity., , , , , and . CoRR, (2018)