Author of the publication

Portable parallel performance from sequential, productive, embedded domain-specific languages.

, , , , , , , and . PPOPP, page 303-304. ACM, (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A case for FAME: FPGA architecture model execution., , , , , and . ISCA, page 290-301. ACM, (2010)CudaDMA: optimizing GPU memory bandwidth via warp specialization., , and . SC, page 12:1-12:11. ACM, (2011)Fast speaker diarization using a high-level scripting language., , , and . ASRU, page 553-558. IEEE, (2011)A hardware evaluation of cache partitioning to improve utilization and energy-efficiency while preserving responsiveness., , , , , and . ISCA, page 308-319. ACM, (2013)A 45nm 1.3GHz 16.7 double-precision GFLOPS/W RISC-V processor with vector accelerators., , , , , , and . ESSCIRC, page 199-202. IEEE, (2014)CUDA-level Performance with Python-level Productivity for Gaussian Mixture Model Applications., , , , , and . HotPar, USENIX Association, (2011)RAMP gold: an FPGA-based architecture simulator for multiprocessors., , , , , , and . DAC, page 463-468. ACM, (2010)Portable parallel performance from sequential, productive, embedded domain-specific languages., , , , , , , and . PPOPP, page 303-304. ACM, (2012)An Agile Approach to Building RISC-V Microprocessors., , , , , , , , , and 8 other author(s). IEEE Micro, 36 (2): 8-20 (2016)Single-chip microprocessor that communicates directly using light., , , , , , , , , and 12 other author(s). Nature, 528 (7581): 534-538 (2015)