Author of the publication

Optimizing tensor contraction expressions for hybrid CPU-GPU execution.

, , , , and . Cluster Computing, 16 (1): 131-155 (2013)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Designing Efficient Heterogeneous Memory Architectures., , , , , and . IEEE Micro, 35 (4): 60-68 (2015)Efficient Breadth-First Search on the Cell/BE Processor., , and . IEEE Trans. Parallel Distrib. Syst., 19 (10): 1381-1395 (2008)Challenges in Mapping Graph Exploration Algorithms on Advanced Multi-core Processors., , , and . IPDPS, page 1-10. IEEE, (2007)Accelerating subsurface transport simulation on heterogeneous clusters., , and . CLUSTER, page 1-8. IEEE Computer Society, (2013)Exact multi-pattern string matching on the cell/b.e. processor., , and . Conf. Computing Frontiers, page 33-42. ACM, (2008)Exploring Manycore Multinode Systems for Irregular Applications with FPGA Prototyping., , , , and . FCCM, page 238. IEEE Computer Society, (2013)Exploring hardware support for scaling irregular applications on multi-node multi-core architectures., , , , , and . ASAP, page 309-313. IEEE Computer Society, (2013)A High Performance Computing Network and System Simulator for the Power Grid: NGNS^2., , , , and . SC Companion, page 313-322. IEEE Computer Society, (2012)Acceleration of Streamed Tensor Contraction Expressions on GPGPU-Based Clusters., , , and . CLUSTER, page 207-216. IEEE Computer Society, (2010)Power/performance hardware optimization for synchronization intensive applications in MPSoCs., , , and . DATE, page 606-611. European Design and Automation Association, Leuven, Belgium, (2006)