Author of the publication

Thoroughly Exploring GPU Buffering Options for Stencil Code by Using an Efficiency Measure and a Performance Model.

, , and . IEEE Trans. Multi-Scale Computing Systems, 4 (3): 477-490 (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Neighborhood Prefetching on Multiprocessors Using Instruction History.. IEEE PACT, page 123-132. IEEE Computer Society, (2000)A Lower Bound on the Average Physical Length of Edges in the Physical Realization of Graphs.. Parallel Processing Letters, 6 (1): 137-143 (1996)GeauxDock: A novel approach for mixed-resolution ligand docking using a descriptor-based force field., , , , , , , and . Journal of Computational Chemistry, 36 (27): 2013-2026 (2015)A Massive Data Parallel Computational Framework for Petascale/Exascale Hybrid Computer Systems, , , , , , , and . CoRR, (2012)A Performance Model and Efficiency-Based Assignment of Buffering Strategies for Automatic GPU Stencil Code Generation., , and . MCSoC, page 361-368. IEEE Computer Society, (2016)The effects on branch prediction when utilizing control independence., and . IPDPS Workshops, page 1-4. IEEE, (2010)A Self-Routing Permutation Network., and . J. Parallel Distrib. Comput., 10 (2): 140-151 (1990)Discovering barriers to efficient execution, both obvious and subtle, using instruction-level visualization., and . VPA@SC, page 36-41. IEEE, (2014)GPU road network graph contraction and SSSP query., , and . ICS, page 250-260. ACM, (2019)The Interaction and Relative Effectiveness of Hardware and Software Data Prefetch., and . Journal of Circuits, Systems, and Computers, (2012)