Author of the publication

KernelFaRer: Replacing Native-Code Idioms with High-Performance Library Calls.

, , , , , , and . ACM Trans. Archit. Code Optim., 18 (3): 38:1-38:22 (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Combining Static and Dynamic Data Coalescing in Unified Parallel C., , , , and . IEEE Trans. Parallel Distrib. Syst., 27 (2): 381-393 (2016)On the Merits of Distributed Work-Stealing on Selective Locality-Aware Tasks., , and . ICPP, page 100-109. IEEE Computer Society, (2013)Eliminating Redundant Join-Set Computations in Static Single Assignment., and . J. UCS, 12 (8): 1007-1019 (2006)Using machines to learn method-specific compilation strategies., , , , and . CGO, page 257-266. IEEE Computer Society, (2011)Workload Reduction for Multi-input Feedback-Directed Optimization., , , and . CGO, page 59-69. IEEE Computer Society, (2009)Minimum Register Instruction Sequence Problem: Revisiting Optimal Code Generation for DAGs., , , , and . IPDPS, page 26. IEEE Computer Society, (2001)Caching Single-Assignment Structures to Build a Robust Fine-Grain Multi-Threading System., , , and . IPDPS, page 589-594. IEEE Computer Society, (2000)Using shared-data localization to reduce the cost of inspector-execution in unified-parallel-C programs., , , , and . Parallel Computing, (2016)10th Workshop on Compiler-Driven Performance., , , , and . CASCON, page 371-372. IBM / ACM, (2011)Forma: A framework for safe automatic array reshaping., , , , and . ACM Trans. Program. Lang. Syst., 30 (1): 2 (2007)