AUGEM: automatically generate high performance dense linear algebra kernels on x86 CPUs.

SC, page 25:1-25:12. ACM, (2013)

Other publications of authors with the same name

P-DOT: a model of computation for big data. IJPEDS, 31 (3): 233-253 (2016)
Efficient parallel optimizations of a high-performance SIFT on GPUs. J. Parallel Distrib. Comput., (2019)
A Relational Theory of Locality. TACO, 16 (3): 33:1-33:26 (2019)
HPC AI500: A Benchmark Suite for HPC AI Systems. CoRR, (2019)
Large Scale Satellite Imagery Simulations with Physically Based Ray Tracing on Tianhe-1A Supercomputer. HPCC/EUC, page 549-556. IEEE, (2013)
Early Performance Evaluation of Dawning 5000A and DeepComp 7000. ICPADS, page 578-585. IEEE Computer Society, (2009)
Automatic FFT Performance Tuning on OpenCL GPUs. ICPADS, page 228-235. IEEE Computer Society, (2011)
Fast Convolution Operations on Many-Core Architectures. HPCC/CSS/ICESS, page 316-323. IEEE, (2015)
Implementing High-performance Intensity Model with Blur Effect on GPUs for Large-scale Star Image Simulation. IPDPS Workshops, page 1879-1888. IEEE Computer Society, (2012)
Development of a Scalable Solver for the Earth's Core Convection. HPCA (China), volume 5938 of Lecture Notes in Computer Science, page 497-502. Springer, (2009)