Author of the publication

Automatic FFT Performance Tuning on OpenCL GPUs.

ICPADS, pages 228-235. IEEE Computer Society, (2011)

Other publications of authors with the same name

Implementation and Optimization of Multi-dimensional Real FFT on ARMv8 Platform. ICA3PP (2), volume 11335 of Lecture Notes in Computer Science, pages 338-353. Springer, (2018)

AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs. ISPA/BDCloud/SocialCom/SustainCom, pages 159-166. IEEE, (2021)

Evolutionary Based Intelligent Algorithm for Topology Optimization of Structure. ISDA (1), pages 897-902. IEEE Computer Society, (2006)

Automatic FFT Performance Tuning on OpenCL GPUs. ICPADS, pages 228-235. IEEE Computer Society, (2011)

Efficient parallel optimizations of a high-performance SIFT on GPUs. J. Parallel Distrib. Comput., (2019)

Research on Mahalanobis Distance Algorithm Optimization Based on OpenCL. HPCC/CSS/ICESS, pages 84-91. IEEE, (2014)

Optimized Password Recovery for Encrypted RAR on GPUs. HPCC/CSS/ICESS, pages 591-598. IEEE, (2015)

Parallel Processing Systems for Big Data: A Survey. Proceedings of the IEEE, 104 (11): 2114-2136 (2016)

GPURoofline: A Model for Guiding Performance Optimizations on GPUs. Euro-Par, volume 7484 of Lecture Notes in Computer Science, pages 920-932. Springer, (2012)

DropPruning for Model Compression. CoRR, (2018)