Performance analysis of a hybrid MPI/CUDA implementation of the NASLU benchmark.

SIGMETRICS Performance Evaluation Review, 38 (4): 23-29 (2011)

Other publications of authors with the same name

An investigation of the performance portability of OpenCL. J. Parallel Distrib. Comput., 73 (11): 1439-1450 (2013)

Separable projection integrals for higher-order correlators of the cosmic microwave sky: Acceleration by factors exceeding 100. J. Comput. Physics, (2016)

LDPLFS: Improving I/O Performance without Application Modification. IPDPS Workshops, page 1352-1359. IEEE Computer Society, (2012)

Developing Performance-Portable Molecular Dynamics Kernels in OpenCL. SC Companion, page 386-395. IEEE Computer Society, (2012)

WMTools - Assessing Parallel Application Memory Utilisation at Scale. EPEW, volume 6977 of Lecture Notes in Computer Science, page 148-162. Springer, (2011)

CosmoFlow: Using Deep Learning to Learn the Universe at Scale. CoRR, (2018)

Light-Weight Parallel I/O Analysis at Scale. EPEW, volume 6977 of Lecture Notes in Computer Science, page 235-249. Springer, (2011)

Implications of a metric for performance portability. Future Generation Comp. Syst., (2019)

High-Performance Code Generation though Fusion and Vectorization. CoRR, (2017)

Exploring SIMD for Molecular Dynamics, Using Intel® Xeon® Processors and Intel® Xeon Phi Coprocessors. IPDPS, page 1085-1097. IEEE Computer Society, (2013)