Author of the publication

Performance analysis and optimization strategies for a D3Q19 lattice Boltzmann kernel on nVIDIA GPUs using CUDA.

Advances in Engineering Software, 42 (5): 266-272 (2011)


Other publications of authors with the same name

Benchmark Analysis and Application Results for Lattice Boltzmann Simulations on NEC SX Vector and Intel Nehalem Systems. Parallel Processing Letters, 19 (4): 491-511 (2009)

An Evaluation of Different I/O Techniques for Checkpoint/Restart. IPDPS Workshops, page 1708-1716. IEEE, (2013)

RZBENCH: Performance evaluation of current HPC architectures using low-level and application benchmarks. CoRR, (2007)

CRAFT: A Library for Easier Application-Level Checkpoint/Restart and Automatic Fault Tolerance. IEEE Trans. Parallel Distrib. Syst., 30 (3): 501-514 (2019)

Extreme Scale-out SuperMUC Phase 2 - lessons learned. PARCO, volume 27 of Advances in Parallel Computing, page 827-836. IOS Press, (2015)

Efficient Temporal Blocking for Stencil Computations by Multicore-Aware Wavefront Parallelization. COMPSAC (1), page 579-586. IEEE Computer Society, (2009)

CRAFT: A library for easier application-level Checkpoint/Restart and Automatic Fault Tolerance. CoRR, (2017)

Asynchronous MPI for the Masses. CoRR, (2013)

A Survey of Checkpoint/Restart Techniques on Distributed Memory Systems. Parallel Processing Letters, (2013)

Data Access Characteristics and Optimizations for Sun UltraSPARC T2 and T2+ Systems. Parallel Processing Letters, 18 (4): 471-490 (2008)