Publications

Martin Herrer\'ıas Azcué, Hugo Capdevila, Huan Zhou, and Annette Hammer. Simulation of Large PV Plants Using a Continuous Radiance Distribution Model and Cell-Resolution Mismatch Calculation. European Photovoltaic Solar Energy Conference and Exhibition, 1311--1316, 2020. [PUMA: Irradiance Mismatch Performance Radiance distribution losses modeling transposition]

Naweiluo Zhou, Yiannis Georgiou, Marcin Pospieszny, Li Zhong, Huan Zhou, Christoph Niethammer, Branislav Pejak, Oskar Marko, and Dennis Hoppe. Container orchestration on HPC systems through Kubernetes. Journal of Cloud Computing, (10)1:1--14, SpringerOpen, 2021. [PUMA: Cloud Container HPC Kubernetes Singularity TORQUE computing manager orchestration workload]

Huan Zhou, Christoph Niethammer, and Martin Herrerias Azcue. Usage Experiences of Performance Tools for Modern C $$$$ Code Analysis and Optimization. Tools for High Performance Computing 2018/2019, 103--121, Springer, 2021. [PUMA: C++ HPC Optimization Performance analysis]

Huan Zhou, José Gracia, Naweiluo Zhou, and Ralf Schneider. Collectives in hybrid MPI+ MPI code: Design, practice and performance. Parallel Computing, 102669, Elsevier, 2020. [PUMA: MPI collective communications hybrid myown programming] URL

Huan Zhou, José Gracia, and Ralf Schneider. MPI Collectives for Multi-core Clusters: Optimized Performance of the Hybrid MPI+ MPI Parallel Codes. Proceedings of the 48th International Conference on Parallel Processing: Workshops, 1--10, ACM, 2019. [PUMA: collective communication mpi myown shared-memory]

Huan Zhou, and José Gracia. Asynchronous Progress Design for a MPI-Based PGAS One-Sided Communication System.. ICPADS, 999–1006, IEEE, 2016. [PUMA: DART MPI asynchronous data-locality myown one-sided overlap progress] URL

Huan Zhou, and José Gracia. Towards Performance Portability through Locality-Awareness for Applications Using One-Sided Communication Primitives.. CANDAR, 536–542, IEEE, 2016. [PUMA: DART-MPI application myown porting] URL

Huan Zhou, and José Gracia. Application Productivity and Performance Evaluation of Transparent Locality-aware One-sided Communication Primitives. In Egawa Ryusuke (Eds.), International Journal of Networking and Computing, (7)2:136--153, 2017. [PUMA: DART DART-MPI Locality-awareness MPI blocking myown] URL

C. Niethammer, D. Khabi, H. Zhou, V. Marjanovic, and J. Gracia. Impact of Late-Arrivals on MPI Collective Operations. INFOCOMP 2015, 2015. [PUMA: MPI benchmarking hlrs late-arrivals myown]

H. Zhou, V. Marjanovic, C. Niethammer, and J. Gracia. A Bandwidth-saving Optimization for MPI Broadcast Collective Operation. Proceedings of the International Conference on Parallel Processing Workshops, ICPPW, September 2015. [PUMA: Bandwidth Broadcast Long MPI Optimization hlrs message myown]

Huan Zhou, Yousri Mhedheb, Kamran Idrees, Colin W. Glass, José Gracia, and Karl Fürlinger. DART-MPI: An MPI-based Implementation of a PGAS Runtime System.. In Allen D. Malony, and Jeff R. Hammond (Eds.), PGAS, 3:1–3:11, ACM, 2014. [PUMA: MPI PGAS Portability RMA hlrs myown] URL

Huan Zhou, Kamran Idrees, and José Gracia. Leveraging MPI-3 Shared-Memory Extensions for Efficient PGAS Runtime Systems.. In Jesper Larsson Träff, Sascha Hunold, and Francesco Versaci (Eds.), Euro-Par, (9233):373–384, Springer, 2015. [PUMA: MPI-3 PGAS RMA hlrs myown shared-memory] URL

Karl Fürlinger, Colin W. Glass, José Gracia, Andreas Knüpfer, Jie Tao, Denis Hünich, Kamran Idrees, Matthias Maiterth, Yousri Mhedheb, and Huan Zhou. DASH: Data Structures and Algorithms with Support for Hierarchical Locality. Euro-Par 2014: Parallel Processing Workshops - Euro-Par 2014 International Workshops, Porto, Portugal, August 25-26, 2014, Revised Selected Papers, Part II, 542–552, 2014. [PUMA: dash hlrs myown]