Simulation of Large PV Plants Using a Continuous Radiance Distribution Model and Cell-Resolution Mismatch Calculation. European Photovoltaic Solar Energy Conference and Exhibition, 1311--1316, 2020. [PUMA: Irradiance Mismatch Performance Radiance distribution losses modeling transposition]
Container orchestration on HPC systems through Kubernetes. Journal of Cloud Computing, (10)1:1--14, SpringerOpen, 2021. [PUMA: Cloud Container HPC Kubernetes Singularity TORQUE computing manager orchestration workload]
Usage Experiences of Performance Tools for Modern C $$$$ Code Analysis and Optimization. Tools for High Performance Computing 2018/2019, 103--121, Springer, 2021. [PUMA: C++ HPC Optimization Performance analysis]
Collectives in hybrid MPI+ MPI code: Design, practice and performance. Parallel Computing, 102669, Elsevier, 2020. [PUMA: MPI collective communications hybrid myown programming] URL
MPI Collectives for Multi-core Clusters: Optimized Performance of the Hybrid MPI+ MPI Parallel Codes. Proceedings of the 48th International Conference on Parallel Processing: Workshops, 1--10, ACM, 2019. [PUMA: collective communication mpi myown shared-memory]
Asynchronous Progress Design for a MPI-Based PGAS One-Sided Communication System.. ICPADS, 999–1006, IEEE, 2016. [PUMA: DART MPI asynchronous data-locality myown one-sided overlap progress] URL
Towards Performance Portability through Locality-Awareness for Applications Using One-Sided Communication Primitives.. CANDAR, 536–542, IEEE, 2016. [PUMA: DART-MPI application myown porting] URL
Application Productivity and Performance Evaluation of Transparent Locality-aware One-sided Communication Primitives. In Egawa Ryusuke (Eds.), International Journal of Networking and Computing, (7)2:136--153, 2017. [PUMA: DART DART-MPI Locality-awareness MPI blocking myown] URL
Impact of Late-Arrivals on MPI Collective Operations. INFOCOMP 2015, 2015. [PUMA: MPI benchmarking hlrs late-arrivals myown]
A Bandwidth-saving Optimization for MPI Broadcast Collective Operation. Proceedings of the International Conference on Parallel Processing Workshops, ICPPW, September 2015. [PUMA: Bandwidth Broadcast Long MPI Optimization hlrs message myown]
DART-MPI: An MPI-based Implementation of a PGAS Runtime System.. In Allen D. Malony, and Jeff R. Hammond (Eds.), PGAS, 3:1–3:11, ACM, 2014. [PUMA: MPI PGAS Portability RMA hlrs myown] URL
Leveraging MPI-3 Shared-Memory Extensions for Efficient PGAS Runtime Systems.. In Jesper Larsson Träff, Sascha Hunold, and Francesco Versaci (Eds.), Euro-Par, (9233):373–384, Springer, 2015. [PUMA: MPI-3 PGAS RMA hlrs myown shared-memory] URL
DASH: Data Structures and Algorithms with Support for Hierarchical Locality. Euro-Par 2014: Parallel Processing Workshops - Euro-Par 2014 International Workshops, Porto, Portugal, August 25-26, 2014, Revised Selected Papers, Part II, 542–552, 2014. [PUMA: dash hlrs myown]