Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

MPI Collectives for Multi-core Clusters: Optimized Performance of the Hybrid MPI+ MPI Parallel Codes

H. Zhou, J. Gracia, und R. Schneider. Proceedings of the 48th International Conference on Parallel Processing: Workshops, Seite 18:1-18:10. New York, NY, USA, ACM, (2019)
DOI: 10.1145/3339186.3339199

Zusammenfassung

The advent of multi-/many-core processors in clusters advocates hybrid parallel programming, which combines Message Passing Interface (MPI) for inter-node parallelism with a shared memory model for on-node parallelism. Compared to the traditional hybrid approach of MPI plus OpenMP, a new, but promising hybrid approach of MPI plus MPI-3 shared-memory extensions (MPI+MPI) is gaining attraction. We describe an algorithmic approach for collective operations (with allgather and broadcast as concrete examples) in the context of hybrid MPI+MPI, so as to minimize memory consumption and memory copies. With this approach, only one memory copy is maintained and shared by on-node processes. This allows the removal of unnecessary on-node copies of replicated data that are required between MPI processes when the collectives are invoked in the context of pure MPI. We compare our approach of collectives for hybrid MPI+MPI and the traditional one for pure MPI, and also have a discussion on the synchronization that is required to guarantee data integrity. The performance of our approach has been validated on a Cray XC40 system (Cray MPI) and NEC cluster (Open MPI), showing that it achieves comparable or better performance for allgather operations. We have further validated our approach with a standard computational kernel, namely distributed matrix multiplication, and a Bayesian Probabilistic Matrix Factorization code.

Links und Ressourcen

BibTeX-Schlüssel: 10.1145/3339186.3339199
Eintragstyp: inproceedings
Adresse: New York, NY, USA
Buchtitel: Proceedings of the 48th International Conference on Parallel Processing: Workshops
Jahr: 2019
Seiten: 18:1-18:10
Verlag: ACM
Reihe: ICPP 2019
isbn: 9781450371964
numpages: 10
articleno: 18
location: Kyoto, Japan
DOI: 10.1145/3339186.3339199
URL: https://doi.org/10.1145/3339186.3339199

@ralfschneiders Tags hervorgehoben

Zitieren Sie diese Publikation

@inproceedings{10.1145/3339186.3339199, abstract = {The advent of multi-/many-core processors in clusters advocates hybrid parallel programming, which combines Message Passing Interface (MPI) for inter-node parallelism with a shared memory model for on-node parallelism. Compared to the traditional hybrid approach of MPI plus OpenMP, a new, but promising hybrid approach of MPI plus MPI-3 shared-memory extensions (MPI+MPI) is gaining attraction. We describe an algorithmic approach for collective operations (with allgather and broadcast as concrete examples) in the context of hybrid MPI+MPI, so as to minimize memory consumption and memory copies. With this approach, only one memory copy is maintained and shared by on-node processes. This allows the removal of unnecessary on-node copies of replicated data that are required between MPI processes when the collectives are invoked in the context of pure MPI. We compare our approach of collectives for hybrid MPI+MPI and the traditional one for pure MPI, and also have a discussion on the synchronization that is required to guarantee data integrity. The performance of our approach has been validated on a Cray XC40 system (Cray MPI) and NEC cluster (Open MPI), showing that it achieves comparable or better performance for allgather operations. We have further validated our approach with a standard computational kernel, namely distributed matrix multiplication, and a Bayesian Probabilistic Matrix Factorization code.}, added-at = {2021-09-28T11:26:33.000+0200}, address = {New York, NY, USA}, articleno = {18}, author = {Zhou, Huan and Gracia, Jos\'{e} and Schneider, Ralf}, biburl = {https://puma.ub.uni-stuttgart.de/bibtex/212bbebb35041c50aeb84aec3177a1311/ralfschneider}, booktitle = {Proceedings of the 48th International Conference on Parallel Processing: Workshops}, doi = {10.1145/3339186.3339199}, interhash = {942d7e4eea20dfc15b05fa5663d95a48}, intrahash = {12bbebb35041c50aeb84aec3177a1311}, isbn = {9781450371964}, keywords = {MPI collective communication hybrid memory model myown programming shared}, location = {Kyoto, Japan}, numpages = {10}, pages = {18:1-18:10}, publisher = {ACM}, series = {ICPP 2019}, timestamp = {2021-09-28T09:26:33.000+0200}, title = {MPI Collectives for Multi-core Clusters: Optimized Performance of the Hybrid MPI+ MPI Parallel Codes}, url = {https://doi.org/10.1145/3339186.3339199}, year = 2019 }

PUMA

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

MPI Collectives for Multi-core Clusters: Optimized Performance of the Hybrid MPI+ MPI Parallel Codes

Zusammenfassung

Links und Ressourcen

Tags

Community

Zitieren Sie diese Publikation

Mehr Zitationsstile

Suchen auf

Metadaten

Kommentare und Rezensionen
(0)

PUMA

KopierenLöschenDiese Publikation zur Ablage hinzufügenCommunity-EintragVersionsverlauf dieses EintragsURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML MPI Collectives for Multi-core Clusters: Optimized Performance of the Hybrid MPI+ MPI Parallel Codes

Zusammenfassung

Links und Ressourcen

Tags

Community

Zitieren Sie diese Publikation

Mehr Zitationsstile

Suchen auf

Metadaten

Kommentare und Rezensionen (0)

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

MPI Collectives for Multi-core Clusters: Optimized Performance of the Hybrid MPI+ MPI Parallel Codes

Kommentare und Rezensionen
(0)