PUMA publications for /user/amerwafai/cell%20performance%20intel%20parallelization%20memory%20shorthttps://puma.ub.uni-stuttgart.de/user/amerwafai/cell%20performance%20intel%20parallelization%20memory%20shortPUMA RSS feed for /user/amerwafai/cell%20performance%20intel%20parallelization%20memory%20short2024-03-29T22:18:16+01:00Optimized Force Calculation of Molecular Dynamics Simulations for the Intel Xeon Phihttps://puma.ub.uni-stuttgart.de/bibtex/2dccb0302d8b765b1450e86f4b2798951/amerwafaiamerwafai2016-01-29T09:34:55+01:00HLRS SCOPE absolute and calculation cell colored core distributed dynamic force gather intel interaction law linked memory molecular myown newton operation optimization parallel parallelization performance phi processing range scatter shared short site third xeon yellow <span data-person-type="author" class="authorEditorList "><span><span itemtype="http://schema.org/Person" itemscope="itemscope" itemprop="author"><a title="Nikola Tchipev" itemprop="url" href="/person/1902fd5f88d25cf9f9a271091a6bbd41a/author/0"><span itemprop="name">N. Tchipev</span></a></span>, </span><span><span itemtype="http://schema.org/Person" itemscope="itemscope" itemprop="author"><a title="Amer Wafai" itemprop="url" href="/person/1902fd5f88d25cf9f9a271091a6bbd41a/author/1"><span itemprop="name">A. Wafai</span></a></span>, </span><span><span itemtype="http://schema.org/Person" itemscope="itemscope" itemprop="author"><a title="Colin W. Glass" itemprop="url" href="/person/1902fd5f88d25cf9f9a271091a6bbd41a/author/2"><span itemprop="name">C. Glass</span></a></span>, </span><span><span itemtype="http://schema.org/Person" itemscope="itemscope" itemprop="author"><a title="Wolfgang Eckhardt" itemprop="url" href="/person/1902fd5f88d25cf9f9a271091a6bbd41a/author/3"><span itemprop="name">W. Eckhardt</span></a></span>, </span><span><span itemtype="http://schema.org/Person" itemscope="itemscope" itemprop="author"><a title="Alexander Heinecke" itemprop="url" href="/person/1902fd5f88d25cf9f9a271091a6bbd41a/author/4"><span itemprop="name">A. Heinecke</span></a></span>, </span><span><span itemtype="http://schema.org/Person" itemscope="itemscope" itemprop="author"><a title="Hans-Joachim Bungartz" itemprop="url" href="/person/1902fd5f88d25cf9f9a271091a6bbd41a/author/5"><span itemprop="name">H. Bungartz</span></a></span>, </span> and <span><span itemtype="http://schema.org/Person" itemscope="itemscope" itemprop="author"><a title="Philipp Neumann" itemprop="url" href="/person/1902fd5f88d25cf9f9a271091a6bbd41a/author/6"><span itemprop="name">P. Neumann</span></a></span></span>. </span><span class="additional-entrytype-information">(<em><span>August 2015<meta content="August 2015" itemprop="datePublished"/></span></em>)</span>Fri Jan 29 09:34:55 CET 2016Vienna, AustriaEuro-Par 2015: Parallel ProcessingAugustLNCSOptimized Force Calculation of Molecular Dynamics Simulations for the Intel Xeon Phi92332015HLRS SCOPE absolute and calculation cell colored core distributed dynamic force gather intel interaction law linked memory molecular myown newton operation optimization parallel parallelization performance phi processing range scatter shared short site third xeon yellow We provide details on the shared-memory parallelization for manycore architectures of the molecular dynamics framework ls1-mardyn, including an optimization of the SIMD vectorization for multi-centered molecules. The novel shared-memory parallelization scheme allows to re- tain Newton's third law optimization and exhibits very good scaling on many-core devices such as a full Xeon Phi card running 240 threads. The Xeon Phi can thus be exploited and delivers comparable performance as IvyBridge nodes in our experiments.