Author of the publication

Acceleration of bulk memory operations in a heterogeneous multicore architecture.

, , , , , , , and . PACT, page 423-424. ACM, (2012)

Other publications of authors with the same name

Optimizing GPU Register Usage: Extensions to OpenACC and Compiler Optimizations., , , , and . ICPP, page 572-581. IEEE Computer Society, (2016)

The OpenACC data model: Preliminary study on its major challenges and implementations., , , , , , and . Parallel Computing, (2018)

Multi-GPU Support on Single Node Using Directive-Based Programming Model., , , and . Scientific Programming, (2015)

Acceleration of bulk memory operations in a heterogeneous multicore architecture., , , , , , , and . PACT, page 423-424. ACM, (2012)

Compiling a High-Level Directive-Based Programming Model for GPGPUs., , , , , and . LCPC, volume 8664 of Lecture Notes in Computer Science, page 105-120. Springer, (2013)

Assessing One-to-One Parallelism Levels Mapping for OpenMP Offloading to GPUs., , , and . PMAM@PPoPP, page 68-73. ACM, (2017)

An Analytical Model-Based Auto-tuning Framework for Locality-Aware Loop Scheduling., , , and . ISC, volume 9697 of Lecture Notes in Computer Science, page 3-20. Springer, (2016)

Implementing the OpenACC Data Model., , , , , , and . IPDPS Workshops, page 662-672. IEEE Computer Society, (2017)

NAS Parallel Benchmarks for GPGPUs Using a Directive-Based Programming Model., , , , and . LCPC, volume 8967 of Lecture Notes in Computer Science, page 67-81. Springer, (2014)

Performance and Power Characteristics of Matrix Multiplication Algorithms on Multicore and Shared Memory Machines., , , , and . SC Companion, page 626-632. IEEE Computer Society, (2012)