Author of the publication

A Customizable Matrix Multiplication Framework for the Intel HARPv2 Xeon+FPGA Platform: A Deep Learning Case Study.

, , , , , , , , , and . FPGA, page 107-116. ACM, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Performance and power optimization through data compression in Network-on-Chip architectures., , , , , , , and . HPCA, page 215-225. IEEE Computer Society, (2008)ScalCore: Designing a core for voltage scalability., , , , , and . HPCA, page 681-693. IEEE Computer Society, (2016)WRPN: Wide Reduced-Precision Networks., , , and . CoRR, (2017)Application-aware prefetch prioritization in on-chip networks., , , , , and . PACT, page 441-442. ACM, (2012)In-Package Domain-Specific ASICs for Intel® Stratix® 10 FPGAs: A Case Study of Accelerating Deep Learning Using TensorTile ASIC., , , , , , , , , and 1 other author(s). FPL, page 106-110. IEEE Computer Society, (2018)Exploration of Low Numeric Precision Deep Learning Inference Using Intel® FPGAs., , , , , and . FCCM, page 73-80. IEEE Computer Society, (2018)A case for integrated processor-cache partitioning in chip multiprocessors., , , , and . SC, ACM, (2009)A heterogeneous multiple network-on-chip design: an application-aware approach., , and . DAC, page 36:1-36:10. ACM, (2013)Accelerating recurrent neural networks in analytics servers: Comparison of FPGA, CPU, GPU, and ASIC., , , , , and . FPL, page 1-4. IEEE, (2016)Towards characterizing cloud backend workloads: insights from Google compute clusters., , , and . SIGMETRICS Performance Evaluation Review, 37 (4): 34-41 (2010)