MOCL:An Efficient OpenCL Implementation for the Matrix-2000 Architecture

Zhang, Peng and Fang, Jianbin and Yang, Canqun and Tang, Tao and Huang, Chun and Wang, Zheng (2018) MOCL:An Efficient OpenCL Implementation for the Matrix-2000 Architecture. In: CF '18 Proceedings of the 15th ACM International Conference on Computing Frontiers. ACM, New York, pp. 26-35. ISBN 9781450357616

[thumbnail of CF18_paper_91]
Preview
PDF (CF18_paper_91)
CF18_paper_91.pdf - Accepted Version
Available under License Creative Commons Attribution-NonCommercial.

Download (1MB)

Abstract

This paper presents the design and implementation of an Open Computing Language (OpenCL) framework for the Matrix-2000 many-core architecture. This architecture is designed to replace the Intel XeonPhi accelerators of the TianHe-2 supercomputer. We share our experience and insights on how to design an effective OpenCL system for this new hardware accelerator. We propose a set of new analysis and optimizations to unlock the potential of the hardware. We extensively evaluate our approach using a wide range of OpenCL benchmarks on a single and multiple computing nodes. We present our design choices and provide guidance how to optimize code on the new Matrix-2000 architecture.

Item Type:
Contribution in Book/Report/Proceedings
Additional Information:
© Authors ACM, 2018. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in CF '18 Proceedings of the 15th ACM International Conference on Computing Frontiershttp://dx.doi.org/10.1145/3203217.3203244
ID Code:
123999
Deposited By:
Deposited On:
12 Mar 2018 10:14
Refereed?:
Yes
Published?:
Published
Last Modified:
17 Sep 2023 04:01