Abstract
Accelerators, and other resource constrained systems, are increasingly being used in computer systems. Accelerators provide power efficient performance and often provide a shared memory model. However, it is a challenge to map feature rich APIs, such as OpenMP, to resource constrained systems. In this paper, we present a lightweight system where an accelerator can remotely execute library functions on a host processor. The implementation takes up 750 bytes but can replace arbitrary library calls leading to significant savings in memory foot print. We evaluate with a set of SPLASH-2 applications and show that the impact on execution time is negligible when compared to GCCs OpenMP implementation.
Original language | English |
---|---|
Title of host publication | Using and Improving OpenMP for Devices, Tasks, and More : Proceedings of the 10th International Workshop on OpenMP, IWOMP 2014 |
Editors | Luiz DeRose, Bronis R. de Supinski, Stephen L. Olivier, Barbara M. Chapman, Matthias S. Müller |
Publisher | Springer |
Publication date | 2014 |
Pages | 187-201 |
ISBN (Print) | 978-3-319-11453-8 |
ISBN (Electronic) | 978-3-319-11454-5 |
DOIs | |
Publication status | Published - 2014 |
Event | 10th International Workshop on OpenMP - Salvador, Brazil Duration: 28 Sept 2014 → 30 Sept 2014 Conference number: 10 http://www.iwomp.org/ |
Workshop
Workshop | 10th International Workshop on OpenMP |
---|---|
Number | 10 |
Country/Territory | Brazil |
City | Salvador |
Period | 28/09/2014 → 30/09/2014 |
Internet address |
Series | Lecture Notes in Computer Science |
---|---|
Number | 8766 |
ISSN | 0302-9743 |