Automatic Task-Based Code Generation for High Performance Domain Specific Embedded Language
Document Type
Article
Publication Date
6-1-2016
Abstract
Providing high level tools for parallel programming while sustaining a high level of performance has been a challenge that techniques like Domain Specific Embedded Languages try to solve. In previous works, we investigated the design of such a DSEL—NT (Formula presented.) —providing a Matlab -like syntax for parallel numerical computations inside a C++ library. In this paper, we show how NT (Formula presented.) has been redesigned for shared memory systems in an extensible and portable way. The new NT (Formula presented.) design relies on a tiered Parallel Skeleton system built using asynchronous task management and automatic compile-time taskification of user level code. We describe how this system can operate various shared memory runtimes and evaluate the design by using two benchmarks implementing linear algebra algorithms.
Publication Source (Journal or Book title)
International Journal of Parallel Programming
First Page
449
Last Page
465
Recommended Citation
Tran Tan, A., Falcou, J., Etiemble, D., & Kaiser, H. (2016). Automatic Task-Based Code Generation for High Performance Domain Specific Embedded Language. International Journal of Parallel Programming, 44 (3), 449-465. https://doi.org/10.1007/s10766-015-0354-9