Automatic Task-Based Code Generation for High Performance Domain Specific Embedded Language

Document Type

Article

Publication Date

6-1-2016

Abstract

Providing high level tools for parallel programming while sustaining a high level of performance has been a challenge that techniques like Domain Specific Embedded Languages try to solve. In previous works, we investigated the design of such a DSEL—NT (Formula presented.) —providing a Matlab -like syntax for parallel numerical computations inside a C++ library. In this paper, we show how NT (Formula presented.) has been redesigned for shared memory systems in an extensible and portable way. The new NT (Formula presented.) design relies on a tiered Parallel Skeleton system built using asynchronous task management and automatic compile-time taskification of user level code. We describe how this system can operate various shared memory runtimes and evaluate the design by using two benchmarks implementing linear algebra algorithms.

Publication Source (Journal or Book title)

International Journal of Parallel Programming

First Page

449

Last Page

465

This document is currently not available here.

Share

COinS