Improving the Parallel Performance of an NBody Application Using Adaptive Techniques in HPX

Document Type

Conference Proceeding

Publication Date

7-2-2017

Abstract

The overheads of manually tuning a loop's parameters, such as chunk size, might prevent an application from reaching its maximum parallel performance. In this paper, we address this challenge by implementing a multinomial logistic regression model on an HPX loop. We present a framework that captures both the static and dynamic information of the runtime environment and feeds this information to a learning model which assigns an efficient loop chunk size automatically. Our evaluated execution results show that the proposed technique improves the performance of an NBody application by an average about 33%, 17%, and 19% compared to using existing HPX auto-parallelization tools when run with problem sizes of 105, 106, and 107 particles respectively.

Publication Source (Journal or Book title)

Proceedings - 2017 IEEE 19th Intl Conference on High Performance Computing and Communications, HPCC 2017, 2017 IEEE 15th Intl Conference on Smart City, SmartCity 2017 and 2017 IEEE 3rd Intl Conference on Data Science and Systems, DSS 2017

First Page

621

Last Page

622

This document is currently not available here.

Share

COinS