Improving the Parallel Performance of an NBody Application Using Adaptive Techniques in HPX
Document Type
Conference Proceeding
Publication Date
7-2-2017
Abstract
The overheads of manually tuning a loop's parameters, such as chunk size, might prevent an application from reaching its maximum parallel performance. In this paper, we address this challenge by implementing a multinomial logistic regression model on an HPX loop. We present a framework that captures both the static and dynamic information of the runtime environment and feeds this information to a learning model which assigns an efficient loop chunk size automatically. Our evaluated execution results show that the proposed technique improves the performance of an NBody application by an average about 33%, 17%, and 19% compared to using existing HPX auto-parallelization tools when run with problem sizes of 105, 106, and 107 particles respectively.
Publication Source (Journal or Book title)
Proceedings - 2017 IEEE 19th Intl Conference on High Performance Computing and Communications, HPCC 2017, 2017 IEEE 15th Intl Conference on Smart City, SmartCity 2017 and 2017 IEEE 3rd Intl Conference on Data Science and Systems, DSS 2017
First Page
621
Last Page
622
Recommended Citation
Khatami, Z., Kaiser, H., & Ramanujam, J. (2017). Improving the Parallel Performance of an NBody Application Using Adaptive Techniques in HPX. Proceedings - 2017 IEEE 19th Intl Conference on High Performance Computing and Communications, HPCC 2017, 2017 IEEE 15th Intl Conference on Smart City, SmartCity 2017 and 2017 IEEE 3rd Intl Conference on Data Science and Systems, DSS 2017, 2018-January, 621-622. https://doi.org/10.1109/HPCC-SmartCity-DSS.2017.84