An early prototype of an autonomic performance environment for exascale
Document Type
Conference Proceeding
Publication Date
7-16-2013
Abstract
Extreme-scale computing requires a new perspective on the role of performance observation in the Exascale system software stack. Because of the anticipated high concurrency and dynamic operation in these systems, it is no longer reasonable to expect that a post-mortem performance measurement and analysis methodology will suffice. Rather, there is a strong need for performance observation that merges first- and third-person observation, in situ analysis, and introspection across stack layers that serves online dynamic feedback and adaptation. In this paper we describe the DOE-funded XPRESS project and the role of autonomic performance support in Exascale systems. XPRESS will build an integrated Exascale software stack (called OpenX) that supports the ParalleX execution model and is targeted towards future Exascale platforms. An initial version of an autonomic performance environment called APEX has been developed for OpenX using the current TAU performance technology, and results are presented that highlight the challenges of highly integrative observation and runtime analysis. © 2013 ACM.
Publication Source (Journal or Book title)
Proceedings of the 3rd International Workshop on Runtime and Operating Systems for Supercomputers, ROSS 2013 - In Conjunction with ICS 2013
Recommended Citation
Huck, K., Shende, S., Malony, A., Kaiser, H., Porterfield, A., Fowler, R., & Brightwell, R. (2013). An early prototype of an autonomic performance environment for exascale. Proceedings of the 3rd International Workshop on Runtime and Operating Systems for Supercomputers, ROSS 2013 - In Conjunction with ICS 2013. https://doi.org/10.1145/2481425.2481434