Performance modeling and optimization of parallel out-of-core tensor contractions

Document Type

Conference Proceeding

Publication Date

12-1-2005

Abstract

The Tensor Contraction Engine (TCE) is a domain-specific compiler for implementing complex tensor contraction expressions that arise in quantum chemistry applications modeling electronic structure. This paper develops a performance model for tensor contractions that accounts for both disk I/O and inter-processor communication costs, in order to facilitate performance-model-driven loop optimization for this domain. Experimental results are provided that demonstrate the accuracy and effectiveness of the model. Copyright 2005 ACM.

Publication Source (Journal or Book title)

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP

First Page

266

Last Page

276
