A matrix-based approach to the global locality optimization problem
Document Type
Conference Proceeding
Publication Date
1-1-1998
Abstract
Global locality analysis is a technique for improving the cache performance of a sequence of loop nests through a combination of loop and data layout optimizations. Pure loop transformations are restricted by data dependences and may not be very successful in optimizing imperfectly nested loops; the impact of a data transformation on an array might be program-wide. Therefore, in this paper we argue for a combined approach which employs both loop and data transformations. The method enjoys the advantages of the most of the previous techniques for enhancing locality and is efficient. In our approach, the loop nests are processed one by one and the data layout constraints obtained from one nest are propagated for optimization of the remaining loop nests. We show that this process can be put in a simple matrix framework which can be manipulated by an optimizing compiler. The search space that we consider for possible loop transformations comprises general non-singular linear transformation matrices and the data layouts that we consider are those which can be expressed using hyperplanes. Experiments using several programs on an SGI Origin 2000 distributed-shared-memory machine demonstrate the efficacy of our approach.
Publication Source (Journal or Book title)
Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT
First Page
306
Last Page
313
Recommended Citation
Kandemir, M., Choudhary, A., Ramanujam, J., & Banerjee, P. (1998). A matrix-based approach to the global locality optimization problem. Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT, 306-313. https://doi.org/10.1109/PACT.1998.727266