LSU Master's Theses

Resampling Methods and Visualization Tools for Computer Performance Comparisons in the Presence of Performance Variation

Samuel Oridge Irving, Louisiana State University and Agricultural and Mechanical CollegeFollow

Semester of Graduation

Spring 2018

Degree

Master of Electrical Engineering (MEE)

Department

ECE

Document Type

Thesis

Abstract

Performance variability, stemming from non-deterministic hardware and software behaviors or deterministic behaviors such as measurement bias, is a well-known phenomenon of computer systems which increases the difficulty of comparing computer performance metrics and is slated to become even more of a concern as interest in Big Data Analytics increases. Conventional methods use various measures (such as geometric mean) to quantify the performance of different benchmarks to compare computers without considering this variability which may lead to wrong conclusions. In this paper, we propose three resampling methods for performance evaluation and comparison: a randomization test for a general performance comparison between two computers, bootstrapping confidence estimation, and an empirical distribution and five-number-summary for performance evaluation. The results show that for both PARSEC and high-variance BigDataBench benchmarks: 1) the randomization test substantially improves our chance to identify the difference between performance comparisons when the difference is not large; 2) bootstrapping confidence estimation provides an accurate confidence interval for the performance comparison measure (e.g. ratio of geometric means); and 3) when the difference is very small, a single test is often not enough to reveal the nature of the computer performance due to the variability of computer systems. We further propose using empirical distribution to evaluate computer performance and a five-number-summary to summarize computer performance. We use published SPEC 2006 results to investigate the sources of performance variation by predicting performance and relative variation for 8,236 machines. We achieve a correlation of predicted performances of 0.992 and a correlation of predicted and measured relative variation of 0.5. Finally, we propose the utilization of a novel Biplotting technique to visualize the effectiveness of benchmarks and cluster machines by behavior. We illustrate the results and conclusion through detailed Monte Carlo simulation studies and real examples.

Date

4-5-2018

Recommended Citation

Irving, Samuel Oridge, "Resampling Methods and Visualization Tools for Computer Performance Comparisons in the Presence of Performance Variation" (2018). LSU Master's Theses. 4702.
https://repository.lsu.edu/gradschool_theses/4702

Committee Chair

Dr. Lu Peng

DOI

10.31390/gradschool_theses.4702

Download

Included in

Computer Engineering Commons

COinS

LSU Master's Theses

Resampling Methods and Visualization Tools for Computer Performance Comparisons in the Presence of Performance Variation

Semester of Graduation

Degree

Department

Document Type

Abstract

Date

Recommended Citation

Committee Chair

DOI

Included in

Search

Browse

Author Corner

SPONSORED BY

LSU Master's Theses

Resampling Methods and Visualization Tools for Computer Performance Comparisons in the Presence of Performance Variation

Author

Semester of Graduation

Degree

Department

Document Type

Abstract

Date

Recommended Citation

Committee Chair

DOI

Included in

Share

Search

Browse

Author Corner

SPONSORED BY