Fast, Light-weight, and Accurate Performance Evaluation using Representative Datacenter Behaviors

Jaewon Lee, Dongmoon Min, Ilkwon Byun, Hanhwi Jang, Jangwoo Kim

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Datacenters rapidly evolve by adopting new features such as new hardware deployment and software patches. Adopting a new feature requires an accurate evaluation of its impact to minimize the risk to the multi-million dollar computing infrastructure. However, a comprehensive performance analysis of a datacenter is extremely challenging due to its cost and multitenancy. Evaluating the performance in a live datacenter is accurate but prohibitive to prevent any damage to production services. Using conventional load-testing benchmarks on small-scale testbeds is imprecise as they do not consider the effect of other co-located jobs. In this paper, we propose FLARE, a fast, lightweight, and accurate performance evaluation method using representative datacenter behaviors. The key idea is to extract a small set of representative job colocation scenarios from all possible job colocations in a target datacenter. FLARE systematically characterizes and groups job colocations according to performance and resource metrics, providing high-level insights into the datacenter's behaviors. Then, it reconstructs the colocations on a testbed and allows accurate feature evaluation with load-testing benchmarks. We evaluate FLARE using an in-house datacenter and three features: cache sizing, DVFS, and SMT configurations. FLARE accurately estimates the impact of features with less than 1% errors by incurring 50× and 10× lower evaluation costs compared to full datacenter and sampling-based evaluation, respectively.

Original languageEnglish
Title of host publicationMiddleware 2023 - Proceedings of the 24th ACM/IFIP International Middleware Conference
PublisherAssociation for Computing Machinery, Inc
Pages220-233
Number of pages14
ISBN (Electronic)9798400701771
DOIs
Publication statusPublished - Nov 27 2023
Externally publishedYes
Event24th ACM/IFIP International Middleware Conference, Middleware 2023 - Bologna, Italy
Duration: Dec 11 2023Dec 15 2023

Publication series

NameMiddleware 2023 - Proceedings of the 24th ACM/IFIP International Middleware Conference

Conference

Conference24th ACM/IFIP International Middleware Conference, Middleware 2023
Country/TerritoryItaly
CityBologna
Period12/11/2312/15/23

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Software

Fingerprint

Dive into the research topics of 'Fast, Light-weight, and Accurate Performance Evaluation using Representative Datacenter Behaviors'. Together they form a unique fingerprint.

Cite this