Minghua Chen, CUHK, Hong Kong

Title: Energy Efficient Dynamic Provisioning in Data Centers: The Benefit of Seeing the Future

Abstract:
Energy consumption represents a significant cost in data center operation. In 2010, data centers world-wide consumed 240 billion kWh electricity, enough to power 5 Hong Kong or roughly the entire Spain. However, real-world statistics reveals that a large fraction of the energy is used to power idle servers when the workload is low. Dynamic provisioning techniques aim at saving this portion of the energy, by turning off unnecessary servers. In dynamic provisioning, it is a common approach to predict future workload to certain extent and exploit the information to achieve good performance. This naturally leads to the following fundamental questions:
- Can we design solutions that require zero future workload information, called online solutions, yet still achieve close-to-optimal performance?
- Can we characterize the benefit of knowing future workload information in dynamic provisioning?

In this work, we seek answers to the above questions. In particular, we develop online dynamic provisioning solutions with and without future workload information available. We first reveal an elegant structure of the off-line dynamic provisioning problem, which allows us to characterize the optimal solution in a “divide-and-conquer” manner. We then exploit this insight to design two online algorithms with competitive ratios 2- a and e/(e-1+ a), respectively, where 0 ≤ a ≤ 1 is the normalized size of a look-ahead window in which future workload information is available. A fundamental observation is that future workload information beyond the full-size look-ahead window (corresponding to a = 1) will not improve dynamic provisioning performance. Our algorithms are decentralized and easy to implement. We demonstrate up to 71% of energy saving in a case study using real-world traces.

The approach and perspective outlined in this work have found applications in various problem domains including power systems and cloud computing scheduling.