An Empirical Study of Inter-cluster Resource Orchestration within Federated Cloud Clusters

Lindsay, Dominic and Yeung, Ging-Fung and Elkhatib, Yehia and Garraghan, Peter (2021) An Empirical Study of Inter-cluster Resource Orchestration within Federated Cloud Clusters. In: 2021 IEEE International Conference on Joint Cloud Computing (JCC). IEEE. ISBN 9781665434805

[img]
Text (An Empirical Study of Inter-cluster Resource Orchestration within Federated Cloud Clusters)
An_Empirical_Study_of_Inter_cluster_Resource_Orchestration_within_Federated_Cloud_Clusters.pdf - Accepted Version
Available under License Creative Commons Attribution-NonCommercial.

Download (741kB)

Abstract

Federated clusters are composed of multiple independent clusters of machines interconnected by a resource management system, and possess several advantages over centralized cloud datacenter clusters including seamless provisioning of applications across large geographic regions, greater fault tolerance, and increased cluster resource utilization. However, while existing resource management systems for federated clusters are capable of improving application intra-cluster performance, they do not capture inter-cluster performance in their decision making. This is important given federated clusters must execute a wide variety of applications possessing heterogeneous system architectures, which are a impacted by unique inter-cluster performance conditions such as network latency and localized cluster resource contention. In this work we present an empirical study demonstrating how inter-cluster performance conditions negatively impact federated cluster orchestration systems. We conduct a series of micro-benchmarks under various cluster operational scenarios showing the critical importance in capturing inter-cluster performance for resource orchestration in federated clusters. From this benchmark, we determine precise limitations in existing federated orchestration, and highlight key insights to design future orchestration systems. Findings of notable interest entail different application types exhibiting innate performance affinities across various federated cluster operational conditions, and experience substantial performance degradation from even minor increases to latency (8.7x) and resource contention (12.0x) in comparison to centralized cluster architectures.

Item Type:
Contribution in Book/Report/Proceedings
Additional Information:
©2021 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
ID Code:
158064
Deposited By:
Deposited On:
14 Oct 2021 15:00
Refereed?:
Yes
Published?:
Published
Last Modified:
22 Oct 2021 00:56