A Service-Aware Autoscaling Strategy for Container Orchestration Platforms with Soft Resource Isolation
Paper in proceeding, 2023

Container orchestration platforms like Kubernetes (K8s) allow easy deployment and management of cloud native services. When deploying their services, service providers need to specify a proper amount of resources to K8s, so that the desired Quality of Service (QoS) to their users can be maintained. To cope with the varying traffic demand coming from users, they can rely on the K8s Horizontal Pod Autoscaling (HPA) mechanism. To ensure that enough resources are available when needed, the standard HPA mechanism relies on resource overprovisioning. In this way, the required QoS is achieved most of (or all) the time but at the expense of additional resources that are allocated (and charged for), while they may stay idle for significant periods of time. A way to reduce overprovisioning is provided by the soft resource isolation of K8s, which allows services to compensate for a temporary lack of resources with shared resources, i.e., idle resources of the machines where services are running. However, during traffic spikes, these idle resources may not be enough to serve the whole demand, degrading the QoS. The HPA, which is not aware of how much demand could not be served, is not always able to correctly estimate the required additional resources, further degrading the QoS. To overcome this, service providers need to leverage overprovisioning, limiting the use of shared resources. In this paper, we propose a novel mechanism for autoscaling resources in K8s that relies on service-related data to avoid the additional degradation introduced by the HPA. The proposed strategy also offers a way to tune overprovisioning and shared resources. Simulation results show that our approach can reduce idle resources by up to 60% compared with the HPA mechanism.

Cloud native services

Shared resources

QoS

service degradation

Pod as a Service

Pod autoscaling

Kubernetes

Author

Federico Tonini

Chalmers, Electrical Engineering, Communication, Antennas and Optical Networks

Carlos Natalino Da Silva

Chalmers, Electrical Engineering, Communication, Antennas and Optical Networks

Dagnachew A. Temesgene

Ericsson

Zere Ghebretensae

Ericsson

Lena Wosinska

Chalmers, Electrical Engineering, Communication, Antennas and Optical Networks

Paolo Monti

Chalmers, Electrical Engineering, Communication, Antennas and Optical Networks

2023 JOINT EUROPEAN CONFERENCE ON NETWORKS AND COMMUNICATIONS & 6G SUMMIT, EUCNC/6G SUMMIT

2475-6490 (ISSN) 2575-4912 (eISSN)

454-459
979-8-3503-1102-0 (ISBN)

Joint European Conference on Networks and Communications / 6G Summit (EuCNC/6G Summit)
Gothenburg, Sweden,

Subject Categories

Communication Systems

Computer Science

Computer Systems

DOI

10.1109/EUCNC/6GSUMMIT58263.2023.10188268

More information

Latest update

1/15/2024