The horizontal pod autoscaling (HPA) provides scalability for cloud services composed of microservice architecture in cloud-native computing. However, the efficiency of the service is reduced and scaling is delayed due to the dynamic load occurs by va...
The horizontal pod autoscaling (HPA) provides scalability for cloud services composed of microservice architecture in cloud-native computing. However, the efficiency of the service is reduced and scaling is delayed due to the dynamic load occurs by various workload patterns. In addition, which is difficult to estimate the size of efficient resources for workloads composed of data of various sizes, resulting in resource waste and overloads. Therefore, this study proposes the efficient container resource management (ECoRM) that stably and elastically manages the container resources to ensure the scalability and efficiency of the service even under rapidly changing dynamic loads. The ECoRM forecasts future workloads by utilizing the sample convolutional and interaction network (SCINet) model applied with the reversible instance normalization (RevIN) method. The adaptive request generator in ECoRM generates a resource request composed of elastic size through the forecasted CPU and memory usage. Hybrid pod autoscaler scaling the resource request of pod to efficient resource sizes through ECoRM's VPA (ECo-VPA) and ECoRM's HPA (ECo-HPA), and immediately scaling the number of replicas. When the performance of ECoRM was evaluated, even if the size of resources for various workloads was incorrectly estimated, average resource utilization improved by about 17∼59% compared to the conventional HPA, and the number of overloads was measured lower than that of conventional HPA.