Auto scaling and Metrics Server

October 01, 2025

Benifits of Autoscaling:

High/Improved availability of the application
Elasticity
Bettter resource utilization
Seamless load management

There are 2 types of Auto scaling

Vertical scaling: Increasing the capacity of the same single system
Horizontal scaling: Increasing number of instances/servers/pods

In the world of devops, horizontal scaling is best

HPA: Horizontal POD Autoscaling --> Used to scale up/down no of POD replicas based on observed metris (CPU or memory utilization)

It observes all the required metrics, based on that it will add the PODS
Tracks multiple metrics, accordingly it will adjust the PODS
HPA will interact with Metric server to identify CPU/Memory utilization

VPA: Vertical POD Autoscaling

Metric server is an application that collects metrics from PODS, nodes according to state of CPU and Memory. Metric server will not be present by default in the K8S Cluster

The Metrics Server is a scalable, efficient source of container resource metrics for Kubernetes built-in autoscaling pipelines. It collects CPU and memory usage from nodes and pods.

Search This Blog

Vishnu_VVV

Auto scaling and Metrics Server

Comments

Post a Comment

Popular posts from this blog

Master and Slave Architecture in Jenkins

Monolithic Architecture

Load Balancers and Load Balancers Algorithms