Auto scaling and Metrics Server
Benifits of Autoscaling:
- High/Improved availability of the application
- Elasticity
- Bettter resource utilization
- Seamless load management
There are 2 types of Auto scaling
- Vertical scaling: Increasing the capacity of the same single system
- Horizontal scaling: Increasing number of instances/servers/pods
In the world of devops, horizontal scaling is best
HPA: Horizontal POD Autoscaling --> Used to scale up/down no of POD replicas based on observed metris (CPU or memory utilization)
- It observes all the required metrics, based on that it will add the PODS
- Tracks multiple metrics, accordingly it will adjust the PODS
- HPA will interact with Metric server to identify CPU/Memory utilization
VPA: Vertical POD Autoscaling
Metric server is an application that collects metrics from PODS, nodes according to state of CPU and Memory. Metric server will not be present by default in the K8S Cluster
The Metrics Server is a scalable, efficient source of container resource metrics for Kubernetes built-in autoscaling pipelines. It collects CPU and memory usage from nodes and pods.
Comments
Post a Comment