FAUN — Developer Community 🐾

We help developers learn and grow by keeping them up with what matters. 👉 www.faun.dev

Follow publication

Member-only story

Autoscaling in Kubernetes: Understanding Vertical Pod Autoscaler (VPA)

Historically, companies used to scale their servers and allocate resources according to predictable usage patterns on fixed daily, weekly, or yearly cycles. For example, administrators would scale up the office’s internal systems during business hours, or a retailer might increase server capacity during holiday sales spikes. However, in an era where a sudden viral feed, post on TikTok, or breaking news can create immediate and unpredictable changes in demand, these usage patterns have become less predictable, making manual responsiveness obsolete.

To address this issue and avoid poor performance or wasted costs caused by under or over-provisioning of servers, cloud service providers and Kubernetes created functionality to automatically scale servers up or down according to real-world metrics like load and traffic.

In Kubernetes, we have three different autoscaling resources:

  • Vertical Pod Autoscaler (VPA)
  • Horizontal Pod Autoscaler (HPA)
  • Cluster Autoscaler (CA)

Each autoscaler targets a different layer and need within the cluster. In this series of articles, I will explore each one of them in more depth, starting with the Vertical Pod Autoscaler.

CPU and Memory Limits in…

Create an account to read the full story.

The author made this story available to Medium members only.
If you’re new to Medium, create a new account to read this story on us.

Or, continue in mobile web

Already have an account? Sign in

Published in FAUN — Developer Community 🐾

We help developers learn and grow by keeping them up with what matters. 👉 www.faun.dev

Written by Ivan (이반) Porta

Senior DevOps Engineer | Terraform Associate | Certified Argo Project Associate

No responses yet

Write a response