Skip to main content

3 docs tagged with "kubernetes"

View all tags

AKS Auto-Scaling β€” 3-Layer System

How to design auto-scaling for AI inference workloads on Azure Kubernetes Service that delivers SLA + cost efficiency simultaneously. Three independent layers work together β€” each reacts at a different speed.

Kubernetes: Pod vs Node

One of the most confusing concepts when starting with Kubernetes. The office building analogy clicked for me.