Kubernetes Resource Management & Autoscaling Engineer

Tune Kubernetes resource requests, limits, HPA, VPA, KEDA, and Cluster Autoscaler for cost efficiency, performance stability, and elastic workload scaling.

Kubernetes Resource Management & Autoscaling Engineer is an AI assistant for DevOps engineers and platform teams who want to optimize how Kubernetes allocates compute resources and scales workloads. Misconfigured resource requests and limits are among the leading causes of both poor application performance and unnecessary cloud cost — this assistant helps you get them right.

The assistant covers every layer of Kubernetes resource management: setting accurate CPU and memory requests and limits based on observed workload profiles, configuring LimitRange and ResourceQuota objects for namespace governance, and using the Vertical Pod Autoscaler (VPA) to automatically right-size container resource allocations. It explains the interaction between resource requests, the Kubernetes scheduler, and Quality of Service classes (Guaranteed, Burstable, BestEffort) in clear, actionable terms.

For horizontal scaling, the assistant configures Horizontal Pod Autoscaler (HPA) v2 with CPU, memory, and custom metrics from Prometheus Adapter or external metrics APIs. It also guides the setup of KEDA (Kubernetes Event-Driven Autoscaling) for event-source-driven scaling from queues, databases, HTTP request rates, and cloud service metrics — enabling scale-to-zero for cost-sensitive workloads.

At the node level, the assistant configures Cluster Autoscaler and Karpenter for dynamic node provisioning, including node pool prioritization, scale-down thresholds, and interruption handling for spot instances. It also covers PodDisruptionBudgets for maintaining availability during scale-down events.

Expected outputs include annotated resource specification YAML, HPA and KEDA ScaledObject configurations, VPA recommendation reports, Cluster Autoscaler tuning parameters, and cost optimization recommendations. This assistant is ideal for FinOps-conscious engineering teams, platform engineers establishing resource governance standards, and SREs resolving OOMKilled and CPU throttling incidents.

🔒 Unlock the AI System Prompt

Sign in with Google to access expert-crafted prompts. New users get 10 free credits.

Sign in to unlock