登录查看更多内容

Kubernetes - Horizontal Pod Autoscaling (HPA)

Kiran Kulkarni

Building applications | Specialised in cloud-based solutions

发布日期: 2025年2月17日

Horizontal Pod Autoscaling (HPA) automatically creates replicas in a deployment or replica set based on metrics like CPU utilization, memory usage etc based on the traffic at any given point of time and scales down when the traffic is less.

Components of HPA:

Metrics Server: Needed for resource metrics (CPU/Memory) from pods. This data is used by HPA to make scaling decisions.
HPA Resource: Defines the scaling policy (e.g., CPU utilization, Min/Max replicas).
Pods with Resource Limits: Pods must specify CPU/memory requests and limits in their configuration for HPA to calculate.

Lets us now implement HPA by following the below steps:

Step 1: Kind installation - you can refer this article for installation - https://www.dhirubhai.net/pulse/kubernetes-install-kind-create-multi-node-cluster-kiran-kulkarni-2ya8c

Step 2: Install the metrics API.

kubectl apply -f https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml

Step 3: Edit the Deployment metrics-server

kubectl -n kube-system edit deployment metrics-server

Add the security bypass to deployment under container.args

Note: This is only done for development or testing environment

- --kubelet-insecure-tls

Step 4: Restart the metrics server

kubectl -n kube-system rollout restart deployment metrics-server

kubectl get pods -n kube-system

Note: You can check that metrics-server-744898d85-g7pxf pod is created

领英推荐

Scaling Kubernetes Pods Automatically with the…

Christopher Adamson 1 年前

Strategies for Using VPA and HPA Together

Christopher Adamson 1 年前

?? Docker: Solving “It Works on My Machine”… or Did…

Sumit Kar 4 个月前

kubectl top nodes

Step 5: Create the Apache deployment apache-deployment.yaml

apiVersion: apps/v1
kind: Deployment
metadata:
  name: apache-deployment
spec:
  replicas: 1
  selector:
    matchLabels:
      app: apache
  template:
    metadata:
      labels:
        app: apache
    spec:
      containers:
      - name: apache
        image: httpd:2.4
        ports:
        - containerPort: 80
        resources:
          requests:
            cpu: 100m
          limits:
            cpu: 200m

kubectl apply -f apache-deployment.yml

Step 9: Create the HPA apache-hpa.yml

#apache-hpa.yaml
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: apache-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: apache-deployment
  minReplicas: 1
  maxReplicas: 5
  metrics:
  - type: Resource
    resource:
      name: cpu
      target:
        type: Utilization
        averageUtilization: 2

Step 10: Create the load generator and test

kubectl run -i --tty load-generator --rm --image=busybox --restart=Never -- /bin/sh -c "while true; do wget -q -O- https://apache-deployment.default.svc.cluster.local; done"

Step 11: Watch the automatic scaling of pods - Open this in another terminal to see the imapact on the server

kubectl get hpa php-apache -w

Step 12: Watch the automatic scaling down of pods

Note: Check it after 4-5 mins as it does not scales down immediately. This prevents Rapid scaling.

要查看或添加评论，请登录

Kiran Kulkarni的更多文章

Kubernetes - Configmaps and Secrets

2025年3月21日

Kubernetes - Configmaps and Secrets

ConfigMaps handle non-sensitive data Secrets are for sensitive information like passwords etc. To understand better we…
Kubernetes - Implementing Liveness and Readiness Probe for a Nodejs Application

2025年3月3日

Kubernetes - Implementing Liveness and Readiness Probe for a Nodejs Application

?? Liveness Probe Checks if your app is alive and working. If it’s NOT alive → Kubernetes restarts the app.
Kubernetes - Taints, Tolerations, Node Selectors & Node Affinity

2025年2月10日

Kubernetes - Taints, Tolerations, Node Selectors & Node Affinity

Taints, Tolerations, Node Selectors & Node Affinity are mechanisms for pod scheduling onto nodes. Before we start lets…
Kubernetes - A Step-by-Step Guide For Implementing Statefulset using MongoDB

2025年1月30日

Kubernetes - A Step-by-Step Guide For Implementing Statefulset using MongoDB

Follow this step-by-step guide to deploy a mongodb statefulset. StatefulSets are ideal for stateful applications like…
Kubernetes - Ingress (React and Node.js App)

2025年1月22日

Kubernetes - Ingress (React and Node.js App)

Ingress is an API object that manages external access to the services in a cluster. To demonstrate how Ingress work.
Kubernetes - Service (NodePort)

2025年1月12日

Kubernetes - Service (NodePort)

First let us briefly understand Kubernetes Nodeport Service and its features A NodePort service is used to access…

1 条评论
Kubernetes - Persistent Volume

2025年1月7日

Kubernetes - Persistent Volume

Persistent Volume: Persistent Volumes (PVs) in Kubernetes, provide persistent storage for applications running within a…
Kubernetes - ReplicaSet

2025年1月4日

Kubernetes - ReplicaSet

In this article let us understand how ReplicaSet in Kubernetes work with an example. Step 1 : Check the nodes present…
Kubernetes - Nginx Deployment Using Kind

2024年12月27日

Kubernetes - Nginx Deployment Using Kind

Step1: Create a namespace. The namespace is a logical separation for different resources within a cluster.
Kubernetes - Install Kind & Create a Multi-node Cluster

2024年12月23日

Kubernetes - Install Kind & Create a Multi-node Cluster

Kind (Kubernetes in Docker), is an open-source tool for running Kubernetes clusters locally using Docker containers as…

See all articles

Kubernetes - Horizontal Pod Autoscaling (HPA)

Kiran Kulkarni

Building applications | Specialised in cloud-based solutions

Components of HPA:

领英推荐

Kiran Kulkarni的更多文章

社区洞察

其他会员也浏览了

Interrupt Handling in ARM Cortex M Core

Between predictable and practical - on kubernetes limits

Mastering Manual Pod Scheduling in Kubernetes

Kubernetes - A story of two "idles"

Avoid Noisy Neighbors in Kubernetes: A Deep Dive into Resource Quotas ??

Delving into the Android Operating System Architecture: A Comprehensive Overview

Achieving Operational Excellence: CenterGrid's Adoption of VergeOS

DPDK Summit 2019 - The High Ground of Software Packet processing

A New Era in Operating Systems: Google's AI-Driven, Non-UNIX-Based OS, Fuchsia

Solving .NET Multiplatform Builds with Docker: A Real-World Example

Components of HPA:

领英推荐

Kiran Kulkarni的更多文章

Kubernetes - Configmaps and Secrets

Kubernetes - Implementing Liveness and Readiness Probe for a Nodejs Application

Kubernetes - Taints, Tolerations, Node Selectors & Node Affinity

Kubernetes - A Step-by-Step Guide For Implementing Statefulset using MongoDB

Kubernetes - Ingress (React and Node.js App)

Kubernetes - Service (NodePort)

Kubernetes - Persistent Volume

Kubernetes - ReplicaSet

Kubernetes - Nginx Deployment Using Kind

Kubernetes - Install Kind & Create a Multi-node Cluster

社区洞察

其他会员也浏览了

Interrupt Handling in ARM Cortex M Core

Between predictable and practical - on kubernetes limits

Mastering Manual Pod Scheduling in Kubernetes

Kubernetes - A story of two "idles"

Avoid Noisy Neighbors in Kubernetes: A Deep Dive into Resource Quotas ??

Delving into the Android Operating System Architecture: A Comprehensive Overview

Achieving Operational Excellence: CenterGrid's Adoption of VergeOS

DPDK Summit 2019 - The High Ground of Software Packet processing

A New Era in Operating Systems: Google's AI-Driven, Non-UNIX-Based OS, Fuchsia

Solving .NET Multiplatform Builds with Docker: A Real-World Example