Setting up a Horizontal Pod Autoscaler for a Kubernetes cluster
In a Docker-based microservice setup, the application does not automatically scale with the number of users accessing it during business hours.
To resolve this, the team at CloudifyOps suggested implementing a Horizontal Pod Autoscaler (HPA) in the Kubernetes environment. With this approach, we can scale the application pods based on CPU and memory utilization. It also reduces the need to run more pods (more replicas) during non-business hours.
Introduction:
Kubernetes autoscaling: Kubernetes offers three scaling tools: the Horizontal Pod Autoscaler (HPA), the Vertical Pod Autoscaler (VPA), and the Cluster Autoscaler. The HPA and VPA work at the application (pod) layer, while the Cluster Autoscaler scales the nodes themselves.
Horizontal pod autoscaling: When a spike or drop in consumption occurs, Kubernetes can automatically increase or decrease the number of pods that serve the workload.
Vertical pod autoscaling: Deciding how much CPU and memory to dedicate to a particular workload is challenging. With the right configuration, the VPA adjusts a pod's resource requests so that you get the most out of the allocated resources.
Requirement: a running Kubernetes cluster (created with kops in this walkthrough), the metrics server, and resource requests and limits defined on the workload to be scaled.
Steps to follow:
Installing the metrics-server: The goal of the HPA is to make scaling decisions based on the per-pod resource metrics that are retrieved from the metrics API (metrics.k8s.io).
Create the cluster without passing the --yes argument; this only generates the cluster configuration without provisioning anything. We then need to make the below changes to the cluster configuration so that the metrics server can authenticate against the kubelets.
The metrics server is what feeds the HPA, so it must be in place before the HPA can work.
For a cluster created with kops, follow these steps:
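First, generate the cluster configuration without the --yes flag. A minimal sketch, assuming placeholder values for the cluster name, state store, and zone (substitute your own):

kops create cluster \
  --name=hpa-demo.example.com \
  --state=s3://my-kops-state-store \
  --zones=us-east-1a \
  --node-count=2
# No --yes flag: kops only writes the cluster spec to the state store; nothing is provisioned yet.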
Next, add the below configuration to the cluster spec under the kubelet section (for example with kops edit cluster):
kubelet:
  anonymousAuth: false
  authorizationMode: Webhook
  authenticationTokenWebhook: true
After making the changes, we should update the cluster. With the below command, the cluster will be created with the required configuration.
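For example, with the same placeholder name and state store as above:

kops update cluster --name=hpa-demo.example.com --state=s3://my-kops-state-store --yes
# --yes applies the configuration and actually creates the cluster resources.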
If you are changing the configuration after the cluster is already deployed, you need to run a rolling update, which terminates and redeploys the master; new nodes are then brought up and the old nodes are terminated.
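That after-the-fact workflow would look roughly like this (same placeholder names as before):

kops edit cluster --name=hpa-demo.example.com --state=s3://my-kops-state-store      # add the kubelet settings
kops update cluster --name=hpa-demo.example.com --state=s3://my-kops-state-store --yes
kops rolling-update cluster --name=hpa-demo.example.com --state=s3://my-kops-state-store --yes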
To avoid this recreation, we are following the above steps while creating the Kubernetes cluster with kops (Kubernetes operations).
Now we need to install the metrics server
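The simplest option is to apply the official release manifest (pin a specific version if you prefer; the latest is used here):

kubectl apply -f https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml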
The output of the metrics server creation step lists the objects that were created: the metrics-server ServiceAccount, the related ClusterRoles and bindings, the metrics-server Service and Deployment, and the v1beta1.metrics.k8s.io APIService.
To confirm the metrics server installation, list the pods in the kube-system namespace:
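For example (the default manifest deploys the metrics server into kube-system):

kubectl get pods -n kube-system
# or check the deployment directly:
kubectl get deployment metrics-server -n kube-system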
You will find the metrics-server pod in the list.
We can find out the memory and CPU utilization of pods and nodes using the below commands:
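Both read from the same metrics API:

kubectl top nodes
kubectl top pods
# use -n <namespace> or --all-namespaces to look at pods outside the default namespace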
The output lists each node or pod along with its current CPU usage (in cores/millicores) and memory usage.
Resource Requests and limits:
If the node a pod is running on has spare capacity, a container may use more than its requested resources. However, a container can't use more than its resource limit.
For example, if you set a memory request of 256 MiB for a container and the node it is scheduled on has memory to spare, the container can use more RAM than that.
If a memory limit of 4 GiB is also set, the kubelet and the container runtime enforce it: the runtime stops a process that tries to consume more than the permitted amount of memory.
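A minimal sketch of what such a container spec looks like (the pod name, image, and values are only illustrative):

apiVersion: v1
kind: Pod
metadata:
  name: resource-demo
spec:
  containers:
  - name: app
    image: nginx
    resources:
      requests:
        cpu: 200m      # reserved for the container at scheduling time
        memory: 256Mi
      limits:
        cpu: 500m      # CPU is throttled beyond this
        memory: 4Gi    # the process is killed if it exceeds this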
Configuring HPA:
It is important that resource requests and limits are defined in the container spec, as shown in the snippet above.
First, we will start a deployment running the php-apache test image and expose it as a service using the following command:
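The upstream HPA walkthrough provides a ready-made manifest for this (a php-apache Deployment built on the hpa-example image, with a 200m CPU request and 500m limit, plus a matching Service), which we assume here as the test workload:

kubectl apply -f https://k8s.io/examples/application/php-apache.yaml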
One new deployment and one service will be created with the above command. After the deployment is up, we need to deploy the HPA.
Run the below command:
echo 'apiVersion: autoscaling/v1
kind: HorizontalPodAutoscaler
metadata:
  name: php-apache
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: php-apache
  minReplicas: 1
  maxReplicas: 10
  targetCPUUtilizationPercentage: 50' | kubectl apply -f -
The HPA is now deployed; you can check its status as shown below.
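For example (the exact column layout varies slightly between kubectl versions; the numbers below are only indicative of an idle deployment):

kubectl get hpa php-apache
# NAME         REFERENCE               TARGETS   MINPODS   MAXPODS   REPLICAS   AGE
# php-apache   Deployment/php-apache   0%/50%    1         10        1          2m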
As there is no load applied on the deployment, the targets show 0%/50%. To test the HPA, we shall apply load on the deployment.
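One way to generate load, borrowed from the upstream HPA walkthrough, is a throwaway busybox pod that requests the php-apache service in a loop:

kubectl run -i --tty load-generator --rm --image=busybox:1.28 --restart=Never -- /bin/sh -c "while sleep 0.01; do wget -q -O- http://php-apache; done"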
Run the above command to apply some load on the deployment. The load generator keeps printing OK! as its requests succeed, and the HPA targets start to climb.
The deployment is scaled up: once CPU utilization rises above the 50% target, the HPA increases the replica count; in our test it scaled up to 7 pods.
The default time to scale down is 300 seconds. The scale down time can be customized to suit different requirements.
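If your cluster supports the autoscaling/v2 API, the window can be tuned per HPA through the behavior field; a minimal sketch (lowering it from the 300-second default):

spec:
  behavior:
    scaleDown:
      stabilizationWindowSeconds: 60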
Note: If you use AWS EKS, the metrics server is not installed by default and needs to be deployed with the following command.
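The same upstream manifest used earlier works on EKS as well:

kubectl apply -f https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml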
With the above exercise, we applied load on the CPU. Similarly, the HPA can be configured to scale on memory usage as well (this requires the autoscaling/v2 API rather than autoscaling/v1).
To learn more about these cutting edge technologies & real time industry applied best practices, follow our LinkedIn Page. To explore our services, visit our website.