Practical Kubernetes Optimization Tips for Peak Performance (With Code Examples)

Optimizing your Kubernetes cluster can significantly boost performance and efficiency. Here are some practical strategies to help you streamline operations and reduce costs.


Optimize Resource Requests and Limits

Set appropriate resource requests and limits for CPU and memory to prevent over-provisioning and underutilization. Monitor your workloads with tools like kubectl top, Prometheus, or the metrics server, and base requests and limits on actual usage.

apiVersion: apps/v1
kind: Deployment
metadata:
  name: optimized-app
spec:
  replicas: 2
  selector:
    matchLabels:
      app: optimized-app
  template:
    metadata:
      labels:
        app: optimized-app
    spec:
      containers:
      - name: app-container
        image: your-app-image
        resources:
          requests:
            memory: "512Mi"
            cpu: "500m"
          limits:
            memory: "1Gi"
            cpu: "1"        

Use Horizontal Pod Autoscaling (HPA)

HPA scales pods automatically based on CPU, memory, or custom metrics so applications can handle varying workloads. Define a target metric and configure the HPA as its own resource that references the Deployment, as shown below.

apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: hpa-app
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: optimized-app
  minReplicas: 2
  maxReplicas: 10
  metrics:
  - type: Resource
    resource:
      name: cpu
      target:
        type: Utilization
        averageUtilization: 60        
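
For quick experiments, roughly the same autoscaler can be created imperatively; this one-liner assumes the optimized-app Deployment from the earlier example exists:

# Imperative equivalent of the HPA manifest above
kubectl autoscale deployment optimized-app --cpu-percent=60 --min=2 --max=10

# Watch replica counts and current vs. target utilization
kubectl get hpa --watch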

Leverage Cluster Autoscaler

The Cluster Autoscaler adjusts cluster size automatically, adding or removing nodes when required, which reduces costs and ensures there is always enough capacity. Configure it for your cloud provider (e.g., AWS, GCP) so it can scale node groups based on resource demand.

# Configure Cluster Autoscaler to scale node groups in an AWS EKS cluster
apiVersion: apps/v1
kind: Deployment
metadata:
  name: cluster-autoscaler
  labels:
    app: cluster-autoscaler
spec:
  replicas: 1
  selector:
    matchLabels:
      app: cluster-autoscaler
  template:
    metadata:
      labels:
        app: cluster-autoscaler
    spec:
      serviceAccountName: cluster-autoscaler
      containers:
      - image: registry.k8s.io/autoscaling/cluster-autoscaler:v1.xxx # use the release matching your cluster's Kubernetes minor version (k8s.gcr.io is frozen)
        name: cluster-autoscaler
        command:
        - ./cluster-autoscaler
        - --cloud-provider=aws
        - --skip-nodes-with-local-storage=false
        - --nodes=1:10:<node-group-name>        
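
A common refinement, sketched here as an optional change rather than part of the original example: let the autoscaler discover node groups by tag instead of hard-coding a --nodes range. This assumes your Auto Scaling groups carry the standard k8s.io/cluster-autoscaler tags.

# Replace the --nodes flag in the container command above with:
- --node-group-auto-discovery=asg:tag=k8s.io/cluster-autoscaler/enabled,k8s.io/cluster-autoscaler/<your-cluster-name>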

Or Use a Better Alternative: Karpenter

Karpenter is an open-source Kubernetes cluster autoscaler that dynamically provisions just-in-time compute resources based on the needs of your workloads. It simplifies node provisioning and reduces cluster costs by launching nodes tailored to your workload requirements.

# Install Karpenter Helm chart
helm repo add karpenter https://charts.karpenter.sh/
helm repo update

helm install karpenter karpenter/karpenter \
    --namespace karpenter --create-namespace \
    --set controller.clusterName=<your-cluster-name> \
    --set controller.clusterEndpoint=$(aws eks describe-cluster --name <your-cluster-name> --query "cluster.endpoint" --output text) \
    --set serviceAccount.create=true \
    --set serviceAccount.annotations."eks\.amazonaws\.com/role-arn"=<your-karpenter-role-arn>

A minimal Provisioner then tells Karpenter what it may launch and when to remove empty nodes:

apiVersion: karpenter.sh/v1alpha5
kind: Provisioner
metadata:
  name: amd64
spec:
  requirements:
    - key: karpenter.sh/capacity-type
      operator: In
      values: ["on-demand"]
    - key: "topology.kubernetes.io/zone" 
      operator: In
      values: ["eu-west-1a", "eu-west-1b", "eu-west-1c"]
    - key: "kubernetes.io/arch" 
      operator: In
      values: ["amd64"]
  limits:
    resources:
      cpu: 100
  provider:
    instanceProfile: KarpenterNodeInstanceProfile-karpenter-cluster
    securityGroupSelector:
      kubernetes.io/cluster/karpenter-cluster: '*'
  ttlSecondsAfterEmpty: 30        
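
Note that Provisioner (karpenter.sh/v1alpha5) is the older API; current Karpenter releases replace it with NodePool plus EC2NodeClass. A rough NodePool equivalent is sketched below, assuming an EC2NodeClass named default exists to carry the instance profile and security-group selection:

apiVersion: karpenter.sh/v1
kind: NodePool
metadata:
  name: amd64
spec:
  template:
    spec:
      requirements:
        - key: karpenter.sh/capacity-type
          operator: In
          values: ["on-demand"]
        - key: kubernetes.io/arch
          operator: In
          values: ["amd64"]
      nodeClassRef:
        group: karpenter.k8s.aws
        kind: EC2NodeClass
        name: default   # assumed to exist; holds instance profile and security groups
  limits:
    cpu: 100
  disruption:
    consolidationPolicy: WhenEmpty   # replaces ttlSecondsAfterEmpty
    consolidateAfter: 30s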


Implement Pod Disruption Budgets (PDB)

Pod Disruption Budgets keep critical applications available during voluntary disruptions such as node upgrades or autoscaling events. Define a PDB to limit how many pods can be taken down simultaneously during planned maintenance.

apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: pdb-app
spec:
  minAvailable: 2
  selector:
    matchLabels:
      app: optimized-app        
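
One caveat: combined with the two-replica Deployment from earlier, minAvailable: 2 leaves zero allowed disruptions, which can block node drains and upgrades entirely. Check what the budget actually permits:

# ALLOWED DISRUPTIONS should be at least 1, or drains will stall
kubectl get pdb pdb-app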


Enable Vertical Pod Autoscaler (VPA)

VPA automatically adjusts resource requests and limits based on actual usage over time, improving resource utilization. Enable it for workloads whose resource consumption changes frequently so their allocations are tuned automatically.

(This is especially useful for DB and cache-related workloads)

apiVersion: autoscaling.k8s.io/v1
kind: VerticalPodAutoscaler
metadata:
  name: vpa-app
spec:
  targetRef:
    apiVersion: "apps/v1"
    kind: Deployment
    name: optimized-app
  updatePolicy:
    updateMode: "Auto"        

Optimize Networking with CNI Plugins

Efficient network communication reduces latency and improves performance, especially in large clusters. Choose a suitable CNI plugin (e.g., Calico, Cilium, Flannel) based on your network requirements, and configure network policies to secure and streamline communication between pods. (Below is an example using Calico.)

# Install the Calico CNI plugin
kubectl apply -f https://docs.projectcalico.org/manifests/calico.yaml

apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: allow-specific-app
spec:
  podSelector:
    matchLabels:
      app: optimized-app
  policyTypes:
  - Ingress   # declaring Egress here with no egress rules would block all outbound traffic
  ingress:
  - from:
    - podSelector:
        matchLabels:
          app: allowed-app        


Use Efficient Storage Classes and Persistent Volumes

Properly configured storage classes improve I/O performance for applications that rely on databases or need fast read/write speeds. Choose a storage class appropriate to each workload (SSD, HDD, etc.), and keep persistent volumes efficiently utilized by deleting unused ones.

apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: fast-ssd
provisioner: ebs.csi.aws.com  # the in-tree kubernetes.io/aws-ebs provisioner does not support gp3
parameters:
  type: gp3
  iops: "3000"        # gp3 takes absolute iops/throughput; iopsPerGB applies to io1
  throughput: "125"

---

apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: app-storage
spec:
  accessModes:
  - ReadWriteOnce
  storageClassName: fast-ssd
  resources:
    requests:
      storage: 10Gi        
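
On multi-zone clusters it is usually worth delaying volume creation until a pod is scheduled so the volume lands in that pod's availability zone. This is a standard StorageClass field, suggested here as an optional addition to the example above:

# Optional addition to the StorageClass above: bind volumes only when a
# consuming pod is scheduled, keeping volume and pod in the same zone
volumeBindingMode: WaitForFirstConsumer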


Leverage Node Local DNS Cache

NodeLocal DNSCache improves DNS resolution times and reduces load on the cluster DNS service by caching lookups on each node. Enable it to boost performance, especially for applications that make many DNS queries.

# Fetch the NodeLocal DNSCache manifest; it ships with __PILLAR__ placeholders
# that must be substituted before it can be applied (see below)
curl -LO https://k8s.io/examples/admin/dns/nodelocaldns.yaml
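
A minimal substitution pass, following the upstream docs for kube-proxy's default iptables mode (169.254.20.10 is the conventional link-local listen address, and cluster.local assumes the default cluster domain):

kubedns=$(kubectl get svc kube-dns -n kube-system -o jsonpath='{.spec.clusterIP}')
sed -i \
  -e "s/__PILLAR__LOCAL__DNS__/169.254.20.10/g" \
  -e "s/__PILLAR__DNS__DOMAIN__/cluster.local/g" \
  -e "s/__PILLAR__DNS__SERVER__/$kubedns/g" \
  nodelocaldns.yaml
kubectl apply -f nodelocaldns.yaml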

Optimize Pod Affinity and Anti-affinity Rules

Pod affinity and anti-affinity rules ensure efficient workload distribution across nodes, preventing resource contention and improving resilience. Use affinity to co-locate pods that benefit from proximity and anti-affinity to spread replicas for failover; the example below co-locates pods with another-app, and an anti-affinity sketch follows it.

apiVersion: apps/v1
kind: Deployment
metadata:
  name: affinity-app
spec:
  replicas: 3
  selector:
    matchLabels:
      app: affinity-app
  template:
    metadata:
      labels:
        app: affinity-app
    spec:
      affinity:
        podAffinity:
          requiredDuringSchedulingIgnoredDuringExecution:
          - labelSelector:
              matchExpressions:
              - key: app
                operator: In
                values:
                - another-app
            topologyKey: "kubernetes.io/hostname"
      containers:
      - name: affinity-container
        image: your-app-image        
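
The example above only shows affinity. A matching anti-affinity sketch (reusing the same labels) spreads affinity-app replicas across nodes so a single node failure cannot take them all down; add it under the pod template's spec:

      affinity:
        podAntiAffinity:
          preferredDuringSchedulingIgnoredDuringExecution:
          - weight: 100
            podAffinityTerm:
              labelSelector:
                matchLabels:
                  app: affinity-app
              topologyKey: "kubernetes.io/hostname"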

Use Readiness and Liveness Probes

Readiness and liveness probes ensure that only healthy pods serve traffic, preventing bad user experiences and reducing downtime. Configure both in your deployments: a failing readiness probe removes the pod from service endpoints, while a failing liveness probe restarts the container.

apiVersion: apps/v1
kind: Deployment
metadata:
  name: probe-app
spec:
  replicas: 3
  selector:
    matchLabels:
      app: probe-app
  template:
    metadata:
      labels:
        app: probe-app
    spec:
      containers:
      - name: probe-container
        image: your-app-image
        livenessProbe:
          httpGet:
            path: /healthz
            port: 8080
          initialDelaySeconds: 5
          periodSeconds: 10
        readinessProbe:
          httpGet:
            path: /readiness
            port: 8080
          initialDelaySeconds: 5
          periodSeconds: 10        
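
If the application boots slowly, a startup probe keeps the liveness probe from restarting it mid-initialization. A sketch reusing the assumed /healthz endpoint, added alongside the probes above:

        startupProbe:
          httpGet:
            path: /healthz
            port: 8080
          failureThreshold: 30   # up to 30 checks x 10s = 5 minutes to start
          periodSeconds: 10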


By applying these optimization techniques, you can enhance Kubernetes cluster performance, improve resource utilization, and ensure smoother, more scalable operations.


Are these tips useful? If yes, please repost this article so it can reach more people who need to read it!


Follow Sandip Das for more!
