登录查看更多内容

Common Kubernetes troubleshooting tasks

Prabhakar T

Platform Engineering Engineer at Amadeus Labs

发布日期: 2022年4月17日

If you are working on?kubernetes ?or heard about that with someone working on kubernetes, says Kubernetes is complex and hard to manage or troubleshoot. Same time you may see in kubernetes cluster either one, goes wrong sometimes.

You may experienced most common issues are container unavailable or pod doesn’t respond. With that any guess how your DevOps/SRE teams figures out the cause of the issue and fix it?

In this section will see what common scenarios your DevOps/SRE teams are may encounter, and how they address them.

1. Node unavailable

One of the key reason Kubernetes known for High Availability, where kubernetes automatically distribute the applications across multiple available nodes hosted on physical datacenter or virtual machines. If there is some availability issue, there is likely an insufficient number of available nodes.

If you see any node related issue, make sure you have enough nodes assigned to cluster. Any High availability cluster should contain minimum two nodes, please note we are speaking about kubernetes master node.

Even with enough nodes, you may find that nodes fail after you’ve set up and joined them to a cluster. One way to address this issue is to enable auto-recovery of any VMs that host nodes. Most cloud providers and on-premises VM platforms offer auto-recovery features that restart a failed machine automatically.

Increasing the number of servers in a cluster may also improve node availability, even if the number of nodes stays the same. When you spread nodes across multiple servers, you limit the harm done to your cluster by a server failure.

Continue Reading on Common Kubernetes troubleshooting tasks - FoxuTech

要查看或添加评论，请登录

查看全部

Common Kubernetes troubleshooting tasks

Prabhakar T

Platform Engineering Engineer at Amadeus Labs

1. Node unavailable

更多精彩文章

社区洞察

其他会员也浏览了

AIOps for the IT Infrastructure

DevOps Project - 8 (Monitoring)

What ‘Software-Defined’ Really Means, Part One

Unleashing the Power of NetDevOps: Simplifying Network Operations with Automation

What Kubernetes is & VSCO Case Study !!

Embracing Automation with Ansible: The Power of Dynamic Inventories

Navigating Challenges in Docker: Strategies for Smooth Sailing

Infrastructure Cost Saving Initiatives

How I Reduced CI/CD infrastructure costs at Nanoheal

?? Infrastructure as Code

1. Node unavailable

How to create azure container instance using terraform

2022年7月13日

Azure Container Instances – its features

2022年7月12日

How to Manage ArgoCD RBAC configuration

2022年7月5日

Deploy an application in Kubernetes using Argo CD with GitHub

2022年5月16日

Deploy an application in Kubernetes using Argo CD with GitHub

2022年5月16日

How to deploy an application in Kubernetes using Argo CD

2022年5月9日

Setup Prometheus on Azure Kubernetes Service

2022年5月9日

Let’s Understand about GitOps

2022年4月28日

How to create Azure Kubernetes Service using Terraform

2022年4月28日

Sloop – Kubernetes Events History Visualization

2022年4月19日

社区洞察

其他会员也浏览了

AIOps for the IT Infrastructure

DevOps Project - 8 (Monitoring)

What ‘Software-Defined’ Really Means, Part One

Unleashing the Power of NetDevOps: Simplifying Network Operations with Automation

What Kubernetes is & VSCO Case Study !!

Embracing Automation with Ansible: The Power of Dynamic Inventories

Navigating Challenges in Docker: Strategies for Smooth Sailing

Infrastructure Cost Saving Initiatives

How I Reduced CI/CD infrastructure costs at Nanoheal

?? Infrastructure as Code