登录查看更多内容

Adding Readiness & Liveness to Kubernetes Workloads (Kibana)

Rajaraman Sathyamurthy

Associate Director & Senior Architect, Data Architecture

发布日期: 2023年4月10日

In our Kubernetes environment, we have Traefik load balancers routing the user traffic to Kibana application servers. We noticed that one of the Kibana servers was hung / unresponsive but the Traefik load balancers were still continuing to route the user traffic, resulting users to get server error.

So we decided to setup Readiness and Liveness probes for Kibana application container pods.

Now what is Readiness and Liveness probe in Kubernetes?

Both readiness and liveness are used to monitor the status of the pod, but the action taken is different between the two.?

If readiness is configured, when pod becomes unresponsive, the traffic to the unresponsive pod is removed from Service load balancers (without which the traffic would continue to get routed to the pod running the service but won't cater the service to users). But the service defunct pod continues to run as it is but user impact is avoided.

If liveness is configured, the unresponsive pod gets rebooted so that it can service again (useful for problems that gets resolved on reboot).

领英推荐

Efficient TCP Server Connection Management

Keploy ?? 1 个月前

How Load Balancing Algorithms Work and How to Choose…

RedSwitches 3 个月前

Why Linux Dedicated Servers are the Backbone of…

Kennies IT 1 个月前

Both of these are achieved by the periodic probes sent by kubelet. The configuration includes setting up the following parameters:

failure threshold, initial delay seconds, period seconds, success threshold, timeout seconds, command to be executed (used with "if" and "fi" conditions).

The periodSeconds field specifies that the kubelet should perform a liveness probe every x seconds. The initialDelaySeconds field tells the kubelet that it should wait x seconds before performing the first probe. To perform a probe, the kubelet executes the command (written in exec:) in the target container. In our configuration, if the command succeeds, it returns 200, and the kubelet considers the container to be alive and healthy. If the command returns a non-200 value, the kubelet kills the container and restarts it.

The TCP liveness probe can also be configured to connect on specific ports like 8080 to determine liveness. Both readiness and liveness can be used on same container (first to remove the unresponsive pod from Service load balancers and then to reboot the unresponsive pod).

If there is a pod that usually takes longer time to start, startup probe can be setup. Once the pod is up, liveness probe takes over and start monitoring. If startup is unsuccessful, the container will be killed after defined time based on pod restart policy.

Now after enabling this configuration, If Kibana goes into hung state, it will be automatically removed from Service load balancers. If Kibana starts responding again before reaching the threshold, it again gets added to service load balancers; if not container gets rebooted on reaching liveness threshold.

Team: Rajesh Mehra ; Eshwar Hudge and Rajaraman Sathyamurthy

要查看或添加评论，请登录

Rajaraman Sathyamurthy的更多文章

Fail-safe Logstash Pipelines

2023年11月14日

Fail-safe Logstash Pipelines

This article talks about establishing fail-safe mechanism for Logstash pipelines. Logstash is an open-source data…

1 条评论
Business Function Health Visualization (BFHV)

2023年3月30日

Business Function Health Visualization (BFHV)

Authors (alphabetical order): 1. Anup Kumar Gupta PMP? 2.

1 条评论
Issues in Kubernetes Pods post node reboot

2023年1月30日

Issues in Kubernetes Pods post node reboot

You may have automated patching and reboot scheduled for your VMs (as part of maintenance / patching window), using…
Data Pipeline Monitoring

2023年1月23日

Data Pipeline Monitoring

There are many reasons, a data pipeline could break. Most of the time it is due to issues on the data source.
Setting up multiple replicas in Kubernetes

2023年1月23日

Setting up multiple replicas in Kubernetes

Our Kibana application was running as a single instance (pod) and there was no redundancy. How did we address it…
Prod Kubernetes Challenges - Single Master, Legacy version, non-prod DC

2023年1月16日

Prod Kubernetes Challenges - Single Master, Legacy version, non-prod DC

We were stuck in legacy K8 version (v1.13.
Elastic Data Lake - Cluster status RED - issues, challenges in remediation, how did we solve it?

2023年1月13日

Elastic Data Lake - Cluster status RED - issues, challenges in remediation, how did we solve it?

One of the datalake environment we built as POC, in couple of years became production datalake and hit it's capacity…

2 条评论
Elastic Data Lake - Decision to move away from Tivoli LFA

2023年1月13日

Elastic Data Lake - Decision to move away from Tivoli LFA

AIX logs from hundreds of servers, were being ingested to Elastic Datalake; to Logstash using Tivoli LogFile Agent…

1 条评论

See all articles

Adding Readiness & Liveness to Kubernetes Workloads (Kibana)

Rajaraman Sathyamurthy

Associate Director & Senior Architect, Data Architecture

领英推荐

Rajaraman Sathyamurthy的更多文章

社区洞察

其他会员也浏览了

Introduction To OpenVZ Virtualization

Configuring HTTP Load Balancer

The Evolution of Application Hosting: From Physical Servers to Container Orchestration and Beyond

Implementing Active/Active Load-Balancer with KEEPALIVED and NGINX servers and using them as SSL termination

Research: Configuration of multiple WEB SERVERS on Docker Containers along with Host Computer Web Server on the top of AWS

Mastering System Design Essentials: Building Scalable and Robust Systems

System Design: Load balancers

Tech Tale #1-The Heroic Load Balancer: Ensuring Smooth Sailing Systems

Load Balancer Algorithms: Static vs. Dynamic

Use Nginx as a Load Balancer

领英推荐

Rajaraman Sathyamurthy的更多文章

Fail-safe Logstash Pipelines

Business Function Health Visualization (BFHV)

Issues in Kubernetes Pods post node reboot

Data Pipeline Monitoring

Setting up multiple replicas in Kubernetes

Prod Kubernetes Challenges - Single Master, Legacy version, non-prod DC

Elastic Data Lake - Cluster status RED - issues, challenges in remediation, how did we solve it?

Elastic Data Lake - Decision to move away from Tivoli LFA

社区洞察

其他会员也浏览了

Introduction To OpenVZ Virtualization

Configuring HTTP Load Balancer

The Evolution of Application Hosting: From Physical Servers to Container Orchestration and Beyond

Implementing Active/Active Load-Balancer with KEEPALIVED and NGINX servers and using them as SSL termination

Research: Configuration of multiple WEB SERVERS on Docker Containers along with Host Computer Web Server on the top of AWS

Mastering System Design Essentials: Building Scalable and Robust Systems

System Design: Load balancers

Tech Tale #1-The Heroic Load Balancer: Ensuring Smooth Sailing Systems

Load Balancer Algorithms: Static vs. Dynamic

Use Nginx as a Load Balancer