Kubernetes platform - at the fundamental level
"I see a lot of people having problems to understand how the Kubernetes platform works at the fundamental level, e.g. resiliency and behaviour. If you start thinking about Kubernetes as a fully event-driven system, there's answers to so many "Why"'s"
Think of the API server as an immutable (replicated) log (queue) and stream of events. Events are facts that can be causally related ("happened-before") or not related at all (in which case we say they happened "concurrently"). etcd is an implementation detail.
All controllers, e.g. the scheduler, deployment controller, endpoint controller, kubelet, etc., can be understood as consumers and/or producers (a consumer can also be a producer, and vice versa).
Consumers specify the objects (and, optionally, the namespace) they want to receive events for from the API server. This is called a "watch" in Kubernetes. Think of the combination of object+namespace as a dedicated (virtual) event queue the API server handles.
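To make this concrete, here's a minimal sketch of such a watch using client-go. The kubeconfig path and the "default" namespace are just assumptions for illustration; any reachable cluster will do:

```go
package main

import (
	"context"
	"fmt"
	"path/filepath"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
	"k8s.io/client-go/util/homedir"
)

func main() {
	// Build a client from the local kubeconfig (assumption: ~/.kube/config).
	kubeconfig := filepath.Join(homedir.HomeDir(), ".kube", "config")
	config, err := clientcmd.BuildConfigFromFlags("", kubeconfig)
	if err != nil {
		panic(err)
	}
	clientset := kubernetes.NewForConfigOrDie(config)

	// Subscribe to the "virtual event queue" for Pods in one namespace.
	watcher, err := clientset.CoreV1().Pods("default").Watch(context.TODO(), metav1.ListOptions{})
	if err != nil {
		panic(err)
	}
	defer watcher.Stop()

	// Each event is a fact: ADDED, MODIFIED or DELETED, ordered by resourceVersion.
	for event := range watcher.ResultChan() {
		if pod, ok := event.Object.(*corev1.Pod); ok {
			fmt.Printf("%s %s (resourceVersion %s)\n", event.Type, pod.Name, pod.ResourceVersion)
		}
	}
}
```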
Consumers and producers don't know about each other as they're fully decoupled (by the log) and autonomous. This makes the whole system extremely scalable, robust and extensible (flexible).
Thus, by design, it's a fully asynchronous and eventually consistent platform. Information takes time to propagate from producer(s) to consumer(s); a typical example is the HPA acting on metrics that haven't caught up with the actual state yet.
There's NO guarantee that the system will converge to the desired state, even if you got an "OK"/ACK from the control plane. For example: kubectl scale <deploy> --replicas <n>, with total CPU requests exceeding cluster capacity. You'll get an HTTP 200, even though there is no capacity left, and the extra pods will simply stay Pending.
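The same thing expressed in code, as a minimal client-go sketch; the deployment name "web", the namespace, and the replica count are all made up:

```go
package main

import (
	"context"
	"fmt"
	"path/filepath"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
	"k8s.io/client-go/util/homedir"
)

func main() {
	kubeconfig := filepath.Join(homedir.HomeDir(), ".kube", "config")
	config, err := clientcmd.BuildConfigFromFlags("", kubeconfig)
	if err != nil {
		panic(err)
	}
	clientset := kubernetes.NewForConfigOrDie(config)

	// Read the current scale of a (hypothetical) deployment "web".
	scale, err := clientset.AppsV1().Deployments("default").
		GetScale(context.TODO(), "web", metav1.GetOptions{})
	if err != nil {
		panic(err)
	}
	scale.Spec.Replicas = 1000 // far beyond what the cluster can run

	// This call succeeds (the HTTP 200 "ACK"): it only records the desired
	// state in the log. Whether the pods ever run is decided later and
	// asynchronously by the scheduler; without capacity they stay Pending.
	if _, err := clientset.AppsV1().Deployments("default").
		UpdateScale(context.TODO(), "web", scale, metav1.UpdateOptions{}); err != nil {
		panic(err)
	}
	fmt.Println("scale request accepted; convergence is not guaranteed")
}
```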
Another example of unwanted "racy" conditions between controllers when scaling down services:
Handling Client Requests Properly with Kubernetes
Controllers are stateless. For efficiency and speed, received events are placed in a per-controller in-memory buffer (typically also modelled as a queue). So what happens if a controller crashes, given that it doesn't persist state?
Persistence is not needed! The event-driven design will replay all (appropriate) events when the controller starts, also known as event-sourcing. This is also very useful because events are delivered "at most once", i.e. they can be lost during transmission.
Kubernetes is level-triggered: if an event gets lost during transmission (e.g. due to a network issue), you'll receive the desired state again at the next sync. See the informer sketch after the links below.
Kubernetes: Edge vs Level Triggered Logic
Reconciliation in case of the scheduler
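This list-then-watch-then-resync behaviour is what client-go's informers give you. A minimal sketch: on startup the informer lists (replays) the full current state and then watches; the resync period re-delivers the full state periodically, so a handler that missed an edge still converges on the level. The 30-second period is just an illustration:

```go
package main

import (
	"fmt"
	"path/filepath"
	"time"

	corev1 "k8s.io/api/core/v1"
	"k8s.io/client-go/informers"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/cache"
	"k8s.io/client-go/tools/clientcmd"
	"k8s.io/client-go/util/homedir"
)

func main() {
	kubeconfig := filepath.Join(homedir.HomeDir(), ".kube", "config")
	config, err := clientcmd.BuildConfigFromFlags("", kubeconfig)
	if err != nil {
		panic(err)
	}
	clientset := kubernetes.NewForConfigOrDie(config)

	// On start, the informer LISTs all pods (the "replay") and then WATCHes.
	// The resync period re-delivers the full current state at intervals.
	factory := informers.NewSharedInformerFactory(clientset, 30*time.Second)
	podInformer := factory.Core().V1().Pods().Informer()

	podInformer.AddEventHandler(cache.ResourceEventHandlerFuncs{
		// Fires for every existing pod on the initial list, and for new pods.
		AddFunc: func(obj interface{}) {
			pod := obj.(*corev1.Pod)
			fmt.Printf("reconcile %s/%s\n", pod.Namespace, pod.Name)
		},
		// Fires on changes AND on every resync, even if nothing changed:
		// the level, not the edge, drives reconciliation.
		UpdateFunc: func(oldObj, newObj interface{}) {
			pod := newObj.(*corev1.Pod)
			fmt.Printf("reconcile %s/%s\n", pod.Namespace, pod.Name)
		},
	})

	stop := make(chan struct{})
	defer close(stop)
	factory.Start(stop)
	factory.WaitForCacheSync(stop)
	select {} // run forever
}
```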
Since information can be delivered more than once (e.g. after a failure, a resync, etc.) and controllers don't talk to each other directly, there's a potential for race conditions when state is changed (e.g. writes to the same object from different controllers).
This is resolved with optimistic concurrency control, handled in the application layer (i.e. in each controller's logic). Idempotency and compare-and-set (based on monotonically increasing resource versions) are the patterns used here.
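client-go ships a small helper for exactly this compare-and-set retry loop. Here's a sketch of scaling a (hypothetical) deployment "web" with it; the names and replica count are assumptions:

```go
package main

import (
	"context"
	"path/filepath"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
	"k8s.io/client-go/util/homedir"
	"k8s.io/client-go/util/retry"
)

func main() {
	kubeconfig := filepath.Join(homedir.HomeDir(), ".kube", "config")
	config, err := clientcmd.BuildConfigFromFlags("", kubeconfig)
	if err != nil {
		panic(err)
	}
	clientset := kubernetes.NewForConfigOrDie(config)

	// RetryOnConflict re-runs the closure whenever the update is rejected
	// because our resourceVersion is stale (another writer got there first).
	err = retry.RetryOnConflict(retry.DefaultRetry, func() error {
		// Re-read the latest version; it carries the current resourceVersion.
		deploy, err := clientset.AppsV1().Deployments("default").
			Get(context.TODO(), "web", metav1.GetOptions{})
		if err != nil {
			return err
		}
		replicas := int32(3)
		deploy.Spec.Replicas = &replicas
		// Compare-and-set: the API server accepts the write only if the
		// resourceVersion still matches; otherwise it returns a Conflict.
		_, err = clientset.AppsV1().Deployments("default").
			Update(context.TODO(), deploy, metav1.UpdateOptions{})
		return err
	})
	if err != nil {
		panic(err)
	}
}
```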
Event-driven design has many more advantages and can address a lot of problems in distributed systems (backpressure, queuing, retries, scale-out, etc.).
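Retries with backoff, for instance, come almost for free with client-go's rate-limited workqueue, which is how controllers buffer the events mentioned earlier. A simplified sketch of that loop (the key "default/web" and the reconcile function are made up):

```go
package main

import (
	"fmt"

	"k8s.io/client-go/util/workqueue"
)

func main() {
	// Rate-limited queue: per-item exponential backoff plus an overall limiter.
	queue := workqueue.NewRateLimitingQueue(workqueue.DefaultControllerRateLimiter())
	queue.Add("default/web") // controllers enqueue the object's key, not the object

	key, _ := queue.Get()
	if err := reconcile(key.(string)); err != nil {
		// Backpressure: re-enqueue with backoff instead of retrying in a hot loop.
		queue.AddRateLimited(key)
	} else {
		queue.Forget(key) // reset the backoff counter on success
	}
	queue.Done(key)
}

func reconcile(key string) error {
	fmt.Println("reconciling", key)
	return nil
}
```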
It's not a new idea. Relational databases work the same way (the transaction log): the database is said to be the (stale) cache of the immutable log :)
If you want to learn more about event-driven architecture, make sure to check out this video: Designing Events-first Microservices
A deep dive into Kubernetes API Server
Closing remark: this is not specific to computer science. Events are the reason for all being; without events, there would be no change! Think about it: it's an applied and battle-tested design.