登录查看更多内容

Running Database Services On Kubernetes

Leonid Mirsky

CEO & Founder @ Opsfleet | DevOps Services Agency for Tech-Companies

发布日期: 2018年4月10日

+ 关注

Many Kubernetes examples you find online usually concentrate on running stateless applications.

Typically, these are your standard Nodejs express applications or a python based API written with Flask.

Running these types of application on Kubernetes today is relatively easy. You have everything you need to run and operate them at scale: rolling deployments, ingress controllers, control over termination timeouts, and more.

But how about running a stateful application that occasionally needs to write data on disk and make sure this data persists between container restarts or when the container is rescheduled to another node? Or running a database like MongoDB on Kubernetes?

That’s where things aren’t so straightforward. Fortunately, Kubernetes and its vibrant community provide many options for how to run these stateful workloads.

We’ll dive a bit deeper to review these options, but you might ask —

Why Is It Harder To Deploy Stateful Apps On Kubernetes?

Can we just attach a volume to our pod template? shouldn’t it be enough? Theoretically, your application can now write to disk, and if the container will be restarted or travel to another node, the volume will be re-attached to the container in its new location.

That’s true for simple cases, but the situation is much more complicated for services like Elasticsearch, etcd, Consul, and such.

These services have a few requirements that aren’t satisfied by the regular Kubernetes Deployment controller.

For example, you may need to have predictable DNS names for each pod to make initial cluster formation easier. Or your deployed system may need to ensure that the pods will be started in a certain predefined order.

Additionally, you may want to create and attach a separate volume to each of your pods that will be tied to it through the whole pod’s lifecycle. With regular pods, you can only attach one volume that will be shared between all the pods created by the same deployment.

We also didn’t mention how you are going to operate your database. You’ll also need to make sure you have a plan for when and how backups are going to be performed or how a recovery/failover will be performed in case something bad happens.

Available Options For Running Stateful Applications

Here are a few options for how you can deploy your databases on Kubernetes:

1. Stateful sets

StatefulSet, which until recently was called a PetSet, is a built-in controller which in essence is similar to Kubernetes’ deployments.

Eventually, it will create and manage a set of pods based on the pod template you will specify.

The main difference is that it provides the following guarantees to the underlying applications:

Each pod will have a stable, unique network identifier
Each pod may have a stable, persistent storage volume
Deployment, scaling, or termination will be ordered and graceful

StatefulSets are generic, so you can use them to model your databases’ unique cluster formation or master/slave architecture.

However, the end result will lack on the operational side. You will need to add additional resources or automation to make sure you can perform periodic backups or add scripts that deal with edge cases such as a failover.

Eventually, the modeling of the more complex stateful services using StatefulSets may feel a bit clunky and not native to Kubernetes, and, as mentioned above, will lack management automation. This is where operators come into play:

2. Operators

If one of the reasons you decided to run your database on Kubernetes was to unify management for all your application’s components, operators will probably provide the experience you were looking for!

Instead of shoving your application into a StatefulSets model, you basically write (or use someone else’s) custom controller.

As a user, this allows you to use kubectl CLI to control your stateful application as a native kubernetes resource. For example, if you deployed an etcd operator, you may check your cluster’s backup status with the following kubectl command:

kubectl get EtcdBackup example-etcd-cluster

The main advantage of operators over StatefulSets is that they add an automation layer which is unique to the stateful application they operate. You won’t need to worry about how you’re going to add a backup cron to your Elasticsearch cluster implemented using StatefulSets. With operators, you just need to specify the bucket where this backup should be stored.

Unfortunately, since writing a new operator requires an understanding of Kubernetes and its APIs in addition to the specifics of the stateful applications, there aren’t many operators available at the moment, and the ones out there are still relatively new.

3. Other

This section is less defined, and basically meant to indicate that for specific databases, like the PostgreSQL example we’ll see in a second, there are other options for how to deploy and manage them as Docker containers on Kubernetes.

Sometimes, there are other options available rather than a StatefulSet or a dedicated operator implementation.

For example, Stolon, which I personally hadn’t have a chance to use but saw mentioned in a few threads, is a “cloud-native PostgreSQL manager for PostgreSQL high availability”.

To deploy Stolon on Kubernetes, you can use the supplied StatefulSets definition. However, because of Stolon’s capabilities, you won’t need to add your own cluster management automation to control PostgreSQL cluster. Stolon comes with its own CLI for that.

Summary

Kubernetes is a pretty intuitive platform when it comes to stateless applications. However, when dealing with database-like services, you need to put a bit more consideration into how you're going to deploy and manage them on Kubernetes. The good and bad news is that there are several options available.

***

Loved this article? Give this post some ? below.

The article was originally published at opsfleet.com

要查看或添加评论，请登录

Leonid Mirsky的更多文章

Why Kubernetes Is Winning?

2018年6月19日

Why Kubernetes Is Winning?

I started working with Puppet while I was still working full time in a regular job. Puppet was far from ideal, but…
Will EKS Simplify Your Migration To Kubernetes On AWS?

2018年4月24日

Will EKS Simplify Your Migration To Kubernetes On AWS?

Back in the day More than a year ago, setting up a Kubernetes cluster on AWS wasn’t an easy task. Here’s a quick recap…

8 条评论
Why You Should Consider Kubernetes Over a Custom Docker Deployment

2017年11月7日

Why You Should Consider Kubernetes Over a Custom Docker Deployment

A few years ago, when Docker burst into the tech world’s consciousness, best practices for deploying Docker containers…
Why Do Kubernetes Applications Need a Package Manager?

2017年10月15日

Why Do Kubernetes Applications Need a Package Manager?

You’re probably familiar with the concept of packages. Every time you install a new software using the apt-get or brew…
What Makes Kubernetes Difficult for Beginners?

2017年9月24日

What Makes Kubernetes Difficult for Beginners?

I just finished teaching a two-day Kubernetes workshop, and I had a few surprises. This wasn’t my first time presenting…

6 条评论
Will Heroku Always Be?Perfect?

2017年1月3日

Will Heroku Always Be?Perfect?

During a lifespan of a startup, your company’s needs will change and flex as you grow. At each phase of development…

See all articles

Running Database Services On Kubernetes

Leonid Mirsky

CEO & Founder @ Opsfleet | DevOps Services Agency for Tech-Companies

Why Is It Harder To Deploy Stateful Apps On Kubernetes?

Available Options For Running Stateful Applications

1. Stateful sets

2. Operators

3. Other

Summary

Leonid Mirsky的更多文章

社区洞察

其他会员也浏览了

Testing DBtune, showing PostgreSQL double buffering, and some thoughts about automated database tuning for SQL databases

Postgres for Everything IRL

Learning Kubernetes through Example (1/3): Deploying Django Web App with PostgreSQL on K8s Cluster

Optimizing Performance with MongoDB in Dockerized FastAPI Applications: Understanding the Strategy Behind Non-Dockerized Databases

What is MongoDB? How does SotaTek leverage MongoDB in Software Development projects?

Mastering Database Testing with Jest and SuperTest: A Hands-On Approach for PostgreSQL

Journey To Database World: Part 7 (Document Database - MongoDB As Example)

How to migrate the MERN stack MongoDB application to Oracle Autonomous Database 23ai

Discovering Docker Hub: The Central Repository of Docker Images

Building Scalable Multi-Tenant Systems with Django and PostgreSQL

Why Is It Harder To Deploy Stateful Apps On Kubernetes?

Available Options For Running Stateful Applications

1. Stateful sets

2. Operators

3. Other

Summary

Leonid Mirsky的更多文章

Why Kubernetes Is Winning?

Will EKS Simplify Your Migration To Kubernetes On AWS?

Why You Should Consider Kubernetes Over a Custom Docker Deployment

Why Do Kubernetes Applications Need a Package Manager?

What Makes Kubernetes Difficult for Beginners?

Will Heroku Always Be?Perfect?

社区洞察

其他会员也浏览了

Testing DBtune, showing PostgreSQL double buffering, and some thoughts about automated database tuning for SQL databases

Postgres for Everything IRL

Learning Kubernetes through Example (1/3): Deploying Django Web App with PostgreSQL on K8s Cluster

Optimizing Performance with MongoDB in Dockerized FastAPI Applications: Understanding the Strategy Behind Non-Dockerized Databases

What is MongoDB? How does SotaTek leverage MongoDB in Software Development projects?

Mastering Database Testing with Jest and SuperTest: A Hands-On Approach for PostgreSQL

Journey To Database World: Part 7 (Document Database - MongoDB As Example)

How to migrate the MERN stack MongoDB application to Oracle Autonomous Database 23ai

Discovering Docker Hub: The Central Repository of Docker Images

Building Scalable Multi-Tenant Systems with Django and PostgreSQL