The Role of A Container Cluster Manager
Image source: https://www.flickr.com/photos/final_infinite/8186287810

Deploying Containers on a Single Container Host

Almost all container runtimes have been designed to run containers on a single container host. This is by design: containers share the host operating system kernel and rely on features such as cgroups, namespaces, chroot, SELinux and seccomp for isolation and security. Therefore a given set of containers may need to run on a single container host. At the moment none of the available container runtimes provides a mechanism for integrating multiple container hosts to share the workload, other than using a container cluster manager. Figure 1 illustrates how a software solution deployed on a set of VMs can be moved to a containerized environment using a single container host:

This deployment model is straightforward, easy to set up and simple to use. It fits very well for setting up a development environment on developer machines. However, when moving the software solution beyond the development environment into QA, performance test, pre-production and production environments, the following limitations must be borne because the containers are deployed on a single container host:

  • Single point of failure: Since all the containers run on a single host, if the host fails at some point, all the containers fail with it. As a result, the entire software system running on those containers becomes unavailable.
  • Resource constraints: Since there is only one container host available, the containers will reach its resource (CPU, memory, disk) limits at some point. Beyond that, the system cannot scale up unless the host is scaled vertically.
  • No auto healing and autoscaling features: Currently none of the container runtimes provide auto healing or autoscaling features for containers. Therefore these either need to be managed by a human or automated using additional software components.
  • Limited container orchestration features: Containers may need some level of orchestration for container grouping, container cluster grouping, handling dependencies, health checking, etc. when deploying a composite application. Docker provides a solution with Docker Compose; however, it has some limitations even when Docker's own container cluster manager, Docker Swarm, is used.
  • Limited service discovery features: Components of a composite application need some mechanism to interact with each other. The easiest method is to use domain names to discover their dependencies. Docker Compose solves this problem with Docker links: if container A depends on container B, a link to container B can be specified when starting container A. Docker then generates an /etc/hosts entry in container A pointing to container B's IP address. This works for 1:1 scenarios but may not work for 1:M use cases.
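For illustration, a Docker-generated link entry inside container A's /etc/hosts might look like the following (the container name and IP address here are hypothetical):

```
# /etc/hosts inside container A, written by Docker when linking to container B
172.17.0.3    containerB
```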

Deploying Containers on a Collection of Container Hosts

The easiest way to solve the above problems might be to use a collection of container hosts. Please refer to the diagram below:

This approach may look simple and straightforward, but it has the following implications:

  • Independently operated container hosts: Even though a collection of container hosts is used, they have no knowledge of each other for distributing the workload.
  • Container management overhead: A human would need to manually interact with the system and distribute containers among the container hosts to ensure the high availability of each application component; this is called container scheduling. In real life, this is not practical with immutable, short-lived containers. A programmatic approach is needed for scheduling containers, auto healing and autoscaling them.
  • Disconnected bridge container networks: Since each container host has its own container bridge network, container IP addresses are leased by the individual host. As a result, container IP addresses get duplicated across the hosts. More importantly, there is no direct routing between containers on different hosts. This becomes a problem when deploying a composite application which needs internal routing among application component containers.
  • No dynamic load balancing: Let's assume that containers expose their ports using host ports, and that containers of a given application component may be available on multiple container hosts. In such a situation a load balancer needs to be manually configured according to container availability, pointing to container host IPs and host ports.

Deploying Applications on a Container Cluster Manager

The above diagram illustrates a reference architecture for a container cluster manager. In this approach, almost all the issues identified in the second approach (using a collection of container hosts) are solved by programmatically managing the container host cluster and the container clusters that run on top of it:

  • Scheduler

The scheduler is the brain of the container cluster manager. It monitors the entire container cluster by analyzing the resource utilization of each host, and makes container scheduling decisions to optimize resource usage and the high availability of the containers.
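The scheduling idea can be sketched as a least-loaded placement function. This is a minimal illustration only, not any real scheduler's algorithm (production schedulers such as Kubernetes' weigh many more factors); the host and container structures are assumptions made for the example:

```python
# Minimal sketch of a least-loaded container scheduler (illustrative only).

def schedule(container, hosts):
    """Pick the host with the most free capacity that can fit the container."""
    candidates = [
        h for h in hosts
        if h["free_cpu"] >= container["cpu"] and h["free_mem"] >= container["mem"]
    ]
    if not candidates:
        raise RuntimeError("no host has enough free capacity")
    # Prefer the host with the largest combined free capacity.
    best = max(candidates, key=lambda h: h["free_cpu"] + h["free_mem"])
    best["free_cpu"] -= container["cpu"]
    best["free_mem"] -= container["mem"]
    return best["name"]

hosts = [
    {"name": "host-1", "free_cpu": 2.0, "free_mem": 4096},
    {"name": "host-2", "free_cpu": 4.0, "free_mem": 8192},
]
print(schedule({"cpu": 1.0, "mem": 1024}, hosts))  # host-2 (most free capacity)
```

A real scheduler would also consider constraints such as affinity rules, spreading replicas across hosts for high availability, and rebalancing when hosts join or fail.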

  • Agent

The agent component runs on each container host, providing host management capabilities and sending resource usage statistics to the scheduler. Whenever the scheduler wants to create or terminate a container instance on a host, it talks to the relevant agent and lets it execute the container management command.

  • Overlay Network

The overlay network can be implemented using a software-defined networking (SDN) framework such as Flannel, Open vSwitch or Weave. The main advantage of using such a solution is that all the containers in the container cluster get connected to a single network with container-to-container routing. More importantly, containers get IP addresses that are unique across the container hosts, leased by the SDN, and can, if needed, integrate with the physical network of the container host cluster.
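One common way such frameworks keep container IPs unique is to carve a per-host subnet out of a cluster-wide address range (Flannel, for example, works this way by default). The sketch below illustrates the idea only; the CIDR ranges and host names are assumptions, not any framework's actual API:

```python
# Sketch of per-host subnet allocation: each host leases container IPs from
# its own /24 carved out of a cluster-wide /16, so IPs never collide.
import ipaddress

cluster_cidr = ipaddress.ip_network("10.244.0.0/16")
subnets = cluster_cidr.subnets(new_prefix=24)  # generator of per-host /24s

host_subnets = {}
for host in ["host-1", "host-2", "host-3"]:
    host_subnets[host] = next(subnets)

def allocate_ip(host, index):
    """Lease the index-th usable address in the host's subnet for a container."""
    return str(host_subnets[host][index + 1])  # skip the network address itself

print(allocate_ip("host-1", 0))  # 10.244.0.1
print(allocate_ip("host-2", 0))  # 10.244.1.1 -- unique across hosts
```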

  • DNS

DNS is another key element of a container cluster management system. It mainly serves two purposes:

Providing domain names for containers and container clusters. For example, if an application server is deployed on a set of containers, each container and the container cluster as a whole may need domain names for accessibility.

Service discovery with DNS round robin: if application-layer routing is not needed, a DNS server can provide round-robin load balancing below the application layer of the OSI model.
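DNS round robin can be sketched as a name-to-address map that rotates through the container IPs of a cluster on each lookup. This is an illustration of the concept only; the domain name and addresses are made up for the example:

```python
# Sketch of DNS round-robin service discovery: each lookup of a service name
# returns the next container IP in the cluster.
from itertools import cycle

records = {
    "appserver.local": cycle(["10.244.0.1", "10.244.1.1", "10.244.2.1"]),
}

def resolve(name):
    """Return the next A record for the name, emulating round-robin DNS."""
    return next(records[name])

print(resolve("appserver.local"))  # 10.244.0.1
print(resolve("appserver.local"))  # 10.244.1.1
```

Note that plain round robin is oblivious to container health, which is one reason health checking (discussed below) matters: dead containers must be removed from the record set.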

  • Load Balancer

The container cluster manager can dynamically configure a third-party load balancer such as Nginx or HAProxy to provide load balancing for containers at the application layer. This suits routing HTTP traffic well when session affinity is needed for UI components. Moreover, it provides the ability to do hostname-based routing while exposing well-known HTTP ports such as 80 and 443, without having to expose dynamic host ports.
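Dynamic reconfiguration typically works by re-templating the load balancer's configuration from the current container list whenever it changes. The sketch below renders an Nginx-style upstream and hostname-routed server block; the template, service name and domain are assumptions for illustration, not a real cluster manager's output:

```python
# Sketch: template an Nginx config fragment from the live container endpoints.

def render_upstream(service, endpoints):
    """Render an Nginx upstream block plus a hostname-based server block."""
    servers = "\n".join(f"    server {ip}:{port};" for ip, port in endpoints)
    return (
        f"upstream {service} {{\n{servers}\n}}\n"
        f"server {{\n"
        f"    listen 80;\n"
        f"    server_name {service}.example.com;\n"
        f"    location / {{ proxy_pass http://{service}; }}\n"
        f"}}\n"
    )

config = render_upstream("appserver", [("10.244.0.1", 8080), ("10.244.1.1", 8080)])
print(config)
```

A real implementation would write this fragment to the Nginx configuration directory and trigger a reload whenever containers are created or terminated.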

Key Features Required in a Container Cluster Manager

The following key features would be needed in a container cluster manager for deploying composite, complex applications in production:

  • Container grouping: Group a set of containers together, sharing disk, processes, users, etc. via Linux namespaces.
  • Container cluster management: Manage a group of container groups as a cluster of an application component.
  • Application health checking: This is essential for maintaining the list of active containers in the load balancer routing rules and for auto healing.
  • Auto healing: Application components can try to recover from catastrophic situations by restarting failed containers.
  • Horizontal autoscaling: An application component cluster can be scaled horizontally by increasing the number of containers.
  • Domain naming and service discovery: These are important for deploying a composite application on containers, so that components can find each other by name.
  • Dynamic load balancing: A container cluster manager needs to dynamically configure a load balancer, as container ports and host ports can change at runtime according to the deployment.
  • Centralized log access: Accessing the logs of hundreds of containers on the containers themselves would be nearly impossible. Therefore the cluster manager needs to provide a mechanism to access logs from a central location.
  • Multi-tenancy: Multi-tenancy might be essential for sharing a single container cluster manager instance among multiple tenants.
  • Identity and authorization: Identity and authorization management is needed both for the cluster manager and for the applications deployed on top of it.
  • Mounting storage systems: Applications which need persistent storage need volume mounts to avoid losing data written to disk when containers restart.
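The health-checking and auto-healing items above can be sketched as a tiny reconciliation loop. This is a minimal illustration, not a real cluster manager API; the probe and restart callbacks are stand-ins for actual liveness probes and container runtime calls:

```python
# Sketch of a health-check / auto-heal loop: restart any container whose
# health probe fails (probe and restart are stubbed for illustration).

def reconcile(containers, probe, restart):
    """Restart every container whose probe fails; return the restarted ones."""
    restarted = []
    for container in containers:
        if not probe(container):   # e.g. an HTTP GET against a health endpoint
            restart(container)     # e.g. ask the host agent to recreate it
            restarted.append(container)
    return restarted

# Example with stubbed probes: "app-2" is unhealthy and gets restarted.
healthy = {"app-1": True, "app-2": False, "app-3": True}
restarts = []
print(reconcile(list(healthy), healthy.get, restarts.append))
```

A production cluster manager runs such a loop continuously, and combines it with the scheduler so that a replacement container can land on a different, healthier host.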

Conclusion

In summary, containers can be run on a single container host while bearing some limitations: a single point of failure, resource constraints, no auto healing or autoscaling, limited container orchestration features, limited service discovery features, etc. On the positive side, this avoids the overhead of setting up a container cluster manager. Docker has solved the problem of deploying composite applications on a single container host with Docker Compose. Compose also works with Docker's own container cluster manager, Docker Swarm, but with some limitations. Therefore a production-grade composite application deployment may need a container cluster manager which can handle complex deployment requirements such as container grouping, container cluster management, application health checking, auto healing, horizontal autoscaling, domain naming, service discovery, dynamic load balancing, centralized log access, multi-tenancy, identity and authorization, and mounting storage systems.


