DevOps for On-premises Software

The very beginning: handmade packages

Years ago, Flussonic lived in a single repository together with its frontend, and the Debian package was created with a couple of commands:

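The original screenshot is not available in this export. As a rough sketch (the clone URL is hypothetical and the Makefile internals are not shown), the whole build boiled down to something like:

    git clone git@github.com:flussonic/flussonic.git   # hypothetical clone URL
    cd flussonic
    make deb    # compiles the sources and packs them into an installable .deb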

Such an approach was enough for a start. Any build server (especially in the late 2000s) is not easy to maintain: if you configure it properly and leave it alone for a couple of years, you will have forgotten everything about it by the time it breaks. So it is very convenient when everything needed to build an installable package lives next to your sources.

What is inside this make deb? There is https://github.com/flussonic/epm, an erlang package manager. The tool was rewritten from the excellent https://github.com/jordansissel/fpm.

We write in erlang and did not want to install ruby just to make a package, which was very important in the pre-Docker era. This make deb was launched on Mac OS X, so, as a Mac developer, I could build a package right on my machine and immediately install it on Debian.

But what about RPM? People use it too. A big customer of ours forced us to build rpm packages for it, and I became (possibly) the second person on Earth to implement an rpm package writer. It was a very traumatic experience and I hope I will eventually forget it, because rpm is the most undesigned and brain-damaging thing. Why didn’t I just compile rpm for macOS? Because that was an even more complicated task than implementing an rpm writer.

Those were the easy times when it was still possible to run all tests before pushing a commit, and they were all green.

So this is what we had several years ago:

  • The build server was hard to maintain with a small team when no dedicated engineer is responsible for it
  • It is possible to compile erlang to bytecode on a Mac and run it on Debian
  • We had a simple erlang tool to create the package by hand
  • Releases were rare, so uploading by hand was OK
  • RPM is possible, but never try to repeat this experience. Use alien

Nightly build automation

Our support team was growing together with the user base and was giving intermediate builds to customers more and more often. They got tired of the monotonous process of building the package and uploading it themselves. At that time we moved to self-hosted gitlab and decided to try its CI mechanism:

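The CI configuration screenshot is also missing here. A minimal .gitlab-ci.yml sketch of such a job could look like this (the stage, upload command and repository host are illustrative, not our actual setup):

    # Sketch of a master-branch packaging job; commands and the upload target are assumptions.
    build-deb:
      stage: build
      only:
        - master
      script:
        - make deb
        - scp flussonic_*.deb repo@apt.example.com:/srv/repo/master/   # "nightly" repository folder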

The easiest step was to set up package building and uploading it to another repository from the CI runner. Now we had two repositories: one for releases (still made and uploaded manually) and the other for packages built from the master branch.

We hadn’t adopted Docker yet, so the runner machine had to be configured properly for this task. However, the first step was done: the master branch was now packaged by a robot, producing “nightly builds” several times per day.


Multi-branch packaging

As our development team was also growing, some practices changed and git branches were used more and more often. The support team started deploying packages from branches to customers during the integration or support process.

A new task appeared: we decided to create per-branch repositories so that the flussonic package could be installed straight from a git branch.

Why is it complicated? Because we haven’t yet discussed Flussonic dependencies. We have our own packages for erlang, python, etc., and all of these packages have to be pushed to every repository that we want to distribute from.

So, when we try to build a package from a new branch, we need to somehow put all dependencies into this new branch.

Here we’ve got two problems:

  • our dependencies had grown to almost 1 gigabyte
  • they were all still compiled and packed by hand. Why? Because it worked, and they change rarely.

Dependencies packaging

Flussonic dependencies change very rarely. It takes us several months to test and migrate to a new version of Erlang. All libraries like Cowboy (a web server) are vendored inside the Flussonic repository and are considered a part of the main package, so here we speak only about Erlang, Python, Caffe, etc.

While developing the dependency maintenance process, we moved to Docker and to cross-compilation. Now we support 5 architectures: amd64, arm64, musl64 (alpine on amd64), armhf and e2k (Elbrus processor).

All dependencies (about 20 packages in total, multiplied by the architectures) are built in the Docker ecosystem. We have a Dockerfile for each of them and use docker build. This is very important because docker run always runs all commands, while docker build can aggressively cache every step in the Dockerfile.

Always try to use docker build when possible! Take care of the instruction order, because a wrong order can invalidate your cache when it is not required. For example:

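The screenshot is missing, so here is a hedged reconstruction of the anti-pattern described below (the base image, paths and script names are assumptions):

    # Anti-pattern: the whole tools/ directory is copied before the heavy download step.
    # Any change in tools/ invalidates the cache, and the downloads run again.
    FROM debian:buster
    COPY tools/ /build/tools/
    RUN /build/tools/fetch-prerequisites.sh
    RUN /build/tools/build-deps.sh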

This is not good code, because if you change just a single line in any file in tools, you will download everything again. Here is better code:

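Again a reconstruction rather than the original screenshot, under the same assumptions:

    # Better: only the fetch script itself invalidates the download layer.
    FROM debian:buster
    COPY tools/fetch-prerequisites.sh /build/
    RUN /build/fetch-prerequisites.sh
    COPY tools/ /build/tools/
    RUN /build/tools/build-deps.sh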

It is better, but it would be even better not to write the fetch-prerequisites.sh code at all:

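And a sketch of the third variant, with no fetch script at all: every download becomes its own Dockerfile step, cached until that exact line changes (URLs are illustrative):

    # Best: each download is a separate, individually cached step.
    FROM debian:buster
    RUN apt-get update && apt-get install -y wget
    RUN wget -P /build http://erlang.org/download/otp_src_22.3.tar.gz
    RUN wget -P /build https://www.python.org/ftp/python/3.8.2/Python-3.8.2.tar.xz
    COPY tools/ /build/tools/
    RUN /build/tools/build-deps.sh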

You do not need to squeeze the Dockerfile into as few steps as possible: fine granularity helps you get a good cache hit/miss ratio during the build process.

Why is it important? When I take my 32-core AMD Threadripper to rebuild all our dependencies from scratch, it takes around 2 hours. With proper caching, it takes about 3 seconds, plus about 3 seconds to “upload” them. With such optimization, we can virtually build dependencies on each commit.

There is an alternative approach: build them once, upload them to Docker Registry as a tagged image and take them from this image. This approach is also good, but we do not use it because:

  • we have source code as a single source of truth and we maintain rebuildability of prerequisites
  • Docker Registry doesn’t have a good atomic mechanism for “pull or build”. It is not easy to write code like docker pull || (docker build && docker push) correctly, as sketched below
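For reference, the naive shell version of that pattern looks like this (the image tag is illustrative); it works, but nothing prevents two concurrent pipelines from both building and pushing the same image:

    # Naive "pull or build": no locking between concurrent CI jobs.
    IMAGE=registry.example.com/flussonic/deps-erlang:22.4
    docker pull "$IMAGE" || (docker build -t "$IMAGE" deps/erlang && docker push "$IMAGE")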

I have mentioned that it takes 3 seconds to upload 1 GB of dependencies. How is that possible? We have implemented a tricky conditional upload mechanism.

Uploading done right

The problem is: you have a file that weighs hundreds of megabytes, and you are 99% sure that this file already exists somewhere nearby on the upload target server. Why? Because you have just created a branch and you are uploading a package to a separate branch folder. Dependency packages change very rarely, so you can be quite sure that the same erlang package already sits nearby in the master/ folder and can simply be linked into the new folder.

We have deployed a special upload script that handles a GET request with an X-Sha1 header before the actual upload. If there is no file with such a name in the required folder, the script looks for a file with this sha1 in nearby folders. The SHA1 of each file is, of course, stored on disk in erlang_22.4_amd64.deb.sha1 files.

If such a file exists nearby, it is linked to the new folder. It takes several milliseconds, so uploading of all dependencies usually takes no more than several seconds.
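A minimal client-side sketch of that conditional upload, with a hypothetical endpoint and file name (the real scripts are not shown in the article):

    #!/bin/sh
    # Ask the server whether it already has a file with this SHA1 nearby before sending the payload.
    FILE=erlang_22.4_amd64.deb
    SHA1=$(sha1sum "$FILE" | cut -d' ' -f1)
    URL="https://repo.example.com/upload/my-branch/$FILE"   # illustrative upload endpoint

    if curl -sf -H "X-Sha1: $SHA1" "$URL" >/dev/null; then
        echo "server already has this content and linked it into the branch folder"
    else
        curl -sf -T "$FILE" -H "X-Sha1: $SHA1" "$URL"       # real upload only on a cache miss
    fi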

An important point: it is convenient to upload dependencies directly from the Docker image that was built in the previous step.

Our upload tool also sends X-Md5 and X-Sha256 headers: everything that is required for creating the index of the Debian repository. This is important because when the server script rebuilds the repository, it doesn’t recalculate the checksums of the content, it just takes them from the cached files.
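As an illustration of that caching (the file layout is an assumption), the index builder can read the sidecar files instead of re-hashing gigabytes of .deb content:

    # Use the cached sidecar checksum when it is present; hash the package only as a fallback.
    pkg=erlang_22.4_amd64.deb
    if [ -f "$pkg.sha1" ]; then
        sha1=$(cat "$pkg.sha1")
    else
        sha1=$(sha1sum "$pkg" | cut -d' ' -f1)
    fi
    printf 'SHA1: %s\n' "$sha1"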

Package validation

Usually, tests run on the sources. It means that we can still break something after that, while building a package, so the package itself also has to be tested.

We have added this to the Gitlab CI pipeline. Right after building and uploading all dependencies and building the flussonic package, we launch a Docker container and install our new package into it. Then we run a set of quick acceptance tests, which are quite trivial: we configure Flussonic with a real license, give it a real config file with a real file source and run checks via HTTP.
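As a rough sketch of such a validation job (the repository URL, config, service name and port are assumptions, not our actual pipeline):

    # Illustrative package-validation job in .gitlab-ci.yml.
    validate-package:
      stage: test
      image: debian:buster   # clean image, nothing preinstalled
      script:
        - echo "deb [trusted=yes] http://repo.example.com/$CI_COMMIT_REF_NAME/ ./" > /etc/apt/sources.list.d/branch.list
        - apt-get update
        - apt-get install -y flussonic                           # pulls our erlang and other dependencies too
        - cp ci/acceptance.conf /etc/flussonic/flussonic.conf    # hypothetical config with a real file source
        - service flussonic start                                # hypothetical service name
        - curl -sf http://localhost:8080/ >/dev/null             # quick acceptance check over HTTP; port is illustrative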

So we can be sure that the package did not get broken during its creation.

Multi-project packaging

As our team continued growing, we extracted the Flussonic Watcher project out of the Flussonic git repository. Flussonic Watcher is our product for video surveillance. It is a python backend + React frontend. They now live in two different repositories that are periodically merged into a single package, flussonic-watcher_vsn_amd64.deb.

They also have branches and their own versioning. How can we combine two projects in a single repository? Why is it important? Because we have one support team and we give a single repository to our customers.

How can we make sure that when a Watcher team member creates his own branch, the corresponding folder on the repository server also contains Flussonic and the Flussonic dependencies?

The trick is very simple: when a new branch is created and some package is uploaded to the repository, our server script links all the latest packages from the master/ folder into this new branch folder. So flussonic always has flussonic-watcher, and flussonic-watcher always has flussonic in the branch folder.
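A rough server-side sketch of that linking step (the folder layout is an assumption, not our actual script):

    #!/bin/sh
    # When the first package arrives for a new branch, pre-populate the branch folder
    # with hard links to the latest packages from master/.
    BRANCH_DIR="/srv/repo/branches/$1"   # the branch name is the first argument
    MASTER_DIR=/srv/repo/master

    mkdir -p "$BRANCH_DIR"
    for pkg in "$MASTER_DIR"/*.deb; do
        name=$(basename "$pkg")
        [ -e "$BRANCH_DIR/$name" ] || ln "$pkg" "$BRANCH_DIR/$name"
    done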

TL;DR

  • we were building and uploading Debian packages by hand
  • then we moved to automated building and uploading
  • migrated to building packages inside Docker images (not containers)
  • then we moved flussonic dependencies to Docker images
  • implemented a virtual (caching) upload mechanism
  • created a repository handling script that maintains the per-branch repositories
  • combined several projects on a single repository server
  • started installing and validating the package before uploading

What’s next?

Right now we need to solve a problem with branch folder deletion. When Gitlab deletes a branch (on merge request acceptance), no hook is called, so we cannot delete the corresponding folder right away and have to run a cron script that deletes empty folders.
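For now the cleanup is just a cron one-liner along these lines (the path is illustrative):

    # Hypothetical cron job: drop branch folders that have become empty.
    find /srv/repo/branches -mindepth 1 -maxdepth 1 -type d -empty -delete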

I don’t like cron scripts because someone has to remember about them. It would be good to integrate Gitlab with the Debian repository, but we don’t have a solution for this yet.
