登录查看更多内容

Scaling from zero to millions of users - Database replication

Lucas Ferreira

Senior Software Engineer | Scala | Javascript | Clojure

发布日期: 2025年1月22日

In the previous chapter, we talked a lot about load balancers, strategies and how, using a LB we can add more server nodes.

For this chapter, we are getting back to databases, specially database replication. With this topic, we're looking to bring to our solution a better performance, reliability and availability. And some more complexity, sorry about that, as trade-off.

Let’s go for it!

From where we stopped, this is the current state of our application (Image 1).

Our current architecture (Image 1) relies on a single database server. With three application servers now in place, this single point of failure and potential bottleneck needs to be addressed.

So, the big question here is: What is Database Replication?

Before moving on with this topic, I wanted to let you know that the words "master" and "slave" here are not ideal nor appreciated by the author. They're just the common terms used by the whole industry. For the sake of my mind, I'll call it main and replicas.

Imagine making copies of your important files and storing them in different places. That's essentially what database replication does. It creates copies of your database and keeps them synchronized.

A common way to do this is with a "master/slave" setup. Think of this "master" as the original, main database. It's the only one that accepts changes (like adding new data, updating existing data, or deleting data). The "slaves" are the replicas of the master. They only allow reading data.

Most of the time, people read data much more often than they change it. So, it's common to have more replica databases than main databases. (Image 2)

So, why using database replication can be helpful:

Faster Performance: When someone needs to read data, they can get it from a replica. This spreads out the work and makes things faster, especially when lots of people are using the system at the same time.
Reliability: If one database server has a problem (like a hardware failure or a natural disaster), the data is still safe because it's copied on other servers.
Availability: Even if one database goes offline, your website or application can keep working by using one of the other copies.

领英推荐

How databases are managed in production?

Arpit Bhayani 2 年前

SELECT news FROM Yugabyte - June 24

Yugabyte 8 个月前

Load Balancing with Pgpool

Thiago Azadinho - MBA/OCP/OCE/MCSE 1 个月前

Getting back to reliability, what would happen if one of the databases went offline?

Remember how load balancers help keep your website running even if a server goes down? Database replication does something similar for your data. Check again Image 2.

For a replica failure: If you have multiple replicas, the system simply uses the other healthy ones. If you only have one replica, the system can temporarily read directly from the main. Then, a new replica is created to replace the broken one.

For a main db failure: One of the replicas is promoted to become the new main. This is a bit more complicated because the replica might not have the latest changes. Some extra steps might be needed to make sure everything is up-to-date. There are more advanced ways to handle this, but they are more complex and we won't cover them here.

So, this is how all of this fit together (Image 3):

A user's computer asks a DNS server for the website's address (IP address).
The user's computer connects to a load balancer using that IP address.
The load balancer sends the user's request to one of the web servers (Server 1, 2 or 3).
The web server reads data from a replica database.
If the user needs to change data (write, update, or delete), the web server sends that request to the main database.

Sounds better this way, huh?

But we can get faster. For next chapter, we'll talk about using a cache to store frequently used data and using a Content Delivery Network (CDN) to deliver static files like images and videos.

See you in a few days!

Previous chapter:

要查看或添加评论，请登录

Lucas Ferreira的更多文章

Scaling from zero to millions of users - Load balancer

2025年1月17日

Scaling from zero to millions of users - Load balancer

In the previous chapter, we enhanced our foundation, by discussing scaling horizontally vs vertically. We also…

1 条评论
Scaling from zero to millions of users - Databases

2025年1月13日

Scaling from zero to millions of users - Databases

In the previous chapter, we built the initial structure of our system. Now, let’s move our focus to databases - the…

1 条评论
Scaling from zero to millions of users - Single server setup

2025年1月10日

Scaling from zero to millions of users - Single server setup

Designing a system capable of handling millions of users is no easy job—it’s an iterative process that demands constant…

2 条评论

Scaling from zero to millions of users - Database replication

Lucas Ferreira

Senior Software Engineer | Scala | Javascript | Clojure

领英推荐

Lucas Ferreira的更多文章

社区洞察

其他会员也浏览了

Tuning Tips to Maximize Your PostgreSQL Performance

SQL database replication: Logical or Physical?

Azure SQL Database Service Tiers: In-Depth Analysis of Backup Retention and Point-in-Time Restore

Simplify Database Management with Tessell: The Benefits of "One Throat to Choke" in the Cloud

MariaDB Master Slave Replication in Docker Containers

Best practices for running PostgreSQL on Kubernetes

AWS Read Replica vs Multi-AZ

SQL Backup Master

A Comprehensive Guide to Migrating from Oracle to Packet

Database Management and Security

领英推荐

Lucas Ferreira的更多文章

Scaling from zero to millions of users - Load balancer

Scaling from zero to millions of users - Databases

Scaling from zero to millions of users - Single server setup

社区洞察

其他会员也浏览了

Tuning Tips to Maximize Your PostgreSQL Performance

SQL database replication: Logical or Physical?

Azure SQL Database Service Tiers: In-Depth Analysis of Backup Retention and Point-in-Time Restore

Simplify Database Management with Tessell: The Benefits of "One Throat to Choke" in the Cloud

MariaDB Master Slave Replication in Docker Containers

Best practices for running PostgreSQL on Kubernetes

AWS Read Replica vs Multi-AZ

SQL Backup Master

A Comprehensive Guide to Migrating from Oracle to Packet

Database Management and Security