登录查看更多内容

Database Replication: A Deeper Dive

Joel Ndoh

Software Engineer | Software Architect | DevOps | Cloud

发布日期: 2024年12月30日

In the previous episode, we explored replication as a horizontal scaling method, focusing on its application at the web application layer. We discussed how replication can be categorized as stateful or stateless and examined strategies like caching, sticky sessions, and session clustering to optimize performance and reduce latency.

Building on that foundation, today's episode delves into database replication, another essential facet of horizontal scaling. By replicating the database layer, we can achieve higher availability, better read scalability, and improved performance for distributed systems. Let's explore the two main approaches to database replication: Master-Slave (Primary-Secondary) and Master-Master (Peer-to-Peer) replication.

Database Replication

Database replication is the process of creating multiple copies of a database to improve performance, scalability, and availability.

When implementing database replication, two primary architectures are commonly used:

1. Master-Slave (Primary-Secondary) Replication

This model involves multiple replicas of the database:

A master (primary) handles both read and write operations.
Secondary replicas are provisioned for read operations only.

To keep these replicas synchronized, two update methods are used:

a. Asynchronous Replication

Updates to the secondary replicas occur after the master completes its write operation.

When a write operation is sent to the database, the primary replica first completes the update. After completion, it gradually propagates the changes to the secondary replicas. MongoDB provides this functionality by default when using Atlas clusters.

Pros:

Low latency for write operations on the master.

Cons:

Data loss risk if the master fails before updates propagate to secondaries.

Eventual consistency between replicas after the write operation completes.

Illustration of Master-slave Replication in database — Operation on Master and Delayed Update to Secondary Replicas

b. Synchronous Replication

Updates to both master and secondary replicas occur simultaneously during a write operation.

Pros:

Always consistent data between the master and secondaries.

Cons:

领英推荐

PostgreSQL Replication: A Detailed Guide

Thiago Azadinho - MBA/OCP/OCE/MCSE 9 个月前

Secrets to Database Scalability!

Pavan Belagatti 1 年前

Database Scalability Secrets!

Samson O. Sanyaolu 1 年前

High latency for write operations due to synchronization overhead.

Risk of deadlocks if a replica becomes unavailable during a write operation.

Proximity between replicas is crucial to minimize latency.

Difference between asynchronous replication and synchronous replication — Asynchronous replication vs Synchronous replication

Advantages of Master-Slave Replication:

High read scalability.
High read availability.
No write conflicts.

2. Master-Master (Peer-to-Peer) Replication

Master-Master replication reduces write latency by allowing all replicas to handle both read and write operations. Each replica synchronizes with others, ensuring bi-directional data replication.

Use Case:

Ideal for geographically distributed systems where proximity to users is essential for low write latency.

Pros:

High read scalability.
High read and write availability.

Cons:

Risk of write conflicts if simultaneous updates occur on different replicas.
Transaction ordering issues due to time zone differences (data skew).

Representation of master to master database replication — World map with multiple database nodes showing bi-directional replication

Choosing the Right Replication Strategy

Your choice between Master-Slave and Master-Master replication depends on:

Workload: High read-heavy workloads may benefit more from Master-Slave setups.
Latency requirements: Geographically distributed systems favor Master-Master replication.
Consistency needs: Synchronous replication ensures data accuracy, while asynchronous replication prioritizes speed.

Difference between Master-Slave Replication and Master-Master Replication — Master-Slave vs Master-Master Replication

This episode delves into the core concepts of database replication and its practical applications. In upcoming episodes, we'll explore asynchronous processing in depth, including its use cases in e-commerce applications.

Stay tuned!

Software Architecture

488 位关注者

Promise Uchegbunam

Software Engineer | Cyber Security | Building & Breaking Systems | Go & TypeScript

2 个月

Great Article Joel Ndoh. Might wanna consider using “Primary/Secondary” as opposed to “Master/Slave” going forward. The “Master/Slave” connotation is continuously being phased out in modern DB architecture.

3 次回应

查看更多评论

要查看或添加评论，请登录

Joel Ndoh的更多文章

Routing Techniques for Database Partitioning

2025年3月26日

Routing Techniques for Database Partitioning

Why Routing Matters in Database Partitioning When dealing with large, distributed databases, routing becomes essential…
Scaling Databases with Partitioning: Vertical vs. Horizontal

2025年3月21日

Scaling Databases with Partitioning: Vertical vs. Horizontal

In previous editions of my newsletter, I've discussed how splitting services in a system enhances scalability in large…

1 条评论
Asynchronous Processing in Large Systems

2025年1月14日

Asynchronous Processing in Large Systems

Asynchronous processing involves delegating tasks that don’t require immediate user feedback or interaction. This…
Scalability: Replication in Web Applications to Handle 50 million Concurrent Requests

2024年10月11日

Scalability: Replication in Web Applications to Handle 50 million Concurrent Requests

In the previous episode, we introduced the concept of scalability—the ability to handle increasing traffic or data by…

3 条评论
Scalability: An Overview

2024年10月4日

Scalability: An Overview

In today's digital world, scalability is crucial for ensuring that a system can handle increasing workloads without…
Dynamic Caching (Part 2)

2024年9月23日

Dynamic Caching (Part 2)

In the last episode, we discussed two types of caching that can be used in a web application: Static Caching: This is…
Caching: Exploring the Five Key Layers in a Modern Web Application

2024年9月13日

Caching: Exploring the Five Key Layers in a Modern Web Application

Introduction: Caching is a crucial technique for optimizing the performance of web applications. By strategically…
Deadlock in Large Systems

2024年9月5日

Deadlock in Large Systems

In the complex world of software architecture, deadlocks are one of those issues that can silently creep in and bring…
Locking in Systems: Minimizing Contention for a Smoother Ride

2024年8月30日

Locking in Systems: Minimizing Contention for a Smoother Ride

When we talk about locking in databases, it’s like controlling access to specific data to ensure that everything stays…
Understanding Latency in Concurrent Request Processing Systems

2024年8月22日

Understanding Latency in Concurrent Request Processing Systems

In today’s high-performance systems, one of the most critical factors that can make or break user experience is…

See all articles

Database Replication: A Deeper Dive

Joel Ndoh

Software Engineer | Software Architect | DevOps | Cloud

Database Replication

1. Master-Slave (Primary-Secondary) Replication

领英推荐

2. Master-Master (Peer-to-Peer) Replication

Choosing the Right Replication Strategy

Software Architecture

488 位关注者

Joel Ndoh的更多文章

社区洞察

其他会员也浏览了

Understanding Replication: Statement, WAL, Logical Log, and Trigger-Based Approaches

Best practices for running PostgreSQL on Kubernetes

Raft Replication in Oracle Database 23ai: High Availability and Scalability made simple

Unlocking the Power of Wal2Json in Ubuntu with PostgreSQL

Understanding MongoDB Replication

System Design : SCALE FROM ZERO TO MILLIONS OF USERS: Part 2(Final)

Multi-master replication solution for PostgreSQL

Mastering Database Management Strategies, Techniques, and Best Practices for Success

When the Lights Go Out: How MongoDB Replication Keeps Your Data Alive

Database Replication

1. Master-Slave (Primary-Secondary) Replication

领英推荐

2. Master-Master (Peer-to-Peer) Replication

Choosing the Right Replication Strategy

Software Architecture

488 位关注者

Joel Ndoh的更多文章

Routing Techniques for Database Partitioning

Scaling Databases with Partitioning: Vertical vs. Horizontal

Asynchronous Processing in Large Systems

Scalability: Replication in Web Applications to Handle 50 million Concurrent Requests

Scalability: An Overview

Dynamic Caching (Part 2)

Caching: Exploring the Five Key Layers in a Modern Web Application

Deadlock in Large Systems

Locking in Systems: Minimizing Contention for a Smoother Ride

Understanding Latency in Concurrent Request Processing Systems

社区洞察

其他会员也浏览了

Understanding Replication: Statement, WAL, Logical Log, and Trigger-Based Approaches

Best practices for running PostgreSQL on Kubernetes

Raft Replication in Oracle Database 23ai: High Availability and Scalability made simple

Unlocking the Power of Wal2Json in Ubuntu with PostgreSQL

Understanding MongoDB Replication

System Design : SCALE FROM ZERO TO MILLIONS OF USERS: Part 2(Final)

Multi-master replication solution for PostgreSQL

Mastering Database Management Strategies, Techniques, and Best Practices for Success

When the Lights Go Out: How MongoDB Replication Keeps Your Data Alive