ç™»å½•æŸ¥çœ‹æ›´å¤šå†…å®¹

CQRS (Command Query Responsibility Segregation) in Distributed Systems

Diwakar Shukla

Technical Lead @ Paytm | Fintech | Lending | Problem Solver | IoT

å‘å¸ƒæ—¥æœŸ: 2024å¹´9æœˆ6æ—¥

Introduction

In distributed systems, handling the complexity of reads and writes is essential for scalability and performance. CQRS (Command Query Responsibility Segregation) is a pattern that separates the responsibility of handling commands (writes) from queries (reads). This separation allows for independent scaling, optimization, and distinct handling strategies for read and write operations.

This blog post dives deep into the CQRS pattern, its benefits, and how it can be implemented in a distributed system. I have tried to put code examples, real-world use cases, and best practices for using CQRS in large-scale distributed systems.

What is CQRS?

CQRS stands for Command Query Responsibility Segregation, a design pattern that separates the models used to handle read operations (queries) from those used to handle write operations (commands).

Commands change the state of the system and represent business operations (like creating, updating, or deleting entities).
Queries are used to read data from the system without changing its state.

By decoupling these responsibilities, CQRS allows for optimized handling of read and write operations in large-scale distributed systems.

Why Use CQRS in Distributed Systems?

Scalability: Since reads and writes can have different performance and scaling requirements, CQRS allows you to scale them independently.
Performance Optimization: In a typical CRUD (Create, Read, Update, Delete) model, both reads and writes are handled by the same data model. This can lead to inefficiencies. With CQRS, each operation can use optimized data models, improving performance.
Eventual Consistency: CQRS naturally complements the Event Sourcing pattern, where writes are handled as events and read models are eventually consistent.
Flexibility: In distributed systems, different services might need different data models for reads and writes. CQRS enables this flexibility by allowing separate models for different use cases.

Basic Architecture of CQRS

In a distributed system, the CQRS pattern looks like this:

Command Side (Write Model): Handles the incoming commands that modify the systemâ€™s state. These commands are processed by the business logic layer, which updates the database.
Query Side (Read Model): Handles queries to retrieve data. The read side can be optimized with different data storage, denormalized databases, or caching layers.
Eventual Consistency (optional): Commands often generate events, which are propagated to update the read side asynchronously.

CQRS Code Example

Letâ€™s walk through a simple implementation of the CQRS pattern using Java and Spring Boot in a microservices architecture.

Command Side (Write Model)

This side processes commands like creating and updating entities.

é¢†è‹±æŽ¨è

Kafka vs RabbitMQ

Ahmed El-Sayed 2 å¹´å‰

Caching - In Code or External?

Shrey Batra 3 å¹´å‰

JobTarget Internal Batch Framework that runs 5400 Jobs/Month 60,000 Jobs/Year on AWS Batch (Enhanced Architecture) Version 2.0 Future Plans

JobTarget Internal Batch Framework that runs 5400â€¦

Soumil S. 2 å¹´å‰

// Command: CreateUserCommand.java
public class CreateUserCommand {
    private final String userId;
    private final String name;
    private final String email;

    public CreateUserCommand(String userId, String name, String email) {
        this.userId = userId;
        this.name = name;
        this.email = email;
    }

    // Getters
}

// Command Handler: UserCommandHandler.java
@Service
public class UserCommandHandler {

    private final UserRepository userRepository;

    @Autowired
    public UserCommandHandler(UserRepository userRepository) {
        this.userRepository = userRepository;
    }

    public void handle(CreateUserCommand command) {
        User user = new User(command.getUserId(), command.getName(), command.getEmail());
        userRepository.save(user);
    }
}

// Entity: User.java
@Entity
public class User {
    @Id
    private String userId;
    private String name;
    private String email;

    public User(String userId, String name, String email) {
        this.userId = userId;
        this.name = name;
        this.email = email;
    }

    // Getters and setters
}

Query Side (Read Model)

The query side retrieves data. We can use a denormalized, read-optimized model, such as a SQL view or NoSQL database, to speed up read performance.

// Query: GetUserQuery.java
public class GetUserQuery {
    private final String userId;

    public GetUserQuery(String userId) {
        this.userId = userId;
    }

    // Getter
}

// Query Handler: UserQueryHandler.java
@Service
public class UserQueryHandler {

    private final UserViewRepository userViewRepository;

    @Autowired
    public UserQueryHandler(UserViewRepository userViewRepository) {
        this.userViewRepository = userViewRepository;
    }

    public UserView handle(GetUserQuery query) {
        return userViewRepository.findById(query.getUserId()).orElseThrow(() -> new UserNotFoundException());
    }
}

// Read-Optimized View: UserView.java
public class UserView {
    private String userId;
    private String name;
    private String email;

    public UserView(String userId, String name, String email) {
        this.userId = userId;
        this.name = name;
        this.email = email;
    }

    // Getters
}

// UserViewRepository.java
@Repository
public interface UserViewRepository extends JpaRepository<UserView, String> {
}

Real-World Use Cases of CQRS

E-Commerce Platforms (e.g., Amazon, eBay): In an e-commerce system, the read and write patterns can be vastly different. For instance, the product catalog (query side) requires fast lookups, while order creation and updates (command side) involve complex business logic. By separating the command and query sides, the system can independently scale and optimize both.
Financial Systems (e.g., Payment Processing Systems): Payment processing involves complex transaction logic on the write side (validating, deducting balances, etc.), whereas the query side (account balance, transaction history) needs to be read-optimized for quick access.
Social Media Platforms (e.g., Twitter, Facebook): In social media, posting content (write side) involves various validations, while retrieving the news feed (query side) requires optimized, fast queries. CQRS helps separate these two concerns, ensuring the system can scale for millions of users.

Eventual Consistency in CQRS

In large-scale distributed systems, achieving strict consistency between the command and query sides is often impractical. With CQRS, the system can embrace eventual consistency, where the read model lags slightly behind the write model.

When a command updates the state, an event can be published to notify the read model of the change. The read model updates asynchronously, ensuring that the query side catches up eventually.

Hereâ€™s an example of event propagation in CQRS:

// Event: UserCreatedEvent.java
public class UserCreatedEvent {
    private final String userId;
    private final String name;
    private final String email;

    public UserCreatedEvent(String userId, String name, String email) {
        this.userId = userId;
        this.name = name;
        this.email = email;
    }

    // Getters
}

// Event Listener: UserCreatedEventListener.java
@Service
public class UserCreatedEventListener {

    private final UserViewRepository userViewRepository;

    @Autowired
    public UserCreatedEventListener(UserViewRepository userViewRepository) {
        this.userViewRepository = userViewRepository;
    }

    @EventListener
    public void handle(UserCreatedEvent event) {
        UserView userView = new UserView(event.getUserId(), event.getName(), event.getEmail());
        userViewRepository.save(userView);
    }
}

This approach allows for scalability and fault tolerance in large distributed systems.

Best Practices for Implementing CQRS

Independent Scaling: Ensure that the command and query sides can scale independently based on load. Read-heavy systems can benefit from optimized read models with separate databases or caches.
Eventual Consistency: Embrace eventual consistency where strict consistency is not needed. Ensure proper design for propagating events to keep the read model up to date.
Caching Strategies: Implement caching strategies (e.g., Redis) to further optimize the read side in systems where read latency is crucial.
Testing: Testing CQRS can be challenging due to the decoupled nature of the command and query sides. Implement comprehensive unit tests and end-to-end tests to ensure consistency across the system.

Conclusion

CQRS is a powerful pattern for large-scale distributed systems where read and write operations have different requirements. By decoupling these responsibilities, CQRS enables scalable, efficient, and maintainable architectures.

However, it also introduces complexity, so it's essential to carefully assess whether CQRS is the right fit for your system. With the right use case and careful implementation, CQRS can dramatically improve performance and scalability in distributed systems.

LN Pandey

Making AI simple for all | IIT Madras

6 ä¸ªæœˆ

Good to know this!

èµž

å›žå¤

1 æ¬¡å›žåº”

è¦æŸ¥çœ‹æˆ–æ·»åŠ è¯„è®ºï¼Œè¯·ç™»å½•

Diwakar Shuklaçš„æ›´å¤šæ–‡ç«

?? Microservices & DTO JARs: Smart Reuse or Hidden Coupling?

2025å¹´3æœˆ21æ—¥

?? Microservices & DTO JARs: Smart Reuse or Hidden Coupling?

In a modern Spring Boot microservices architecture, one common design choice is to package DTOs (Data Transfer Objects)â€¦
RSA and ECDSA: Modern Cryptography Algorithms Analysis

2024å¹´10æœˆ18æ—¥

RSA and ECDSA: Modern Cryptography Algorithms Analysis

RSA and ECDSA: A Technical Dive into Modern Cryptography Cryptography plays a crucial role in securing data in modernâ€¦
Erasure Coding

2024å¹´9æœˆ27æ—¥

Erasure Coding

Erasure coding is a data protection technique used in distributed storage systems to ensure data availability andâ€¦
Building Resilient and Fault-Tolerant Systems: An In-Depth Guide

2024å¹´9æœˆ8æ—¥

Building Resilient and Fault-Tolerant Systems: An In-Depth Guide

In distributed systems, failures are inevitable. A resilient and fault-tolerant system can continue to function despiteâ€¦
Designing High-Performance APIs: A Technical Deep Dive

2024å¹´9æœˆ7æ—¥

Designing High-Performance APIs: A Technical Deep Dive

High-performance APIs are crucial for building responsive and scalable systems in today's data-driven world. Whetherâ€¦

2 æ¡è¯„è®º
Anti-Corruption Layer (ACL): Protecting System Integrity in Complex Architectures

2024å¹´8æœˆ29æ—¥

Anti-Corruption Layer (ACL): Protecting System Integrity in Complex Architectures

Why this? In today's enterprise environments, integrating new systems with legacy systems or third-party services is aâ€¦
Understanding Zero Copy Architecture: Boosting Performance in Modern Systems

2024å¹´8æœˆ28æ—¥

Understanding Zero Copy Architecture: Boosting Performance in Modern Systems

Introduction In today's high-performance computing environments, data movement can be a significant bottleneckâ€¦
Kafka Architecture: A Deep Dive

2024å¹´8æœˆ27æ—¥

Kafka Architecture: A Deep Dive

Kafka's architecture is designed to be scalable, fault-tolerant, and distributed, capable of handling large volumes ofâ€¦

See all articles

CQRS (Command Query Responsibility Segregation) in Distributed Systems

Diwakar Shukla

Technical Lead @ Paytm | Fintech | Lending | Problem Solver | IoT

What is CQRS?

Why Use CQRS in Distributed Systems?

Basic Architecture of CQRS

CQRS Code Example

Command Side (Write Model)

é¢†è‹±æŽ¨è

Query Side (Read Model)

Real-World Use Cases of CQRS

Eventual Consistency in CQRS

Best Practices for Implementing CQRS

Conclusion

Diwakar Shuklaçš„æ›´å¤šæ–‡ç«

ç¤¾åŒºæ´žå¯Ÿ

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†

LakeBoost:Maximizing Efficiency in Data Lake (Hudi) Glue ETL Jobs with a Templated Approach and Serverless Architecture with Source Code

Elastic Search Performance Tuning and Optimization How We Got 80X Faster Searches a Case Study

Understanding JSON: The Backbone of Modern Data Exchange

Kafka Streams vs. Apache Flink: Choosing the Right Tool for Stream Processing

What Is Kalix And Its Advantages

Backfilling Apache Hudi Tables in Production: Techniques & Approaches Using AWS Glue by Job Target LLC

Kafka Architecture

"Real-Time End-to-End Integration with Apache Kafka in Apache Sparkâ€™s Streaming"

What is CQRS?

Why Use CQRS in Distributed Systems?

Basic Architecture of CQRS

CQRS Code Example

Command Side (Write Model)

é¢†è‹±æŽ¨è

Query Side (Read Model)

Real-World Use Cases of CQRS

Eventual Consistency in CQRS

Best Practices for Implementing CQRS

Conclusion

Diwakar Shuklaçš„æ›´å¤šæ–‡ç«

?? Microservices & DTO JARs: Smart Reuse or Hidden Coupling?

RSA and ECDSA: Modern Cryptography Algorithms Analysis

Erasure Coding

Building Resilient and Fault-Tolerant Systems: An In-Depth Guide

Designing High-Performance APIs: A Technical Deep Dive

Anti-Corruption Layer (ACL): Protecting System Integrity in Complex Architectures

Understanding Zero Copy Architecture: Boosting Performance in Modern Systems

Kafka Architecture: A Deep Dive

ç¤¾åŒºæ´žå¯Ÿ

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†

LakeBoost:Maximizing Efficiency in Data Lake (Hudi) Glue ETL Jobs with a Templated Approach and Serverless Architecture with Source Code

Elastic Search Performance Tuning and Optimization How We Got 80X Faster Searches a Case Study

Understanding JSON: The Backbone of Modern Data Exchange

Kafka Streams vs. Apache Flink: Choosing the Right Tool for Stream Processing

What Is Kalix And Its Advantages

Backfilling Apache Hudi Tables in Production: Techniques & Approaches Using AWS Glue by Job Target LLC

Kafka Architecture

"Real-Time End-to-End Integration with Apache Kafka in Apache Sparkâ€™s Streaming"

é¢†è‹±æŽ¨è

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†