Using SAGAS to maintain data consistency in Microservices

Priyal Walpita

CTO & Co-Founder @ Zafer | Expert in AI/ML, Blockchain, Quantum Computing, Cybersecurity & Secure Coding | Digital Security Innovator | Mentor & Trainer in Advanced Tech

Published: October 17, 2019

Back when monoliths ruled the world, we had ACID to take care of data consistency and management. In those days we would create a transaction object and perform all the required operations within that transaction scope. This guaranteed the Atomicity, Consistency, Isolation and Durability of every database action performed within the scope of that transaction.

But that day and age is long gone. Most back-end systems are now moving towards microservices and other, more or less distributed, architecture patterns. If you implement the microservices architecture pattern correctly, you will have one database per service. The following is an example of the microservices architecture pattern.

The question is: can we apply ACID properties across multiple databases, each of which belongs to a different microservice?

Imagine you are designing a system for an e-commerce online store. Assume that each customer has a credit limit. The system should check a customer's credit limit before placing an order for that customer.

Example:

sum(order.total) <= customer.CreditLimit

This is not a challenge in a system designed with a monolithic architecture, because we have the ACID properties to manage all database transactions. If we do our transactions properly (we used to do it without even thinking much about it in those good old days :D), the ACID properties ensure the integrity of the transaction irrespective of the number of requests made to the API. The following is a very basic example of using a transaction.

BEGIN TRANSACTION

    SELECT SUM(AMOUNT) FROM ORDER_LINE WHERE ORDER_ID = ? AND CUSTOMER_ID = ?

    SELECT CREDIT_LIMIT FROM CUSTOMER WHERE CUSTOMER = ?

    INSERT INTO ORDERS

    UPDATE CUSTOMER CREDIT LIMIT

END TRANSACTION
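To make the monolithic case concrete, here is a minimal sketch using Python's built-in sqlite3 module. The table names, column names and the place_order function are assumptions made up for this illustration, not part of any real system. The order is inserted first and the credit check runs afterwards, so a failed check shows the rollback undoing the insert.

```python
import sqlite3


def place_order(conn, customer_id, amount):
    """Insert an order, then roll it back if the credit limit is exceeded."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "INSERT INTO orders (customer_id, amount) VALUES (?, ?)",
                (customer_id, amount),
            )
            total = conn.execute(
                "SELECT SUM(amount) FROM orders WHERE customer_id = ?",
                (customer_id,),
            ).fetchone()[0]
            limit = conn.execute(
                "SELECT credit_limit FROM customer WHERE id = ?",
                (customer_id,),
            ).fetchone()[0]
            if total > limit:
                raise ValueError("credit limit exceeded")
        return True
    except ValueError:
        return False


# Hypothetical schema, one database shared by orders and customers
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (id INTEGER PRIMARY KEY, credit_limit REAL)")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL)"
)
conn.execute("INSERT INTO customer VALUES (1, 100.0)")
conn.commit()

first = place_order(conn, 1, 60.0)   # 60 <= 100, committed
second = place_order(conn, 1, 60.0)  # 120 > 100, rolled back
order_count = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
```

Because both tables live in the same database, a single local transaction is enough to keep the invariant.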

So in a monolithic application we can guarantee that concurrent transactions for the customer will be serialized. But how can we achieve this in our microservices back end? Remember, we have two independent services to maintain customers (credit limits) and orders.

Our Order and Customer services would be loosely coupled, each encapsulating its own data management, as in the above diagram.

There won't be any issue if only one request at a time comes into these services. But that is not the reality. What happens when 10 or 20 other services are calling your service simultaneously? In a situation like this, how can we maintain data consistency across multiple databases?

BEGIN TRANSACTION  // This is the transaction

    SELECT SUM(AMOUNT) FROM ORDER_LINE WHERE ORDER_ID = ? AND CUSTOMER_ID = ?

    SELECT CREDIT_LIMIT FROM CUSTOMER WHERE CUSTOMER = ?  // private to the customer service

    INSERT INTO ORDERS  // private to the order service

    UPDATE CUSTOMER CREDIT LIMIT  // private to the customer service

END TRANSACTION

So how can we overcome this problem? How about using a two-phase commit (2PC)? Well, it sounds like a good solution, but 2PC in a distributed architecture inherently has the following problems.

· The 2PC coordinator is a single point of failure

· It is chatty and creates heavy network traffic: O(4n) messages, and O(n^2) with retries

· Reduced throughput due to locks

· Not supported by many NoSQL databases

· CAP theorem (2PC impacts availability)

So what can we do?

SAGAS to the rescue

So the SAGA mechanism comes to the rescue for distributed systems.

The principle behind the SAGA is fairly simple: you get rid of the distributed transaction agent(s) and come up with a set of self-coordinated local transactions. This idea was first introduced by Hector Garcia-Molina and Kenneth Salem (Princeton University) in 1987.

So, coming back to our example, the first step is to create the order saga. Once that is done, the second step is triggered, which is to reserve credit in the customer service. Once the customer service reserves the credit, the order service checks the reserved credit and approves the order. The customer service can then update the credit limit of that customer upon the order status update.
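The happy path of these steps can be sketched in a few lines of Python. The class and method names below (CustomerService, OrderService, reserve_credit, create_order) are assumptions chosen for this illustration, and plain dictionaries stand in for each service's private database.

```python
class CustomerService:
    """Owns credit limits and reservations (its own private data)."""

    def __init__(self, credit_limits):
        self.credit_limits = dict(credit_limits)
        self.reserved = {}  # customer_id -> total reserved amount

    def reserve_credit(self, customer_id, amount):
        already_reserved = self.reserved.get(customer_id, 0)
        available = self.credit_limits[customer_id] - already_reserved
        if amount > available:
            return False
        self.reserved[customer_id] = already_reserved + amount
        return True


class OrderService:
    """Owns orders (its own private data) and drives the saga steps."""

    def __init__(self, customer_service):
        self.customer_service = customer_service
        self.orders = {}

    def create_order(self, order_id, customer_id, total):
        # Step 1: local transaction in the order service
        self.orders[order_id] = {"customer": customer_id, "total": total,
                                 "status": "PENDING"}
        # Step 2: local transaction in the customer service
        reserved = self.customer_service.reserve_credit(customer_id, total)
        # Step 3: local transaction in the order service
        self.orders[order_id]["status"] = "APPROVED" if reserved else "REJECTED"
        return self.orders[order_id]["status"]


customers = CustomerService({"c1": 100})
orders = OrderService(customers)
status_1 = orders.create_order("o1", "c1", 70)  # credit available
status_2 = orders.create_order("o2", "c1", 50)  # only 30 left to reserve
```

Each step touches only one service's data, so each step can be an ordinary local transaction.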

So easy, eh? Nope. The problems start when you want to roll back. Each individual database executes its own private transaction. There are no automatic rollbacks, and if you are in the middle of a saga you need to manually undo everything done before that point in the respective services. So let's examine how SAGAS can solve this problem.

Solution: every transaction Ti has a compensating transaction Ci

What the original paper suggests (as a remedy to the problem we discussed earlier) is to have a compensating transaction for each transaction. This compensating transaction contains what should be undone when a rollback is required.

C1 and C2 are the execution blocks that know what to do in order to roll back a particular step of the distributed transaction. This solves the problem, but it makes the API design more complicated. The following are the options we have when designing the API.
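A minimal sketch of this idea in Python: each step is a pair (Ti, Ci), and when a step fails, the compensations of the steps that already completed are run in reverse order. The run_saga helper and the step functions are hypothetical names used only for this illustration, and the third step is made to fail on purpose.

```python
def run_saga(steps):
    """Run (action, compensation) pairs in order.

    If an action raises, run the compensations of the already completed
    steps in reverse order and report failure.
    """
    completed = []
    for action, compensation in steps:
        try:
            action()
        except Exception:
            for undo in reversed(completed):
                undo()
            return False
        completed.append(compensation)
    return True


log = []


def create_order():
    log.append("T1: create order")


def cancel_order():
    log.append("C1: cancel order")


def reserve_credit():
    log.append("T2: reserve credit")


def release_credit():
    log.append("C2: release credit")


def approve_order():
    # Simulated failure in the third step
    raise RuntimeError("approval failed")


def unapprove_order():
    log.append("C3: unapprove order")


succeeded = run_saga([
    (create_order, cancel_order),
    (reserve_credit, release_credit),
    (approve_order, unapprove_order),
])
```

Note that C3 never runs here, because T3 did not complete. A real implementation would also need the compensations to be retryable, since a compensation can fail too.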

1. Send the response when the SAGA completes. With this method, the response contains the outcome of the SAGA. This is more or less a blocking call, and it can reduce the availability of the service.

2. Send the response immediately (async). With this method, the service response does not contain the result of the saga. The client needs to poll for the result or be notified of it. An event-based mechanism can be used to notify the client.
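The second option can be sketched as follows. This is only an in-memory illustration under stated assumptions: SagaApi, its methods and the fixed credit limit are hypothetical, and process_pending stands in for whatever background worker or message consumer would actually run the saga.

```python
import uuid


class SagaApi:
    """Accepts requests immediately and lets clients poll for the outcome."""

    def __init__(self):
        self.status = {}   # saga_id -> "PENDING" | "APPROVED" | "REJECTED"
        self.pending = []  # work queue for the background worker

    def create_order(self, customer_id, total):
        saga_id = str(uuid.uuid4())
        self.status[saga_id] = "PENDING"
        self.pending.append((saga_id, customer_id, total))
        # The response carries an id to poll with, not the saga result
        return {"saga_id": saga_id, "status": "PENDING"}

    def get_status(self, saga_id):
        return self.status[saga_id]

    def process_pending(self, credit_limit=100):
        # Stand-in for the asynchronous saga execution
        while self.pending:
            saga_id, _customer_id, total = self.pending.pop(0)
            approved = total <= credit_limit
            self.status[saga_id] = "APPROVED" if approved else "REJECTED"


api = SagaApi()
response = api.create_order("c1", 70)            # returns right away
before = api.get_status(response["saga_id"])     # still pending
api.process_pending()                            # saga runs later
after = api.get_status(response["saga_id"])      # final outcome
```

The client gets an answer quickly in both cases. The difference is that here the answer is "accepted", and the real outcome arrives later.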

Now the question arises: who manages these SAGAS? There are two main ways to do it.

Choreography: This is more or less a distributed decision-making engine. The downside of this method is that it leads to high coupling between sagas and services. And as you know, high coupling is a big no-no when it comes to distributed computing!
Orchestration: This is more or less centralized decision making. So isn't centralization bad when it comes to distributed computing? Yes, it is, but in this instance the orchestrator is the main service responsible for the transaction (in our example, the order service). Hence it is the responsibility of the order service (the orchestrator in this instance) to make sure that this particular request is executed without any issues. Hence orchestration is the preferred solution here.
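To see where the coupling in choreography comes from, here is a small sketch with an in-memory event bus. The EventBus class, the event names and the handlers are all assumptions for illustration. There is no central coordinator: each service reacts to the other service's events and publishes its own.

```python
from collections import defaultdict


class EventBus:
    """Tiny in-memory stand-in for a message broker."""

    def __init__(self):
        self.handlers = defaultdict(list)

    def subscribe(self, event_type, handler):
        self.handlers[event_type].append(handler)

    def publish(self, event_type, payload):
        for handler in list(self.handlers[event_type]):
            handler(payload)


bus = EventBus()
orders = {}            # private to the order service
credit = {"c1": 100}   # private to the customer service


def on_order_created(event):
    # Customer service reacts to an order service event
    if event["total"] <= credit[event["customer"]]:
        credit[event["customer"]] -= event["total"]
        bus.publish("CreditReserved", event)
    else:
        bus.publish("CreditLimitExceeded", event)


def on_credit_reserved(event):
    # Order service reacts to a customer service event
    orders[event["order_id"]] = "APPROVED"


def on_credit_limit_exceeded(event):
    # Compensating step: reject the pending order
    orders[event["order_id"]] = "REJECTED"


bus.subscribe("OrderCreated", on_order_created)
bus.subscribe("CreditReserved", on_credit_reserved)
bus.subscribe("CreditLimitExceeded", on_credit_limit_exceeded)


def create_order(order_id, customer, total):
    orders[order_id] = "PENDING"
    bus.publish("OrderCreated",
                {"order_id": order_id, "customer": customer, "total": total})


create_order("o1", "c1", 80)
create_order("o2", "c1", 50)
```

Both services have to know each other's event types, and the overall flow is not written down in any single place. With an orchestrator, that flow lives in one component instead.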

The above picture depicts saga orchestration. This orchestrator can be implemented in two different ways.

Implicit Orchestrator
Explicit Orchestrator

Implicit Orchestrator

The implicit orchestrator is simpler to implement. We can build it into an existing domain object as well. One of the problems with this is that it violates the Single Responsibility Principle (SRP), because apart from performing order-related functionality the object now has to manage the orchestration responsibilities as well. This is not that good, because there might be cyclic dependencies between services via events.

The following diagram depicts an event-based implicit orchestrator. We can use any distributed message queue to implement the event-driven architecture.

Explicit Orchestrator

The explicit orchestrator would not violate the SRP. It would add an extra dedicated component to the order service to manage the sagas.
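As a rough sketch of what such a dedicated component could look like, assume a CreateOrderSaga class that owns the saga steps, a plain Order domain object, and a CreditClient that stands in for calls to the customer service. All three names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Order:
    """Plain domain object: no orchestration logic inside."""
    order_id: str
    customer_id: str
    total: float
    status: str = "PENDING"


class CreditClient:
    """Stand-in for remote calls to the customer service."""

    def __init__(self, limits):
        self.available = dict(limits)

    def reserve(self, customer_id, amount):
        if amount > self.available[customer_id]:
            return False
        self.available[customer_id] -= amount
        return True


class CreateOrderSaga:
    """Dedicated orchestrator living inside the order service."""

    def __init__(self, credit_client):
        self.credit_client = credit_client

    def execute(self, order):
        if self.credit_client.reserve(order.customer_id, order.total):
            order.status = "APPROVED"
        else:
            # Compensation for the already created pending order
            order.status = "REJECTED"
        return order


saga = CreateOrderSaga(CreditClient({"c1": 100}))
approved = saga.execute(Order("o1", "c1", 60))
rejected = saga.execute(Order("o2", "c1", 60))
```

The Order class only models an order, and the sequencing and compensation logic sits in CreateOrderSaga, which is the separation the SRP asks for.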

The following diagram depicts the usage of an explicit orchestrator.

Conclusion

Data consistency across multiple services is one of the major challenges in microservices and other distributed architecture patterns. This was not a problem in a monolithic architecture because of the ACID properties of database transactions. SAGAS are one way to overcome this problem in distributed patterns.

There are a few saga frameworks built for .NET and Java, such as Eventuate Tram and NSaga. Stay tuned for another post on how to implement a saga in a microservices architecture pattern.

Thanks for reading!
