登录查看更多内容

How Grab stores and processes millions of orders every day

Arpit Bhayani

发布日期: 2023年2月22日

Grab stores and processes millions of orders every day, here is the design of the systems that powers it ?

The architecture of Grab's order platform is divided into two main databases, one for transactional queries and the other for analytical queries.

The transactional database holds transactional data about the orders that serve as the single source of truth for ongoing orders. In contrast, the analytical database holds historical and statistical data and stores it for longer periods. This database is defined to be more efficient for read-heavy analytical queries.

Design Goals

Given the critical role that databases play in any system, the most important criteria for a good design are stability, availability, and consistency.

Stability is critical as the database must handle high volumes of queries per second. Availability is important because the database stores orders and any downtime could lead to revenue loss. Consistency ensures that users receive updated information when performing transactional queries. Therefore, strong consistency is necessary for transactional queries, while eventual consistency is sufficient for analytical queries.

Architecture

DynamoDB as Transactional Database

Grab uses DynamoDB as its primary data store. DynamoDB offers strong consistency, scalability, and high availability, making it an ideal choice for transactional queries.

Grab uses key-value queries such as get, create, and update orders, and DynamoDB's internal partition balancing feature to handle spikey traffic loads and hotkeys.

The orders table in DynamoDB contains several key attributes, including order ID, the state of the order (ongoing, completed, or canceled), the time the order was created, and the user ID.

To optimize performance, a global secondary index was created on the user ID, which allows for quick retrieval of ongoing orders for a particular user. When an order is completed, it is automatically removed from the index to keep it lean.

Grab has also designed its schema to keep its global secondary index (GSI) lean and performant by removing entries from the index after three months of inactivity.

Vivek Bansal 8 个月前

Distributing SQL Databases Globally

Kaivalya Apte 1 年前

Understanding Inline Table-Valued Functions in…

Shiv Iyer 1 个月前

MySQL as Analytical Database

For analytical queries, Grab uses MySQL as its Analytical database. It is partitioned by the created_at, ensuring minimal cross-partition queries, and it drops old partitions to keep the database lean and consistent.

Sync between databases

Grab uses Kafka to handle asynchronous batch ingestion from DynamoDB to MySQL. All events from the order service are sent to Kafka, which then updates the analytics database.

Conclusion

In conclusion, Grab's order platform is an example of a complex system that balances consistency, availability, and cost-effectiveness to process millions of orders every day. By separating transactional and analytical queries into two separate databases, using specialized processing units, and leveraging modern data storage technologies, Grab has built a system that is highly scalable, available, and cost-effective.

Here's the video of my explaining this in-depth ?? do check it out

Thank you so much for reading this edition of the newsletter ?? If you found it interesting, you will also love my

Members-only videos are more hands-on and cover some really interesting and essential topics that every engineer should know. So, support me by becoming a member of my YouTube channel.

ps: the references for this edition of the newsletter can be found in the description of the video attached (if any).

Arpit's Newsletter

108,073 位关注者

Patrick Warukira

Field Market Rep at Capwell Industry

1 年

What this order grabbing

Patrick Warukira

Field Market Rep at Capwell Industry

1 年

Rahul Agarwal

1 年

Thanks Arpit Bhayani as always your writing and examples are simple and straight to the point.Liked the example where you clearly mentioned about dropping indexes as soon as order completes and dropping index as soon as no read happening for older partitions for analytics databases.

Ashish Vijaywargiya

VP of Operations at HotWax Systems | Apache OFBiz Committer, PMC Member | ASF(Apache Software Foundation) Member | Empowering College Students, Freshers & Early IT Professionals ??

1 年

Super nice meme. I liked this one a lot. ???????? How do you generate it, Arpit?

Abhishek Verma

Software Professional

1 年

Arpit Bhayani Interesting you mention about DynamoDB's ability to handle spiky traffic pattern. There are potential scenarios however when an erratic write pattern can also lead to write throttles (even with a high cardinality partition key) -say the write requests within a timeframe is significantly higher than the requests in the preceding timeframe.

查看更多评论

要查看或添加评论，请登录

Arpit Bhayani的更多文章

The best resource does not exist.

2024年9月22日

The best resource does not exist.

This edition of the newsletter contains two quick write-ups about The best resource is mythical Convergent Encryption I…

3 条评论
It's not about what you know, but about how you think

2024年9月8日

It's not about what you know, but about how you think

This edition of the newsletter contains two quick write-ups about It's not about what you know, but about how you think…

1 条评论
Roadmaps are just satisfying your urge to follow a syllabus

2024年8月18日

Roadmaps are just satisfying your urge to follow a syllabus

This edition of the newsletter contains one quick write-up about Roadmaps are just satisfying your urge to follow a…

3 条评论
Always negotiate the offer you get

2024年8月11日

Always negotiate the offer you get

This edition of the newsletter contains two quick write-ups about Always negotiate the offer you get Why PostgreSQL…

2 条评论
Proving your Culture Fit

2024年8月4日

Proving your Culture Fit

This edition of the newsletter contains two quick write-ups about Proving your Culture Fit What not to say during…

1 条评论
Premature Abstractions

2024年7月28日

Premature Abstractions

This edition of the newsletter contains two quick write-ups about Quantify and show impact, whenever and wherever…

1 条评论
Tip the scale in your favor in interviews

2024年7月21日

Tip the scale in your favor in interviews

This edition of the newsletter contains two quick write-ups about How to tip the scale in your favor during interviews…

2 条评论
7 questions that you should ask your interviewer

2024年7月14日

7 questions that you should ask your interviewer

This edition of the newsletter contains two quick write-ups about Questions that you should ask your interviewers The 4…

5 条评论
Traits of a 10x engineer

2024年7月7日

Traits of a 10x engineer

Build your own Interpreter CodeCrafters launched a super interesting challenge on building your own interpreter. Give…

3 条评论
How PostgreSQL stores data in files, called forks

2024年6月30日

How PostgreSQL stores data in files, called forks

Thank you so much for reading this edition of the newsletter ?? If you found it interesting, you will also love my…

1 条评论

See all articles

How Grab stores and processes millions of orders every day

Arpit Bhayani

Design Goals

Architecture

DynamoDB as Transactional Database

领英推荐

MySQL as Analytical Database

Sync between databases

Conclusion

Arpit's Newsletter

108,073 位关注者

Arpit Bhayani的更多文章

社区洞察

其他会员也浏览了

Exploring Indexing Strategies: Optimizing Database Performance

Hasura GraphQL Remote JOINs on Distributed SQL Running on AKS & GKE

CAP Theorem: Optimizing Database Selection for Enhanced Application Performance

An Approach to Database Fine-Grained Access Controls

How The Single Table Design Can Help You Reduce Latency And Costs in DynamoDB

Deploying a Real-Time Location App with Hasura GraphQL Engine and Distributed SQL

To SQL, NoSQL or NewSQL, that’s the Query!

Integrating Databases with Backend: Your Data’s Best Friend ????

SingleStore Vector DB and Embeddings

Managing Performance in Azure SQL: Strategies for Handling Skewed Indexes

Design Goals

Architecture

DynamoDB as Transactional Database

领英推荐

MySQL as Analytical Database

Sync between databases

Conclusion

Arpit's Newsletter

108,073 位关注者

Arpit Bhayani的更多文章

The best resource does not exist.

It's not about what you know, but about how you think

Roadmaps are just satisfying your urge to follow a syllabus

Always negotiate the offer you get

Proving your Culture Fit

Premature Abstractions

Tip the scale in your favor in interviews

7 questions that you should ask your interviewer

Traits of a 10x engineer

How PostgreSQL stores data in files, called forks

社区洞察

其他会员也浏览了

Exploring Indexing Strategies: Optimizing Database Performance

Hasura GraphQL Remote JOINs on Distributed SQL Running on AKS & GKE

CAP Theorem: Optimizing Database Selection for Enhanced Application Performance

An Approach to Database Fine-Grained Access Controls

How The Single Table Design Can Help You Reduce Latency And Costs in DynamoDB

Deploying a Real-Time Location App with Hasura GraphQL Engine and Distributed SQL

To SQL, NoSQL or NewSQL, that’s the Query!

Integrating Databases with Backend: Your Data’s Best Friend ????

SingleStore Vector DB and Embeddings

Managing Performance in Azure SQL: Strategies for Handling Skewed Indexes