登录查看更多内容

System Design of a flash sale e-commerce.

Nikhil Kumar

Angel investor , Engineering Director | Ex Microsoft, Amazon, Intel, UIPath, Whatfix

发布日期: 2025年1月28日

Problem Statement:

The task is to design a Flash Sale Pre-Checkout System for a food delivery app that wants to sell a specific food item from a top-rated seller within a short time frame. The system needs to handle millions of users competing for a limited stock, ensuring scalability, efficiency, and accurate inventory allocation.

Key Functional Requirements

Flash Sale Inventory Management: The inventory (e.g., 20,000 burgers) is set aside before the sale starts.
Pre-Checkout Reservation: The item is mapped to a customer before payment, ensuring that an order does not get double-booked.
Concurrency Handling: The system must handle millions of requests per second, ensuring fair allocation of stock.

Key Non-Functional Requirements

Scalability: The system should handle 10 million+ users concurrently.
High Performance: Order reservation response time should be <100ms.
Reliability: The system should prevent race conditions and overselling.
Cost Efficiency: The architecture should minimize infrastructure costs, considering flash sales are short-lived events.

Back-of-the-Envelope Calculation

A rough estimation of concurrent users is necessary to determine how much load the system needs to handle.

Estimating Concurrent Users

Given:

Total daily active users (DAU) interested in the sale: 10 million
Flash sale duration: 24 hours
Total seconds in 24 hours = 24 × 60 × 60 = 86,400 seconds
Concurrent requests per second = 10 million / 86,400 → 115.74 ≈ 115 concurrent users per second

This means our system needs to handle at least 115 concurrent users every second, assuming a uniform traffic pattern. However, in reality:

Peak traffic happens in the first few minutes.
The load is not evenly distributed over 24 hours.
We may experience bursts of 10,000+ concurrent users during the peak.

Peak Load Assumption

If 80% of users participate in the first 10 minutes, that means 8 million users in 600 seconds.
Peak concurrent requests = 8M / 600 → ≈ 13,333 requests per second.

Thus, our system must be designed to handle at least 10,000–15,000 requests per second (RPS) during peak traffic.

Things That Are Out of Scope:

1. Payment Processing & Payment Failures

This system does not handle payments, transactions, or fraud detection.
We assume that once an item is added to the cart, the payment process happens in a separate system.
Any payment failures, refunds, or chargebacks are out of scope.

? Reason for Exclusion: Payment processing is handled by dedicated payment gateways (Stripe, Razorpay, PayPal, etc.) and requires PCI compliance and fraud detection mechanisms.

2. Order Delivery & Logistics

The system only handles inventory reservation but does not deal with delivery tracking, dispatching, or logistics management.
We assume that once an order is successfully placed, it moves to a separate order fulfillment system.

? Reason for Exclusion: Delivery systems involve fleet management, real-time tracking, and delivery slot optimizations, which are completely separate from pre-checkout operations.

3. Order Cancellation & Abandonment Handling

We do not handle cases where a user cancels an order after adding it to the cart.
If a user adds an item but does not complete payment, we assume inventory will be released after a pre-set timeout.
Any form of re-attribution or reallocation of inventory from abandoned carts is not covered.

? Reason for Exclusion: Order cancellations require timer-based inventory restoration and business logic to prevent abuse (e.g., users reserving multiple items without buying them). These optimizations can be handled by a cart management service.

4. Personalized Recommendations & Dynamic Pricing

The system does not incorporate AI-based recommendations to upsell or suggest alternative products.
Dynamic pricing (surge pricing, demand-based price fluctuations, or discounts) is out of scope.

? Reason for Exclusion: Such features require machine learning models and integration with pricing engines, which are not critical to the core inventory reservation problem.

5. Seller Inventory Management & Restocking

This system does not allow sellers to update stock dynamically or manage their own inventory replenishment.
We assume that inventory is fixed before the sale starts (e.g., 20,000 burgers pre-allocated).
Any inventory restocking logic is not considered.

? Reason for Exclusion: Flash sales typically work with pre-allocated stock, and dynamic inventory updates introduce complexity and inconsistencies during a high-traffic event.

Final Scope Summary

? Covered in This Article

Handling a high-scale pre-checkout reservation system.
Ensuring real-time inventory allocation with low latency.
Using Redis, Message Queues, and Order Accumulators to process reservations efficiently.
Addressing scalability concerns, including rate limiting & race conditions.

? Not Covered (Out of Scope)

Payment Processing & Failures
Delivery & Logistics
Order Cancellation & Abandonment
AI-based Recommendations & Dynamic Pricing
Seller Inventory Management & Restocking

System Design Architecture

High-Level Overview

Key Components:

RedisCache: Manages real-time inventory and reservations.
OrderService: Processes orders and updates inventory.
SQS (Simple Queue Service): Ensures orders are processed asynchronously.
OrderWorker: Picks orders from SQS, finalizes reservations, and updates the database.
DLQ (Dead Letter Queue): Stores failed orders for retry.

Detailed Component Breakdown:

领英推荐

10 Possible Causes of Low Conversion Rates in…

Enhencer | AI Ads - Meta & Google 5 个月前

Top Advantages, Disadvantages and Limitations of…

JK Lucent 1 年前

How Magento 2 Delivery Date and Time Extension Solves…

SunCart 5 个月前

Redis as the Primary Inventory Store

erDiagram REDIS_CACHE { int customer_id string order_status }

? Why Redis?

Atomic Operations (INCR, DECR) prevent race conditions.
Ultra-fast (sub-millisecond latency).
Eviction Policies can prevent stale reservations.

?? Redis Key Structure

inventory_count = 20,000 # Decremented atomically reservation:{customer_id} = {status} # Tracks reservations

?? How it Works:

DECR inventory_count → Ensures atomic reservation.
SET reservation:{customer_id} CONFIRMED → Prevents double allocation.
Orders failing payment are released back into inventory.

Order Processing Pipeline

To prevent bottlenecks and system crashes, we use an asynchronous, event-driven pipeline.

? Advantages of this pipeline

Asynchronous processing prevents bottlenecks.
Message queueing prevents sudden traffic spikes.
Retries failed orders without blocking the main system.

Handling Scalability Challenges

The Thundering Herd Problem

?? Issue: Millions of users rushing in at the same time overloads Redis. ?? Solution: Introduce Rate Limiting + Distributed Locks

Rate limiter (Token Bucket Algorithm) blocks excessive requests.
Distributed Locks ensure that only one request per user is processed.

Preventing Race Conditions in Order Placement

?? Issue: Two users reserving the last item at the same time. ?? Solution: Atomic Redis Transactions

MULTI DECR inventory_count SET reservation:{customer_id} CONFIRMED EXEC

?? This ensures either both succeed or both fail.

Order Failures & Retries

?? Issue: If an order fails, how do we retry? ?? Solution: Dead Letter Queue (DLQ) + Retry Worker

Failed orders go to DLQ.
Retry Worker retries failed orders up to 3 times.
If still failing, alert the customer support team.

Scaling Horizontally

?? Issue: Single server cannot handle millions of users. ?? Solution: Horizontally Scale Order Service

Auto-scaling ensures capacity adjusts dynamically.
Load balancer distributes requests evenly.

Final Thoughts

? Scalable & Resilient – Can handle millions of users.

? No Race Conditions – Atomic Redis transactions prevent double booking.

? High Availability – Asynchronous queue ensures no service downtime.

? Cost Efficient – Optimized architecture prevents wasteful spending.

Additional Improvements

?? Optimistic Locking for High Contention: Use WATCH in Redis to detect conflicts.

?? AI-powered Demand Prediction: Use ML to pre-stock inventory in high-demand locations.

?? User Experience Enhancements: Show real-time stock updates to users.

?? Use Stream Processor like apache Spark : Avoid availability overheads with embedded DB. Can be paired with CDC tools as well.

This architecture ensures a smooth, fair, and reliable flash sale experience for millions of users. ??

要查看或添加评论，请登录

Nikhil Kumar的更多文章

India’s Century: How the World’s Largest Democracy is Poised to Shape the 21st Century

2025年2月21日

India’s Century: How the World’s Largest Democracy is Poised to Shape the 21st Century

For centuries, human civilization has been marked by the rise and fall of empires, nations, and entire regions. In…
Understanding the Enterprise Engineer Role at Facebook for which Meta is Hiring in India.

2025年2月5日

Understanding the Enterprise Engineer Role at Facebook for which Meta is Hiring in India.

Understanding the Enterprise Engineer Role at Facebook Recent inquiries regarding the Enterprise Engineer position at…

4 条评论
Is Google’s Quantum Leap Just a Marketing Mirage?

2024年12月12日

Is Google’s Quantum Leap Just a Marketing Mirage?

In a bold announcement, Google CEO Sundar Pichai unveiled Willow, the company’s latest quantum computing chip…
Why fresher software engineers have raw end of the deal in this day and age.

2024年11月7日

Why fresher software engineers have raw end of the deal in this day and age.

Follow Nikhil Kumar for more such insights When I reflect on the journey of becoming a programmer, I can't help but…

2 条评论
Google Search quality is degrading. What can tech professionals do to deal with it?

2024年10月28日

Google Search quality is degrading. What can tech professionals do to deal with it?

Google search has become significantly worse over the years, and it's more noticeable now than ever. Let's take an…

1 条评论
How to interview for Data Structure, Algorithm Problem Solving, Coding round for Sr/ Staff/ Principal Engineer Roles

2024年10月20日

How to interview for Data Structure, Algorithm Problem Solving, Coding round for Sr/ Staff/ Principal Engineer Roles

When preparing for or conducting coding interviews for Senior (L5), Staff (L6), or Principal Software Engineer roles in…

2 条评论
My Painful Journey with Settlin: A Serious Warning to Potential Home Buyers

2024年10月1日

My Painful Journey with Settlin: A Serious Warning to Potential Home Buyers

I want to share my frustrating and painful experience with Settlin, a company that promises smooth and professional…

4 条评论

See all articles

Problem Statement:

Key Functional Requirements

Key Non-Functional Requirements

Back-of-the-Envelope Calculation

Estimating Concurrent Users

Peak Load Assumption

Things That Are Out of Scope:

1. Payment Processing & Payment Failures

2. Order Delivery & Logistics

3. Order Cancellation & Abandonment Handling

4. Personalized Recommendations & Dynamic Pricing

5. Seller Inventory Management & Restocking

Final Scope Summary

? Covered in This Article

? Not Covered (Out of Scope)

System Design Architecture

High-Level Overview

Detailed Component Breakdown:

领英推荐

Redis as the Primary Inventory Store

Order Processing Pipeline

Handling Scalability Challenges

The Thundering Herd Problem

Preventing Race Conditions in Order Placement

Order Failures & Retries

Scaling Horizontally

Final Thoughts

Additional Improvements

Nikhil Kumar的更多文章

India’s Century: How the World’s Largest Democracy is Poised to Shape the 21st Century

Understanding the Enterprise Engineer Role at Facebook for which Meta is Hiring in India.

Is Google’s Quantum Leap Just a Marketing Mirage?

Why fresher software engineers have raw end of the deal in this day and age.

Google Search quality is degrading. What can tech professionals do to deal with it?

How to interview for Data Structure, Algorithm Problem Solving, Coding round for Sr/ Staff/ Principal Engineer Roles

My Painful Journey with Settlin: A Serious Warning to Potential Home Buyers

社区洞察

其他会员也浏览了

Maximize Your Profits! Sell Anything with Typof's Affordable E-Commerce Platform

Shopify Vs Volusion: Choose Decisively for Your Online Business

What Are Direct Order Links? A Comprehensive Guide

Grow your business with Walletto solutions

WooCommerce Deposits & Partial Payments - Harness Partial Payments for your WooCommerce store

Trends in E-commerce Software Development

Shopify vs. Magento 2024 — Which Platform Is Best for You?

SunCart: Your One-Stop Shop for Powerful and User-Friendly Magento Extensions!

Shopify Australia Post Shipping App with Rates, Labels & Tracking

Improve the Magento 2 Store Checkout Process - A Real Winner!