登录查看更多内容

The BookMyShow Coldplay Conundrum: A Lesson in High-Scale System Design

Sunny R Gupta

Senior Director @JioStar (JioCinema/HotStar) | First Principles | Cloud Native | Scale | ex-Atlassian

发布日期: 2024年9月23日

On September 23, 2024, BookMyShow, India's leading ticketing platform, faced a significant challenge when tickets for Coldplay's highly anticipated Mumbai concerts went on sale. The platform experienced a crash just minutes before the scheduled ticket release, leaving thousands of eager fans frustrated and unable to access the booking page.

This incident serves as a prime example of the challenges software professionals face when designing large-scale ticketing systems, particularly in handling the "thundering herd" problem.

Understanding the Thundering Herd

The thundering herd problem occurs when a large number of processes or threads, waiting for a single event, are awakened simultaneously when that event occurs. In BookMyShow's case, this manifested as millions of users attempting to access the ticketing system at precisely 12 PM when sales opened.

Imagine a scenario where thousands of people are waiting outside a store for a limited-edition product. When the doors open, everyone rushes in at once, potentially causing chaos and overwhelming the store's capacity. This real-world analogy closely mirrors what happened to BookMyShow's servers.

Saravana Kumar 1 年前

AWS SXSW Sydney 2024 Guide

Brooke Moody 1 个月前

The Time is Now for Operators to Lead Business…

Openmind Networks 5 个月前

The Challenges of High-Scale Ticketing Systems

Designing a system to handle such massive concurrent traffic presents several challenges:

Load Balancing: Distributing incoming requests evenly across multiple servers to prevent any single point of failure.
Caching: Implementing efficient caching mechanisms to reduce database load and improve response times.
Queue Management: Creating a robust queueing system to handle excess traffic and prevent server overload.
Database Optimisation: Ensuring database operations can handle numerous simultaneous read and write operations.
Scalability: Designing the system to scale horizontally to accommodate traffic spikes.

Mitigating the Thundering Herd

To address these challenges, software professionals usually employ several strategies, including:

Implement a Virtual Waiting Room: Create a holding area for users before they enter the actual ticketing system. This helps manage traffic flow and prevents server overload.
Use Exponential Backoff with Jitter: When retrying failed requests, implement an exponential backoff algorithm with added randomness (jitter). This approach, similar to what PayPal used to solve their thundering herd problem, helps spread out retry attempts and prevents synchronised floods of requests.
Leverage Cloud Auto-scaling: Utilise cloud services that can automatically scale resources based on demand. This ensures the system can handle traffic spikes without manual intervention.
Employ Caching Strategies: Implement intelligent caching to reduce the load on backend services. This could include caching frequently accessed data like event details and seat availability.
Optimise Database Operations: Use database sharding, read replicas, and other optimisation techniques to handle high-volume concurrent operations efficiently.

As we all understand, no one can guarantee 100% resilience. It is always going to be a tug of war between cost and availability. As we continue to push the boundaries of what's possible in large-scale system design, incidents like these remind us of the importance of continuous improvement and adaptation in the face of ever-growing user demands.

The BookMyShow Coldplay Conundrum: A Lesson in High-Scale System Design

Sunny R Gupta

Senior Director @JioStar (JioCinema/HotStar) | First Principles | Cloud Native | Scale | ex-Atlassian

Understanding the Thundering Herd

领英推荐

The Challenges of High-Scale Ticketing Systems

Mitigating the Thundering Herd

When: a growth in tech series!

5,957 位关注者

更多精彩文章

社区洞察

其他会员也浏览了

How Does Self-Service Model Benefit SaaS Business?

Learnings from a SaaS IPO - Forgerock

Unleashing the Power of SaaS: Exploring the Top Trends in the US Market

Building a User-Centric SaaS Product: Lessons Learned

The Blackberry moment of legacy SaaS

Three Mistakes to Avoid in Your SaaS Platform Content

Awards Season for IT, Zoho's Long Game, Cisco's Mixed Week

Case Study 50 : Sharing Made Smarter - How Dropbox's Product Management Evolved to Conquer Complexity

The Impact of Events on Observability in Booking.com

Microsoft Partner Summary - May 16th - May 20th 2022

Understanding the Thundering Herd

领英推荐

The Challenges of High-Scale Ticketing Systems

Mitigating the Thundering Herd

When: a growth in tech series!

5,957 位关注者

Navigating the Job Market in 2024: A Comprehensive Guide for Job Seekers

2024年10月20日

Navigating the Startup Journey: Why Domain Expertise Matters

2024年10月13日

Scale with a K.I.S.S: Keep It Simple, Stupid

2024年10月12日

Financial Advice for Young Software Developers

2024年10月9日

The Power of Feature Leads

2024年10月3日

Decoding Radical Candor - Managers, pay attention!

2024年9月11日

Climbing the Tech Ladder: From Software Engineer to Principal Architect

2024年9月1日

Cybersecurity in the Era of AI: Navigating the Digital Frontier with Advanced Intelligence

2024年6月14日

The Power of Transparency: Why Early-Career Developers Should Share Updates Regularly

2024年3月29日

The Wild and Wacky World of Software Testing: Why It's More Important Than Ever Before

2024年3月12日

社区洞察

其他会员也浏览了

How Does Self-Service Model Benefit SaaS Business?

Learnings from a SaaS IPO - Forgerock

Unleashing the Power of SaaS: Exploring the Top Trends in the US Market

Building a User-Centric SaaS Product: Lessons Learned

The Blackberry moment of legacy SaaS

Three Mistakes to Avoid in Your SaaS Platform Content

Awards Season for IT, Zoho's Long Game, Cisco's Mixed Week

Case Study 50 : Sharing Made Smarter - How Dropbox's Product Management Evolved to Conquer Complexity

The Impact of Events on Observability in Booking.com

Microsoft Partner Summary - May 16th - May 20th 2022