登录查看更多内容

?? No need to scale out a DB if it can run on one large instance?

Franck Pachot

Developer Advocate at ?? MongoDB ??AWS Data Hero, ?? PostgreSQL & YugabyteDB, ??? Oracle Certified Master

发布日期: 2023年12月3日

Whenever sharding over SQL databases or distributed SQL databases is brought up, some DBAs argue that horizontal scaling is unnecessary because you can run a single, large database on a single, powerful machine to handle massive workloads.

For instance, on AWS, you can provision an Amazon Aurora instance db.x2g.16xlarge with 64 vCPU, 1024 GB RAM, and 128 TB storage. You can run your massive workload on it, but that doesn't mean you necessarily want to.

Large instances can be quite expensive, and it's not always necessary to provision one upfront just in case of a peak in workload. This can happen due to business activity or a runaway query. Additionally, running such an instance all year round can be costly if your activity only peaks on a monthly or seasonal basis.

In the past, we used to do this on-premises, resulting in expensive bills for software licenses that covered a large number of mostly idle CPU resources. Oracle made a lot of revenue from idle processors because customers had to pre-provision for the highest peak of activity, plus a margin of error due to the fact that capacity planning is not an exact science.

Today, we require resource sharing, virtualization, network storage, and private or public cloud with elasticity to reduce the cost, especially when moving to the cloud.

Scaling is necessary, but do you think it's possible to scale up without scaling out? Of course, you could do it if you paused your application, but nobody wants to pause the database, especially when there's a peak in usage. For example, even with Amazon Aurora, Aurora Serverless may pause connections to migrate to another virtual machine when scaling up, and Aurora Limitless may pause write operations when re-sharding.

领英推荐

What are the storage options available in Microsoft…

Whizlabs 1 年前

Introducing Veeam Backup for AWS v5

Veeam Software 2 年前

Unlocking Azure: How to Build a Highly Flexible SQL…

Social Discovery Group 1 年前

You not only need elasticity, but you also require High Availability. When working with a monolithic database, the only option is to provision a new standby or read replica on a larger instance and perform a switchover. However, even a brief switchover results in downtime for the application. If system downtime occurs frequently for cost reduction purposes, it can impact the SLA guarantee by adding to the unavoidable downtimes for system patching or upgrades.

It's crucial to scale your application without any downtime, particularly when the reason for scaling is an increase in high application usage. Having to stop all activity for a minute and switch over to a replica with a cold cache is not the ideal solution at this point.

To summarize, even if scaling vertically seems sufficient, you need horizontal scaling to do it. If your database cannot scale out, your only solutions are permanently provisioning a large instance, which impacts costs, or having frequent downtime to change the instance size, which impacts availability.

Distributed SQL databases are crucial for managing large databases with high throughput that must run on more than one machine. However, even for smaller databases and use cases with seasonal peaks, horizontal scalability is necessary to ensure elasticity and resilience. The idea that "you can run a lot on a single instance" is true, but ignores the cost and availability requirements.

Sikkandar Badusha

Principal Engineer - Database, Hybrid and Multi-cloud Database Strategy Specialist

1 年

Very true, JFYI we scale up (vertical - online) and scale out (requires downtime) the monolithic database using VMware. Usually we run Standby with a lesser number of cores and switchover takes around 2 minutes. This is where applications are required to be paused, but I agree it compromises high availability.

2 次回应

查看更多评论

要查看或添加评论，请登录

Franck Pachot的更多文章

Relational and Document Data Modeling (50 years ago, 25 years ago, and 2025)

2025年3月11日

Relational and Document Data Modeling (50 years ago, 25 years ago, and 2025)

Relational data modeling or document data modeling? With different terms, this question has existed for 50 years of…
2025: I'm joining MongoDB

2025年2月6日

2025: I'm joining MongoDB

I have 30 years of experience with SQL databases, including Oracle Database, Amazon RDS, PostgreSQL, and YugabyteDB. I…

109 条评论
Where is the database schema? #SQL #NoSQL

2025年1月31日

Where is the database schema? #SQL #NoSQL

Although SQL databases can evolve using DDL (Data Definition Language), they are recognized for rigid schemas. In…

8 条评论
SQL Alone Isn’t Enough: Why Modern Applications Need More Than Just SQL

2024年11月11日

SQL Alone Isn’t Enough: Why Modern Applications Need More Than Just SQL

The long-standing debate between SQL and NoSQL used to be framed as a choice between structured and unstructured data…

1 条评论
No Vacuum, No Bloat, No Downtime on Failover, No Lock Escalation, No Manual Sharding, No Delays in Cloning or Backup, No Outage for Database Upgrades

2024年11月4日

No Vacuum, No Bloat, No Downtime on Failover, No Lock Escalation, No Manual Sharding, No Delays in Cloning or Backup, No Outage for Database Upgrades

YugabyteDB is recognized for its resilience and scalability. The distributed storage was also designed to overcome…

7 条评论
SQL database replication: Logical or Physical?

2024年9月26日

SQL database replication: Logical or Physical?

Traditional SQL databases have incorporated replication after their initial design. This can be achieved by…

2 条评论
Starting with YugabyteDB or MongoDB?

2024年9月12日

Starting with YugabyteDB or MongoDB?

MongoDB has gained popularity among developers due to its user-friendly interface and flexible schema-less design…
CQRS != Read-Only Database Replicas

2024年9月6日

CQRS != Read-Only Database Replicas

Command Query Responsibility Segregation (CQRS) is an important design pattern in microservices architectures. It…

2 条评论
A not-so-good idea: Pipe Syntax In SQL

2024年8月26日

A not-so-good idea: Pipe Syntax In SQL

Many SQL users have expressed frustration with the SQL query syntax for SELECT. They argue that beginning with the FROM…

13 条评论
Separation of compute and storage for YugabyteDB

2024年7月29日

Separation of compute and storage for YugabyteDB

Separating computing instances and persistence service, also known as disaggregation of compute and storage, gives the…

3 条评论

See all articles

?? No need to scale out a DB if it can run on one large instance?

Franck Pachot

Developer Advocate at ?? MongoDB ??AWS Data Hero, ?? PostgreSQL & YugabyteDB, ??? Oracle Certified Master

领英推荐

Franck Pachot的更多文章

社区洞察

其他会员也浏览了

Oracle on AWS: With Tessell, performance is no laughing matter

Data Storage Solutions on AWS: Comparing S3, EBS, and Glacier

Empowering GenAI App with Oracle Database at Azure

DAY-10

Azure Storage design

Oracle Database@AWS

Harnessing the Power of AWS Primary Storage: Uncover, Utilize, and Optimize

How To Achieve High Availability With DynamoDB Global Tables

The database is dead, long live the new database

The Rise of Serverless Databases: How They're Changing the IT Landscape

领英推荐

Franck Pachot的更多文章

Relational and Document Data Modeling (50 years ago, 25 years ago, and 2025)

2025: I'm joining MongoDB

Where is the database schema? #SQL #NoSQL

SQL Alone Isn’t Enough: Why Modern Applications Need More Than Just SQL

No Vacuum, No Bloat, No Downtime on Failover, No Lock Escalation, No Manual Sharding, No Delays in Cloning or Backup, No Outage for Database Upgrades

SQL database replication: Logical or Physical?

Starting with YugabyteDB or MongoDB?

CQRS != Read-Only Database Replicas

A not-so-good idea: Pipe Syntax In SQL

Separation of compute and storage for YugabyteDB

社区洞察

其他会员也浏览了

Oracle on AWS: With Tessell, performance is no laughing matter

Data Storage Solutions on AWS: Comparing S3, EBS, and Glacier

Empowering GenAI App with Oracle Database at Azure

DAY-10

Azure Storage design

Oracle Database@AWS

Harnessing the Power of AWS Primary Storage: Uncover, Utilize, and Optimize

How To Achieve High Availability With DynamoDB Global Tables

The database is dead, long live the new database

The Rise of Serverless Databases: How They're Changing the IT Landscape