Table Joins: Benchmarks are Fine. What About Doing Real Work?
In the real world we don’t build systems from benchmarks. The practical work in database systems gets done in table joins. Let’s use a simple example.
Sometimes the data we’re looking for doesn’t reside in a single table. Getting the view we want requires combining two or more tables. For example, the SERIAL NUMBER (SN) table contains data about specific units, but the many details common to the part are found in the PART NUMBER (PN) table. This avoids repeating the PN data in every SN record. To connect the two tables, the part number is recorded in the SN table. This field is called a foreign key, and it tells us where to look in the PN table to find those details. When we need to retrieve the complete information for a list of serial numbers, we merge the two tables in a process called a “table join.”
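To make the SN/PN example concrete, here is a minimal sketch using SQLite. The table and column names (sn, pn, serial_number, part_number, and so on) are illustrative only, not taken from any particular system.

```python
# Minimal sketch of the SN/PN join described above, using SQLite.
# All table and column names here are hypothetical.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE pn (
        part_number TEXT PRIMARY KEY,
        description TEXT,
        unit_weight REAL
    );
    CREATE TABLE sn (
        serial_number TEXT PRIMARY KEY,
        part_number   TEXT REFERENCES pn(part_number),  -- foreign key into PN
        build_date    TEXT
    );
    INSERT INTO pn VALUES ('PN-100', 'Hydraulic pump', 4.2);
    INSERT INTO sn VALUES ('SN-0001', 'PN-100', '2023-05-01');
""")

# The table join: each SN row is combined with its matching PN row,
# so the part details are not repeated in every SN record.
rows = conn.execute("""
    SELECT sn.serial_number, sn.build_date, pn.description, pn.unit_weight
    FROM sn
    JOIN pn ON pn.part_number = sn.part_number
""").fetchall()
print(rows)  # [('SN-0001', '2023-05-01', 'Hydraulic pump', 4.2)]
```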
One of the most impressive benchmarks of the Ancelus system is the time required for complex table joins. This focus was not accidental: our research, based on actual measurements in real systems, shows that table joins are the single biggest time consumer in application software. So we went to great lengths to find and cure the root cause of the delays.
The main observation is that traditional table joins require a large number of “compare” operations to isolate the rows that satisfy the query. The more tables involved in the join, the slower the operation, and the time grows multiplicatively, roughly with the product of the table sizes, as tables are added or grow larger.
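A toy illustration of why the compares pile up, with no claim about how any particular engine is implemented: a naive nested-loop join must test every SN row against every PN row, so the comparison count is the product of the two table sizes.

```python
# Naive nested-loop join: counts how many compare operations are needed.
# Data and field names are made up for illustration.
def nested_loop_join(sn_rows, pn_rows):
    comparisons = 0
    result = []
    for sn in sn_rows:
        for pn in pn_rows:
            comparisons += 1                       # one compare per candidate pair
            if sn["part_number"] == pn["part_number"]:
                result.append({**sn, **pn})
    return result, comparisons

sn_rows = [{"serial_number": f"SN-{i:04d}", "part_number": f"PN-{i % 100}"}
           for i in range(1000)]
pn_rows = [{"part_number": f"PN-{i}", "description": f"Part {i}"}
           for i in range(100)]

joined, comparisons = nested_loop_join(sn_rows, pn_rows)
print(len(joined), comparisons)  # 1000 joined rows, 100000 comparisons
```

With 1,000 serial numbers and only 100 part numbers, the join already costs 100,000 comparisons; every additional table multiplies the work again.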
In a comparative benchmark several years ago, Ancelus reduced the 3-table join time from 230 seconds to 1.5 milliseconds in a 200-million-record database. To show off a little, we decided to test the time required to modify the database, adding a field to the primary table in both systems. In the relational system it required 3,940 seconds with the database locked the entire time. In Ancelus there were 700 milliseconds of prep time followed by 8 microseconds during which the database was locked and out of service.
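On the relational side, the schema change measured here is the familiar ALTER TABLE statement. The sketch below uses SQLite purely to show the shape of the operation; the table and column names are hypothetical, and how long such a change holds locks varies widely by engine.

```python
# Illustrative only: adding a field to the primary (SN) table.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sn (serial_number TEXT PRIMARY KEY, part_number TEXT)")

# In many traditional systems this kind of change can hold locks
# (or force a table rewrite) while it runs.
conn.execute("ALTER TABLE sn ADD COLUMN warranty_expiry TEXT")

print([row[1] for row in conn.execute("PRAGMA table_info(sn)")])
# ['serial_number', 'part_number', 'warranty_expiry']
```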
Ancelus dodges the compare problem with the unique combination of links and tags automatically included in the Ancelus schema. These deliver direct access to the target data, eliminating the need for the compare operations and recovering most of the time lost in traditional systems.
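The link-and-tag mechanism itself is internal to Ancelus, so the sketch below is only a generic illustration of the idea of direct access, not a description of how Ancelus is implemented: when each SN row carries a direct reference to its PN record (a Python dict lookup standing in for a pointer), the join costs one lookup per row instead of a scan with compares.

```python
# Generic "direct access" sketch: one lookup per SN row, no row-by-row compares.
# Data and field names are made up for illustration.
pn_rows = [{"part_number": f"PN-{i}", "description": f"Part {i}"}
           for i in range(100)]
sn_rows = [{"serial_number": f"SN-{i:04d}", "part_number": f"PN-{i % 100}"}
           for i in range(1000)]

# Build the "links" once: part number -> PN record.
pn_index = {pn["part_number"]: pn for pn in pn_rows}

# The join becomes one direct lookup per SN row.
joined = [{**sn, **pn_index[sn["part_number"]]} for sn in sn_rows]
print(len(joined))  # 1000 combined records
```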
For many database applications the traditional delays are an acceptable compromise. But in large, complex, or many-user systems the time for table joins can run into hours (in one big-data case, days).
The Ancelus table-join benchmarks have generated more comments than any other feature. The current figures are quoted for an Intel Broadwell server configured with 44 cores, running 44 concurrent queries. The 7-table specification involves a schema with billion-row tables, and all 44 queries complete in less than 11 seconds.
While the size and complexity of this benchmark are extreme by most standards, it demonstrates one awkward reality that most DBAs live with: nobody even tries a 7-table join in traditional systems because “everyone knows” it can’t be done. Until now.
So, if your goal is to improve throughput (number of concurrent users), shrink hardware, or get better latency (response time to each query), you can use Ancelus to address the challenge.
To learn more, contact us at www.ancelus.com.