登录查看更多内容

The Importance of Lists, Schema Demo

John Layden

发布日期: 2019年12月1日

In this post we will offer several examples of Ancelus linked lists, some history on how and why the table-based model prevailed, and the advantages of the Ancelus approach.

A video demonstration on how a list-based schema is developed in Ancelus is available at www.ancelus.com/tech briefs.

The natural state of all information "in the wild" is list-based. The two-dimensional table structure is a rare exception. But in the early days of database evolution (starting with Ted Codd's 1969 paper) this got to be the accepted model.

Two factors converged to produce this adoption. First, computers physically store information in a 2D format, and secondly the white pages phone book view of "name-address-phone number" was used as a definition of what a database should do. While the white pages structure worked early on, we now discover lists are more basic. (I've just realized that a whole generation of developers have never seen a white pages phone book).

Nested lists are the most common logical structures, and "name-addresslist-phonelist" is a classic example of nested lists. We have multiple addresses (home-office-vacation-etc) and multiple phones (home-office-homefax-officefax-cell-car). Even the name isn't the universal key we thought it was, when we discover that teen children need their own phonelist at our address.

One of the most important features of the Ancelus database is the native list handling architecture. While the idea of double-linked-lists isn't new (found in most textbooks), the Ancelus implementation is unique. Some elements are disclosed in the patents. But important parts have been retained as trade secrets.

A double-linked-list provides an opportunity for bi-directional traverse of the data structures. In our universal example of Part Number/Serial Number, every PN has a pointer to the first SN in the list of many SNs. This SN includes a pointer to the next SN in the list. And each SN has a reverse pointer to its one and only one PN. So the SN "table" appears to have an embedded PN as a foreign key of 40 bytes, but actually has a 4 byte pointer into the PN "table." When combined with the storage algorithm, we now have a logical model that is independent of the physical storage model. We can directly implement lists even when they are nested many layers deep.

This approach eliminates the overhead of conversions from lists to tables and back. It also eliminates the duplication of PN data. The entity-relation-diagram (ERD) used to define the logical structures can now be directly implemented in the Ancelus schema. This simplifies the more complex examples such as recursive lists (lists that point back on themselves).

Recursive lists can be visualized using the family tree example. Imagine a field named "Person" that contains the unique ID of every person who has ever lived (100 billion by some estimates). A family tree is now a list with a pointer from me to two parents, then four grandparents, etc. back to the beginning of the human race.

One classic thought problem is to build a database of all my sixth cousins. While there are PhD theses written on how to do this in tables, with Ancelus it's a simple matter of tracing back to the fifth great grandparents, then down to all their descendants. In business processes this is called "blowback traceability" and will be the subject of a later post.

Another Ancelus feature solves a common problem for financial institutions. The ability to make live modifications to the database, including even the schema definition, can address the multiple account problem. In the EU the recent requirement to report all connected accounts challenged the existing record systems. A single person might have a couple of checking and savings accounts, multiple mortgages on multiple properties, and several credit cards, all established in different departments within the bank. Each of these accounts could have been created at different times, with different addresses and variations on the name,, and all with varying levels of updates. Connecting these has been an expensive proposition, even though a common ID number like social security number makes it theoretically possible.

Since Ancelus can introduce new relationships into the schema while the database is in operation, it dramatically reduces the challenge. As the common threads are identified, simply add them to the schema. Complex reporting now achieves a new level of speed and automation.

This may seem like a minor point, but it makes it possible to deploy a new form of business process. Much of the industry approached this as a project, spending 10s of millions to scrub the data. But how long before it needs to be done again? The right way to do this is to create immediate correction methods so it can become a part of every day operations to update the schema of the databases. With Ancelus at the hub of the integration it is now practical to achieve this continuous improvement business model without the need for major modification of the existing systems. See our post on legacy integration with Ancelus.

Let us know if you have special examples you'd like to see discussed in the Ancelus forum.

Craig Mullins

Craig Mullins, President & Principal Consultant at Mullins Consulting, Inc. IBM Gold Consultant and IBM Champion for Data and AI

5 年

Lists are a commonly used approach for managing data and a database that uses that approach is an interesting idea

查看更多评论

要查看或添加评论，请登录

John Layden的更多文章

Ancelus Integration with Existing Oracle or DB2 Applications

2019年10月30日

Ancelus Integration with Existing Oracle or DB2 Applications

What if my current application works fine except for a few trouble spots? A hybrid approach may offer a simple solution…
What’s Special About Ancelus Concatenated Keys?

2019年10月23日

What’s Special About Ancelus Concatenated Keys?

The example above shows how a pricing is handled where more than one vendor exists for each part, and more than one…

1 条评论
Complexity: Major Relational Challenge Solved by Ancelus

2019年10月17日

Complexity: Major Relational Challenge Solved by Ancelus

Most database system designs have self-censored the level of schema complexity based on the general understanding of…
Big Data Puts DBAs in a Vice: A New Approach Emerges

2019年10月10日

Big Data Puts DBAs in a Vice: A New Approach Emerges

Shortly after Y2K (remember that?) the industry focus shifted to the challenge of the explosive growth in the size of…

6 条评论
Table Joins: Benchmarks are Fine. What About Doing Real Work?

2019年10月2日

Table Joins: Benchmarks are Fine. What About Doing Real Work?

In the real world we don’t build systems from benchmarks. The practical work in database systems gets done in table…

1 条评论
Reliability: Eliminating Planned Downtime

2019年9月26日

Reliability: Eliminating Planned Downtime

In our prior post we discussed the issue of recovering from unplanned downtime. The other side of the coin is the time…

1 条评论
Reliability: Non-Stop Operation is the Ancelus Goal.

2019年9月20日

Reliability: Non-Stop Operation is the Ancelus Goal.

Uptime is important. In operational systems it can be critical.

1 条评论
Speed is the Reason for any Database

2019年9月16日

Speed is the Reason for any Database

Speed is the most fundamental measure of database performance. The patented Ancelus database handles a simple R/W…

1 条评论
Jobs Reported at +313,000 for February - Greatly Understated

2018年3月10日

Jobs Reported at +313,000 for February - Greatly Understated

February jobs reported at +313,000. Real number was +785,000.

1 条评论
Supply Chain Management Systems Have Promised to Transform Business for 25 Years. Where are the Results?

2018年3月2日

Supply Chain Management Systems Have Promised to Transform Business for 25 Years. Where are the Results?

Two new papers measure the impact of SCM systems on durable goods performance in the US. The amount of inventory is…

See all articles

The Importance of Lists, Schema Demo

John Layden

John Layden的更多文章

社区洞察

其他会员也浏览了

An In-depth Look at Apollo Client for Angular Applications

From Excel to Turtle With NodeJS

Building Real-Time Applications: Harnessing the Power of Google Realtime Database in Full Stack Development

YAML

Spring Boot Projections Uncovered: How to Fetch Just What You Need

LRU Cache Explanation, Application and Design - By Ashay Nayak

Moving from Sync to Async in FastAPI with SQLModel—What you Need to Know

Mastering Data Persistence in PyQt5 with QSettings

Mastering the Array Data Structure: A Comprehensive Guide

Stack and Heap Memory in .NET

John Layden的更多文章

Ancelus Integration with Existing Oracle or DB2 Applications

What’s Special About Ancelus Concatenated Keys?

Complexity: Major Relational Challenge Solved by Ancelus

Big Data Puts DBAs in a Vice: A New Approach Emerges

Table Joins: Benchmarks are Fine. What About Doing Real Work?

Reliability: Eliminating Planned Downtime

Reliability: Non-Stop Operation is the Ancelus Goal.

Speed is the Reason for any Database

Jobs Reported at +313,000 for February - Greatly Understated

Supply Chain Management Systems Have Promised to Transform Business for 25 Years. Where are the Results?

社区洞察

其他会员也浏览了

An In-depth Look at Apollo Client for Angular Applications

From Excel to Turtle With NodeJS

Building Real-Time Applications: Harnessing the Power of Google Realtime Database in Full Stack Development

YAML

Spring Boot Projections Uncovered: How to Fetch Just What You Need

LRU Cache Explanation, Application and Design - By Ashay Nayak

Moving from Sync to Async in FastAPI with SQLModel—What you Need to Know

Mastering Data Persistence in PyQt5 with QSettings

Mastering the Array Data Structure: A Comprehensive Guide

Stack and Heap Memory in .NET