Cloud Data Warehousing—So What Is New?

Barry Devlin

Data Architect, BI and DW Analyst and Consultant, Author and Speaker

Published: July 4, 2023

I guess you’ve got the message by now! There are lots of aspects of cloud data warehousing that carry over directly and without change from “traditional” data warehousing: the purpose and principles, the conceptual architecture, and the identification of the three+ domains of data and information. So, what is new that merited a book on the topic? As seen in “Cloud Data Warehousing—Volume I: Architecting Data Warehouse, Lakehouse, Mesh, and Fabric” (available now), the differences appear at the level of the logical architecture and technology. As you might expect.

A logical architecture is a high-level, functional view of what IT must design and build to meet the business needs expressed in the conceptual architecture, taking into account the limitations of current technologies and the expectations of what they might deliver in the short to medium term. And what the logical architecture for cloud data warehousing must consider is, well, that fuzzy word preceding data warehousing: yes, cloud, and the distributed nature of the concept, as well as the emerging technologies seen in cloud computing.

In the “traditional” data warehousing logical architecture, the three data/information domains shown in my previous article become three separate pillars (representing the different technological bases needed) united by shared context-setting information (CSI).

In comparison, the logical information architecture for cloud pictured above shows two key differences. The first is the introduction of planes, suggesting that a set of pillars can exist independently in multiple cloud environments, as well as on premises. CSI does, of course, need to virtually span these planes, as it does the pillars within any environment. The significance is not that there are multiple planes, but rather that the pillars have substantially the same meanings in the cloud as on premises.

The second difference is more impactful. The lower parts of the pillars are now conjoined. This arises from a significant change in technology. Object stores and open-source componentry are used in cloud data warehousing as the underlying storage substrate beneath different data management technologies, such as relational databases and other tools. The consequence is that the selfsame data can be used and reused by different types of processing technologies, a key foundation for the data lakehouse pattern.
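To make the idea of conjoined storage a little more concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: a temporary directory stands in for an object store, a CSV file stands in for an open file format, and the names (`orders.csv`, `write_shared_object`, `total_by_customer_sql`, `categories_by_customer`) are hypothetical. An in-memory SQLite database plays the relational engine and plain Python plays a second, non-relational tool. Note that SQLite loads a copy here, whereas a real lakehouse engine would typically query the stored files in place.

```python
import csv
import sqlite3
import tempfile
from collections import defaultdict
from pathlib import Path


def write_shared_object(store: Path) -> Path:
    """Write one data object into the (hypothetical) object store directory."""
    obj = store / "orders.csv"
    rows = [("alice", "books", 30), ("bob", "games", 45), ("alice", "games", 25)]
    with obj.open("w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["customer", "category", "amount"])
        writer.writerows(rows)
    return obj


def read_rows(obj: Path) -> list:
    """Read the shared object; both processing styles start from this one file."""
    with obj.open(newline="") as f:
        return [{**row, "amount": int(row["amount"])} for row in csv.DictReader(f)]


def total_by_customer_sql(obj: Path) -> dict:
    """Relational processing: aggregate with SQL in an in-memory SQLite database."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (customer TEXT, category TEXT, amount INTEGER)")
    conn.executemany(
        "INSERT INTO orders VALUES (:customer, :category, :amount)", read_rows(obj)
    )
    totals = dict(
        conn.execute("SELECT customer, SUM(amount) FROM orders GROUP BY customer")
    )
    conn.close()
    return totals


def categories_by_customer(obj: Path) -> dict:
    """Non-relational processing: a plain Python pass over the same object."""
    seen = defaultdict(set)
    for row in read_rows(obj):
        seen[row["customer"]].add(row["category"])
    return dict(seen)


def demo():
    """Store the data once, then process it with both tools."""
    with tempfile.TemporaryDirectory() as tmp:
        shared = write_shared_object(Path(tmp))
        return total_by_customer_sql(shared), categories_by_customer(shared)


if __name__ == "__main__":
    totals, categories = demo()
    print(totals)
    print(categories)
```

The point of the sketch is only that the data is written once and each tool reads it from the same place, rather than each pillar keeping its own private copy.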

The fact that the pillars remain separate higher up reflects my belief that different technologies will continue to have pros and cons for varied types of processing for the foreseeable future. A good example of this is graph databases. Despite the relational adjective, relational databases don’t manage relationships very well. The “relation” in relational databases comes from the mathematical concept of a relation or set. The relationships represented in graph theory and databases are conceptually different and are fundamental to building the complex structures of networks of related nodes of all sorts. Of particular interest in cloud data warehousing are the ontologies and inter-relationships between people, process, and information in CSI. I suggest this is an area where we’ll be seeing significant advances in the coming years.
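The contrast between the two kinds of "relation" can be sketched in a few lines of Python. The example below is illustrative only: the `EDGES` data (people, processes, and information items) is invented, and the function names are hypothetical. It answers the same question, "what can be reached from this node?", twice: once by walking an adjacency structure as a graph tool would, and once with a recursive self-join in SQLite as a relational tool would.

```python
import sqlite3
from collections import defaultdict, deque

# Hypothetical CSI-style relationships between people, processes, and information.
EDGES = [
    ("Ana", "Order Intake"),            # person -> process
    ("Order Intake", "Orders Table"),   # process -> information it produces
    ("Orders Table", "Revenue Report"),  # information -> derived information
    ("Ben", "Revenue Report"),          # person -> information
]


def reachable_graph(start: str) -> set:
    """Graph style: follow relationships directly with a breadth-first walk."""
    adjacency = defaultdict(list)
    for src, dst in EDGES:
        adjacency[src].append(dst)
    seen, queue = set(), deque([start])
    while queue:
        node = queue.popleft()
        for nxt in adjacency[node]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def reachable_relational(start: str) -> set:
    """Relational style: the same traversal as a recursive self-join."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE edges (src TEXT, dst TEXT)")
    conn.executemany("INSERT INTO edges VALUES (?, ?)", EDGES)
    rows = conn.execute(
        """
        WITH RECURSIVE reach(node) AS (
            SELECT dst FROM edges WHERE src = ?
            UNION
            SELECT e.dst FROM edges AS e JOIN reach AS r ON e.src = r.node
        )
        SELECT node FROM reach
        """,
        (start,),
    ).fetchall()
    conn.close()
    return {node for (node,) in rows}


if __name__ == "__main__":
    print(sorted(reachable_graph("Ana")))
    print(sorted(reachable_relational("Ana")))
```

Both functions return the same set of nodes. The difference is in how naturally each expresses the question: the graph version treats the relationship as the primary structure, while the relational version has to rebuild the path by repeatedly joining a table of rows to itself.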

In this article and the last, I’ve slipped into architectural considerations perhaps a little too deeply for some folks. For which I apologize! In the next post of this series, I will return to the lakehouse, mesh, and fabric patterns, and discuss how they differ based on the concepts discussed above at an architectural level.
