What are the main building blocks of Data Warehouse models?

What are the main building blocks of Data Warehouse models?

In the previous and first blog in this series, we have tried to answer the question Why do we need Data Warehouse models? There was a lot of likes, shares and comments on that one and you can find the article here

Although I thought that in next blog in the series I will address benefits of implementation, I will focus on model architecture and continue the analogy that I have used in the first blog – how is building a house similar to building a successful Data Warehouse.

Long time ago (in 1998, to be precise) I went to SAS European User Group conference in Hague where Bill Inmon was a keynote speaker and that was the first time I have seen him presenting. He said at this lecture something very important that I have remembered: “You cannot build the house without foundations. I have never seen beautiful foundations - they are always a lot of mud, steel and concrete. But only on good foundations you can build good and beautiful house.

The same logic applies for Data Warehouse – if you want to build good Data Warehouse, you need to have good foundations. In Data Warehouse context, foundations are the master data that business needs, that needs to be modeled right and cleansed. Master data include all the data about time periods, locations, product and service catalogs, involved parties and types of events and statuses.

After that, you will build the ground floor of your house where you spend most of your time doing daily business, or the base layer of your Data Warehouse. This is the detailed data that you will load from your source systems or external sources, integrate and cleanse to fit in your data model. This will include detailed data about your customers, their accounts and transactions, invoices and payments and financial data.

Finally, you will build your top floor where you will have your private area, and that is derived data layer in your Data Warehouse. In this area you are creating derived data marts or data sets that combine data from various parts of foundation and base layers, enabling you to do advanced analytics like profitability and risk calculations or predictive models about customer behavior, or calculate KPI's.

You can see sample organization of Data Warehouse model for a bank in following image:

When creating the model, you will use entities and relationships that will be represented in your Data Warehouse as tables and primary-foreign key relationships. You can think about entities as rooms where you can store your things, and relationships are like doors that are connecting those rooms. Every room has one or more doors that is connecting the room with other rooms and you will use those doors to move and communicate between rooms. Each room has its own functionality – in same ways entities are representing different business subjects.

Within the big house, rooms are used to support different functionalities, such as dining, living room, guest rooms or bedrooms. In a same way, entities and relationships are grouped in Subject Areas that are covering the requirements of specific business area – Sales, Inventory or Contact Center. In model image above, subject areas are represented as small boxes.

In next blog we will discuss more about model building blocks and modeling patterns that are used for model design. At some point later, we will also discuss the benefits of the implementation of standard industry Data Warehouse model.

Dra?en Ore??anin / DWH architect / more information at www.dwh-models.com

Arnélio Chitombe

South Africa (SA): Demand Planning Executive at BAT

6 年

Láudio Mufume check this.

回复
Vithal Desai

Consultant, TCS

7 年

Great article Drazen. What a nice and simple way you have articulated. Thanks for publishing this article.

要查看或添加评论,请登录

Drazen Orescanin的更多文章

  • About startups, accelerators and ecosystems

    About startups, accelerators and ecosystems

    It is the middle of November in the year 2022 in Bucharest, Wednesday around noon, a seemingly ordinary day. In one of…

    8 条评论
  • CEO to CEO relationship

    CEO to CEO relationship

    We are living in turbulent times. After two years of Covid-19 pandemics we are now witnessing the war in Ukraine…

    2 条评论
  • Raise of Chief Analytics Officer

    Raise of Chief Analytics Officer

    I am sure that all the birds in the trees already know everything about explosion of data that hit our World in last…

    4 条评论
  • Is it a time for Data Lakehouse?

    Is it a time for Data Lakehouse?

    There was a lot of discussions about Big Data killing Data Warehouse in last few years. Of course, that was not…

  • OMG! My DWH / BI project went wrong!

    OMG! My DWH / BI project went wrong!

    Have you ever been involved in project that went wrong? You are not the only one! In previous blogs in this series, we…

    4 条评论
  • Two ways how to get things done

    Two ways how to get things done

    You must have heard hundreds of times that there are two types of people or two ways you can approach a girl or…

    2 条评论
  • Risks, costs, quality and timeframes in DWH projects

    Risks, costs, quality and timeframes in DWH projects

    In first two blogs in this series, we have tried to answer the questions Why do we need Data Warehouse models and What…

    2 条评论
  • Why do we need Data Warehouse models?

    Why do we need Data Warehouse models?

    This is the first blog from my new series about Data Warehouse models and modeling. In this series I will cover the…

    42 条评论

社区洞察

其他会员也浏览了