SLOW CHANGING DIMENSIONS
SLOW CHANGING DIMENSIONS

SLOW CHANGING DIMENSIONS

Slow Changing Dimensions is also known as SCD and is used in Data Warehouse concept. Folks in Data World are very much aware of this term.

Before we understand SCD, one must understand Fact and Dimensions. In dimensional Data Warehouse or in a Data Mart (explained in separate topic), there are two types of schema(s) i.e., Star and Snowflake Schema(s). Both are build based on Facts and Dimensions.

●???????Facts: This table keeps the transaction’s lowest level of data which can be numeric or non-numeric like sales numbers, number of clicks on social media portal, number of searches in google etc. It’s mainly loaded in append mode, means rows always gets inserted. This table is very fast-moving table and rows keeps coming in. There are hardly any organization which update and delete data from Fact Tables.

●???????Dimension: This table on the other hand keeps data element by which Fact can be analyzed e.g., sale by time, sales by region, sales by the product etc. So, Time, Regions, Products etc., are dimension. In contrast with Fact table, frequency of rows coming in these tables are very low. For Example, once we have loaded regions then when often regions will change ??. Once Time is loaded, it will never change. At the same time, Products definitions can change. A company today have 100 products, in future it can have more. Then attributes of one product can change in future a well, so there are three ways to keep the history of changed in Dimensions which are handled by principles managed by SLOW CHANGING DIMENSIONS (SCD).

Three types of SCD(s)

1.??????SCD 1: This type keeps only the latest status e.g., if product code was 001 and it changed to 002 then 003 and then 004. It will keep 004 only.

2.??????SCD 2: This type keeps complete history of every change e.g., if product code was 001 and it changed to 002 then 003 and then 004. It will keep all changes i.e., 001, 002, 003 and 004.

3.??????SCD 3: This type keeps first and latest status of changes e.g., if product code was 001 and it changed to 002 then 003 and then 004. It will keep 001 and 004 only.

Cheers.

要查看或添加评论,请登录

Mustafa Qizilbash的更多文章

  • Is Your Organization Drowning in Data Products?

    Is Your Organization Drowning in Data Products?

    The Hidden Cost of Data Product Sprawl: How to Regain Control In today's data-driven world, organizations are…

    6 条评论
  • Data Products Don't Last Forever. Are Yours Outdated?

    Data Products Don't Last Forever. Are Yours Outdated?

    In today's data-driven world, organizations often invest heavily in building and maintaining data products—dashboards…

    2 条评论
  • RETURN ON INVESTMENT (ROI)

    RETURN ON INVESTMENT (ROI)

    In today’s data-driven economy, organizations are investing heavily in data platforms, tools, talent, and governance…

    6 条评论
  • Productionization via Product (PVP) Approach

    Productionization via Product (PVP) Approach

    Traditional data and AI development processes often involve multiple environments — development, testing, and…

    3 条评论
  • Data Products with Challenges

    Data Products with Challenges

    In today’s data-driven landscape, organizations heavily rely on data products to drive insights, improve efficiency…

    6 条评论
  • Common Pitfalls when evaluating and decommissioning data products & How to Avoid

    Common Pitfalls when evaluating and decommissioning data products & How to Avoid

    Even with a structured approach, organizations often encounter challenges when evaluating and decommissioning data…

    2 条评论
  • A Lifecycle Framework for Evaluating and Decommissioning Data?Products

    A Lifecycle Framework for Evaluating and Decommissioning Data?Products

    A structured lifecycle approach ensures efficiency, accountability, and minimal disruption when evaluating and retiring…

    2 条评论
  • Types of Data Products to Decommission

    Types of Data Products to Decommission

    Not all data products remain valuable indefinitely. As businesses evolve, certain data assets become obsolete…

  • The Need for Evaluating and Decommissioning Data Products

    The Need for Evaluating and Decommissioning Data Products

    1. The Challenge of Data Product Sprawl Organizations tend to accumulate numerous data products over time for several…

    4 条评论
  • Impact & Governance

    Impact & Governance

    As organizations strive to become data-driven, the ability to measure, govern, and optimize data initiatives is…

    2 条评论

社区洞察

其他会员也浏览了