The Difference Between System of Record and Source of Truth

The Difference Between System of Record and Source of Truth

Previous Post: Organization Dynamics and Data Strategy

 

In our earlier posts, we discussed the three main components for adding value to data: Business Processes and Analytics which are both supported by Data Services.

As we design and architect the data strategy components, we will invariably face this critical question:

From where should we source the data?

 

 

To answer this question, we usually comb through the inventory of all the applications in our enterprise trying to identify the single data source. In doing so, we are often faced with two challenges:

  1. In some cases, there is no single system that has the complete data set. Bits and pieces of the required data is stored in multiple systems. To make matters worse, we cannot easily stitch the data together to make a complete meaningful data set.
  2. In other cases, the same data with the same attributes are available in multiple systems. But, the data values are different and we are not sure which among them is accurate.

If we start designing integrations or building reports or architecting applications without answering the data source question, we will deliver sub-optimal solutions in siloes. Each solution will tell a different story because the underlying data is not the same. And, we lose credibility with our business users.

To answer the data source question, we will have to understand the concepts of System of Record and Source of Truth and how they are same or different.

This distinction is critical because the below Data Services are dependent on this definition:

  • Integrations
  • Conversions
  • Quality
  • Master Data Management (MDM)
  • Governance
  • Discovery
  • Coverage

 

Definitions:

System of Record (SOR): A system of record is the authoritative data source for a given data element or piece of information.

Stating it a bit differently, it is the data repository where the data object, as a whole or specific attributes of a data object, are maintained. This maintenance includes data creation, updating, modifying and deleting.

The SOR is the data source that is used for audit or regulatory reporting purposes.

Source of Truth (SOT): The source of truth is a trusted data source that gives a complete picture of the data object as a whole.

 

We will discuss a couple of scenarios – one where the SORs and SOT could be different and another where both of them could be same.

 

SOR different from SOT

Let us take the example of items (or parts). An item is a component of a product that our organization designs, manufactures, and sells to the customers.

In the product life-cycle, an item is born in a design/engineering system like PLM and goes through the business processes of manufacturing in an ERP application, sales in a CRM application, and finally shipment in a TM application. During each and every step of the business processes, new attributes could be added or existing attributes could be modified.

So a single data object like an item has attributes stored in different systems. There is no single system that gives the complete view of the item, i.e. we do not have a single SOT.

At this point, we may hear traditional MDM proponents say that we need to create an item master MDM system where all these attributes are managed centrally. For the past couple of decades, many organizations did embark on this journey to create a central item master.

Having said that, if our organization portrays characteristics of a type 2 organization as explained in our earlier post on Organization Dynamics, this type of central MDM solution rarely works. Due to the dynamic nature of our organization, it is not possible to sustain this MDM solution. This solution has significant governance overhead and cannot quickly respond to ever changing business environment and imperatives.

The best option in this case would be to create a SOT by compiling item attributes from the different item SORs.

It is this SOT that would be the source of data for BI/Data Science and Data Integrations.

This is what Gartner refers to as a Consolidation MDM.

 

SOR same as SOT

For this scenario, let’s consider another data object like the chart of accounts (COA). The COA is normally maintained in the financial/general ledger module or a dimension management system.

This data is normally managed centrally by a select few team members. The data is published to all the subscribing processes and applications.

In this case the System of Record (SOR) and Source of Truth (SOT) are the same.

 

Delivering Sources of Truth

Most of our users work on business processes that span multiple applications. The expectations from the business users are:

  • Clearly identify all the SOTs
  • Single system to access all the SOTs

This is where a data repository helps us in delivering SOT data to the business users. This could be an operational data store, data warehouse, or data lake.

This is also known as an Analytical MDM.

 

We have discussed the nuances of SORs and SOTs. If we recollect our earlier post on “Data as an Asset Strategy” and it various components and sub-components, we can now conclude that the Analytics Data Services is critical to delivering SOTs to the business.

The operational data store or data warehouse or data lake is part of Analytics Data Services.

Whether it is Business Intelligence, Data Science or Data Integration to downstream systems, the Analytics Data layer plays a central role in this strategy.

 

 

 

Next Topic: Analytics Data Services

Very relevant comment. Clear, logical and easy to understand, Santosh. In the oil and gas business how can we improve alignnmet with the TM (data distribution and data flow pipelines) with MDM and good governance. For me the Design (PLM) elements and inextricably linked with TM elements. Can this be decoupled.

回复
Douglas Luan

Founder | Engineering | Machine Learning

9 个月

Hi Mr. Kudva. Thank you for the article, it was very helpful. I am interested in the origin of the term System of Record. The earliest reference I found so far was this: https://web.archive.org/web/20090530080907/https://www.information-management.com/issues/20030501/6645-1.html. Do you happen to know who coined the term and in what context?

回复
Fatin Munirah Ahmad

Manager - Master Data, Compliance & Analytics at First Solar

1 年

A good read and clear explanation of SORs and SOT ??

回复
Abhishek Singh

SDE- 3 @ Expedia | NodeJS Java MySQL React | AWS Certified Solution Architect - Associate

1 年

The link to the previous blog/document is not working, it is leading to a 404 Page not found

回复
Dan Avilla

Digital Transformation Leader , IT Director, Enterprise Applications Architect and Customer Success Mgr. Titles don't mean much to me, I just enjoy rallying the troops and finding ways to do things smarter and faster.

2 年

I like the article it is clear and simple. However, I fear that the last section, "Delivering Sources of Truth" may leave people with the impression that MDM is not necessary. I don't believe that this is what you are saying but a fast scan of the article may give that impression. Analytics and/or Finance are almost always the drivers behind an MDM implemention (in my experience anyway). Those functions rely on removing ambiguity from the data. eg. Who are my top customers cannot be determined if the Order Mgmt System records sales for one actual customer under 50 different accounts. I know that I'm speaking to choir here in this group. I just wanted to highlight a possible misinterpretation that people might have.

要查看或添加评论,请登录

Santosh Kudva的更多文章

  • Embedded Governance - Part 1

    Embedded Governance - Part 1

    Previous Article: Analytics Platforms - Central, Federated or Distributed This is a continuation on the subject of Data…

    12 条评论
  • Analytics Platforms - Central, Federated or Distributed

    Analytics Platforms - Central, Federated or Distributed

    Previous Article: Cost of Poor Data Quality As it pertains to this discussion, we are referring to organizations with…

    6 条评论
  • Cost of Poor Data Quality

    Cost of Poor Data Quality

    Previous Article: Success rate of Data Science initiatives Many of us might be aware of the term COPQ (Cost of Poor…

    23 条评论
  • Success rate of Data Science initiatives

    Success rate of Data Science initiatives

    Previous Article: Role of Leadership in Data Analytics We are seeing increasing use of Data Science(DS) / Machine…

    20 条评论
  • Role of Leadership in Data Analytics

    Role of Leadership in Data Analytics

    Previous Article: Data Warehouse or Data Lake? We started this article series with the title “Data as an Asset.” It is…

    23 条评论
  • Data Warehouse or Data Lake?

    Data Warehouse or Data Lake?

    Previous Article: Adopting Self-service Analytics Recently, I had the opportunity to attend an event with a group of…

    10 条评论
  • Adopting Self-service Analytics

    Adopting Self-service Analytics

    Previous post: Data Governance - A Balancing Act. We often hear the term “self-service” used in relation to analytics.

    12 条评论
  • Data Governance - A Balancing Act

    Data Governance - A Balancing Act

    Previous Post: Learning to Unlearn Everybody involved in the data space is familiar with data governance. It is a topic…

    11 条评论
  • Learning to Unlearn

    Learning to Unlearn

    Previous Article: Difference between System of Record and Source of Truth Background A few weeks back, as one of the…

    3 条评论
  • Organization Dynamics & Data Strategy

    Organization Dynamics & Data Strategy

    Earlier Post: Extracting Data Value(2) - Data Services In our previous posts we defined the components and…

    6 条评论

社区洞察

其他会员也浏览了