DATA LINEAGE ANALYTICS

No alt text provided for this image

DATA LINEAGE ANALYTICS

By W H Inmon

For a hundred reasons, data travels across the corporation. Sometimes data movement is needed for transformation. Sometimes data movement is needed for simple extraction and export of data. Sometimes data needs to be archived. There are a whole host of reasons why data travels across the corporation.

And who needs to understand this transport of data? The answer is – the analyst who wishes to do a serious job of analyzing data and turning that analysis into business value. In order for the analyst to know what data he/she has in their hand when doing an analysis, the analyst must know the source of the data.

But merely knowing the source of the data is not enough either. The analyst must also know the summarizations and aggregations that have occurred to the data along the path of transport. In addition, the analyst must know the selection criteria of data that has been transported. The analyst needs to know when the transport has been done. The analyst must know what data has been included and what data has been excluded in its journey across the corporation.

All of these factors are absolutely necessary for the serious data analyst to do his/her job properly.

But there are many obstacles to understanding the lineage of data. Some of the obstacles are –

1)?????Data is transported over dbms lines. Data starts out under dbms ABC and ends up in dbms BCD. And there are some significant differences between dbms ABC and BCD.

2)?????No documentation.?No one ever documented the programs used in transportation.

3)?????Inaccurate documentation. The systems have been documented but not accurately.

4)?????Algorithmic inconsistency. The algorithm for managing data in one location is different from the algorithm used elsewhere.

For these reasons and plenty more, data lineage is a problem in every shop and every organization.

Come hear world class expert Robert Scott of Eon Collective tell you his thoughts on managing data lineage. Robert will be speaking at the DATA ENGINEERING + DATA ARCHITECTURE + DATA?LINEAGE workshop on May 22, 2023 starting at 10:00 am Denver time. This 4 hour event will also feature Joe Reis and Bill Inmon.

In addition to hearing presentations, you will have a question and answer opportunity with Joe, Bill and Robert.

If you are not in a time zone where this workshop is convenient, you may receive a recording of the event if you sign up.

For more information go to –

https://www.eventbrite.com/e/data-engineering-data-architecture-and-data-lineage-registration-601729377767?utm-campaign=social&utm-content=attendeeshare&utm-medium=discovery&utm-term=listing&utm-source=cp&aff=escb

Fabio Oliveira, MSc

Arquiteto de dados | Head Data & Analytics | Data Anaytics | Professor | Autor

1 年

Great thank you Bill Inmon! See this Diego Brescancin.

回复
Yogesh Pandit

I can help you to convert data and AI to $ with a focus on Trust & Safety, and ROI. ?? Author & Patents : Data, AI & Trust Algos | ?? AI Innovator & Investor | ?? Board & C-level Innovation Advisor

1 年

Bill Inmon : I agree ?? with you. The new world of AI will change the way it’s done. Don’t you think so?

回复
MOHAMADOU SIDIBE

Senior Data Gouvernance Consultant.

1 年

Beyond the need for the data analyst who wants to do serious work analyzing data and turning that analysis into business value. I think that knowing the data sources, the aggregations that occurred on this data throughout its transformation phase is also essential for better data governance. Looking forward to the DATA ENGINEERING + DATA ARCHITECTURE + DATA LINEAGE workshop.

Matt M.

Revenue Operations Leader | Revenue Engines Optimized by AI

1 年

Wish I could make it - Joe Reis ?? and you in the same room are the perfect osmotic conditions for data engineering evolution

要查看或添加评论,请登录

社区洞察

其他会员也浏览了