Data Catalog

We have discussed the Data Dictionary and the Data or Business Glossary. So what is a Data Catalog?

Let's decode it.

‘Data Catalog comes under Metadata Management.’

Although the Data Dictionary, the Data or Business Glossary, and the Data Catalog look similar, it is important to understand the differences between them.

A Data Dictionary stores metadata details about the solution, and a Data or Business Glossary is the collection of all business terms used in the solution. A Data Catalog ties the two together: it records, for example, which business term refers to which table or dashboard. In other words, the Catalog gives a complete picture of both the Dictionary and the Glossary.

Image: https://www.alation.com/blog/what-is-a-data-catalog/

Let's take an example of a Bank Account Number (BAN).

● Data Glossary: BAN is the unique identifier for a customer across all banking applications, whether it's HR, Finance, Marketing, etc.

● Data Dictionary: BAN is a unique identifier with field type string and field length 10, containing two # characters, plus other metadata details.

● Data Catalog: It tells the whole story, i.e., BAN is the unique identifier for a customer across all banking applications, it is stored in tables x, y and z, and those tables sit in a particular schema and database. It also records which department owns which databases, tables, dashboards, reports, etc.
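The BAN example can be sketched in a few lines of Python to show how a catalog entry links a glossary term to its dictionary entries. This is only an illustrative sketch, not how any particular catalog product models its data: the class names, the database name `core_db`, the schema name `banking`, the column name `ban`, and the owning department are all hypothetical. Only the tables x, y and z, the string type, and the length of 10 come from the example above.

```python
from dataclasses import dataclass


@dataclass
class GlossaryTerm:
    """Business meaning of a term (the Glossary view)."""
    name: str
    definition: str


@dataclass
class DictionaryEntry:
    """Technical metadata for one physical column (the Dictionary view)."""
    database: str
    schema: str
    table: str
    column: str
    data_type: str
    length: int


@dataclass
class CatalogEntry:
    """Links a business term to where it is stored and who owns it."""
    term: GlossaryTerm
    locations: list[DictionaryEntry]
    owners: dict[str, str]  # database name -> owning department

    def where_stored(self) -> list[str]:
        # Fully qualified path of every column that holds this term.
        return [
            f"{e.database}.{e.schema}.{e.table}.{e.column}"
            for e in self.locations
        ]


ban_term = GlossaryTerm(
    name="Bank Account Number (BAN)",
    definition="Unique identifier for a customer across all banking applications",
)

# "core_db", "banking" and "ban" are assumed names for illustration only.
ban_locations = [
    DictionaryEntry("core_db", "banking", table, "ban", "string", 10)
    for table in ("x", "y", "z")
]

ban_catalog = CatalogEntry(
    term=ban_term,
    locations=ban_locations,
    owners={"core_db": "Finance"},  # hypothetical owner
)

for path in ban_catalog.where_stored():
    print(path)
```

An analyst who only knows the business term can start from `ban_catalog.term` and reach the physical columns and their owner, which is the "whole story" the catalog is meant to tell.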

Knowing what the data is about, how it is stored, and WHERE it is stored gives an added advantage to analysts, designers, and developers. Having a Data Catalog means a Data Dictionary and a Data Glossary are already in place, which is a perfect recipe for eliminating the chance of a Data Swamp. As mentioned in previous topics, with the introduction of semi-structured and unstructured datasets, the Data Catalog has moved from the good-to-have zone to the must-have zone.

‘Both technical and business folks interact with Data Catalog.’

Cheers.

Sebastien (Seb) Thomas

Building the future of data governance @ DataGalaxy | Co Founder & CEO

2 years ago

Thanks Mustafa Qizilbash! Data catalogs can now also inventory dashboards and reports, show data lineage from reports back to the original source, and more! Looking forward to the observability article!
