Design a Data Mesh Architecture in Practice

Data Mesh vs Centralized Data Model

Long-standing relational databases and transactional architectures have served, and still serve, a wide variety of use cases well. Once organizations understood that there is a lot of value in the data itself, however, analytical use cases brought different requirements and, consequently, different architectures.

Architectures evolved from batch processing and Lambda architectures to Kappa and microservices architectures, each emerging to address the big data challenges facing the business.

The "single source of truth" approach also emerged to make sure everyone sees one unified view of the data by centralizing it in a data lake: one team produces the data, and everyone else consumes it.

In reality, however, fully centralized data lakes leave clear gaps between business areas and IT. A single IT or data engineering team builds data pipelines in the hope that lines of business (LOBs) and executives will get the full benefit of the data. Because the gap is so big, in most cases this does not happen, and the reason is simple: the people who produce the data and make it ready are not the ones who actually use it.

The Data Mesh architecture concept aims to close this gap: organizations treat data as a product, not merely as an asset. This is where we believe we get closer to a democratized, data-driven business.

One of the most important capabilities Data Mesh enables is "data autonomy": building a self-service data infrastructure goes a long way toward democratizing data in practice.


Data Mesh Architecture in Practice

I have seen plenty of scenarios where building an architecture with a domain-based, data-product approach looks easy in theory. For many organizations, however, turning those theoretical concepts into reality has been a real challenge.

The cloud has reduced the complexity of building flexible data architectures. That flexibility, together with the governance and security options, has been critical for building new approaches that turn these ideas into reality.


Some points to consider:

  • Data mesh is a pattern for defining how organizations can organize around data domains with a focus on delivering data as a product. However, it may not be the right pattern for every customer.
  • The Lake House approach with a foundational data lake serves as a repeatable blueprint for implementing data domains and products in a scalable way.
  • The way we look at data here is different, and the way each LOB works is also different; LOBs should own their data product end to end, from build to release (a minimal layout sketch follows this list).
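
As a rough illustration of the points above, here is a minimal sketch of what a domain/data-product layout on a foundational data lake could look like. All names (domains, teams, storage paths) are hypothetical assumptions, not a prescribed blueprint.

```python
# A minimal, hypothetical sketch of a domain/data-product layout on a
# foundational data lake. Domain names, team names, and storage paths are
# illustrative only.

DATA_MESH_BLUEPRINT = {
    "claims": {  # an example LOB / data domain
        "owner": "claims-data-team",
        "products": {
            "approved_claims": {
                "storage": "s3://lake-claims-domain/products/approved_claims/",
                "format": "parquet",
                "published": True,   # visible to other domains
            },
        },
    },
    "marketing": {
        "owner": "marketing-analytics-team",
        "products": {
            "campaign_performance": {
                "storage": "s3://lake-marketing-domain/products/campaign_performance/",
                "format": "parquet",
                "published": False,  # still internal to the domain
            },
        },
    },
}


def published_products(blueprint: dict) -> list[tuple[str, str]]:
    """Return (domain, product) pairs that are published to consumers."""
    return [
        (domain, name)
        for domain, spec in blueprint.items()
        for name, product in spec["products"].items()
        if product["published"]
    ]


if __name__ == "__main__":
    print(published_products(DATA_MESH_BLUEPRINT))
    # [('claims', 'approved_claims')]
```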

The following are user experience considerations:

  • Data teams own their information lifecycle, from the application that creates the original data, through to the analytics systems that extract and create business reports and predictions. Through this lifecycle, they own the data model, and determine which datasets are suitable for publication to consumers.
  • Data domain consumers or individual users should be given access to data through a supported interface, like a data API, that can ensure consistent performance, tracking, and access controls.
  • All data assets are easily discoverable from a single central data catalog. The data catalog contains the datasets registered by data domain producers, including supporting metadata such as lineage, data quality metrics, ownership information, and business context (a registration sketch follows this list).
  • All actions taken with data, usage patterns, data transformation, and data classifications should be accessible through a single, central place. Data owners, administrators, and auditors should be able to inspect a company’s data compliance posture in a single place.
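
To make the producer side of these considerations concrete, here is a minimal in-memory sketch of a domain team registering a data product in a central catalog together with its supporting metadata. The `CatalogEntry` fields and the `Catalog` class are illustrative assumptions, not any specific catalog product's API.

```python
# A minimal in-memory sketch: a producing domain registers a dataset in a
# central catalog with owner, lineage, quality metrics, and business context.
from dataclasses import dataclass, field


@dataclass
class CatalogEntry:
    name: str                      # e.g. "claims.approved_claims"
    owner: str                     # producing domain team
    storage_uri: str               # where the product physically lives
    business_context: str          # what the dataset means to the business
    lineage: list[str] = field(default_factory=list)        # upstream sources
    quality_metrics: dict[str, float] = field(default_factory=dict)


class Catalog:
    """Single central place where all data products are discoverable."""

    def __init__(self) -> None:
        self._entries: dict[str, CatalogEntry] = {}

    def register(self, entry: CatalogEntry) -> None:
        self._entries[entry.name] = entry

    def search(self, keyword: str) -> list[CatalogEntry]:
        keyword = keyword.lower()
        return [
            e for e in self._entries.values()
            if keyword in e.name.lower() or keyword in e.business_context.lower()
        ]


catalog = Catalog()
catalog.register(
    CatalogEntry(
        name="claims.approved_claims",
        owner="claims-data-team",
        storage_uri="s3://lake-claims-domain/products/approved_claims/",
        business_context="Claims approved in the last 24 months, one row per claim.",
        lineage=["claims.raw_claims", "claims.adjudication_events"],
        quality_metrics={"completeness": 0.998, "freshness_hours": 24.0},
    )
)

print([e.name for e in catalog.search("claims")])
# ['claims.approved_claims']
```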

Data Consumer 1 and Data Builder 1 (the producers) belong to a single domain/department. The idea here is to remove the gap between those two entities: in practice, it means giving domain areas more autonomy and flexibility so they can make the most of their data by creating their own "product".
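
As a small illustration of that idea, the sketch below shows a consumer within the same domain reading a published product through a governed access function that checks grants and records usage centrally. Consumer names, grants, and the logging approach are hypothetical assumptions, not a specific platform's mechanism.

```python
# A small, self-contained sketch of the consumer side within one domain:
# "Data Consumer 1" reads the product that "Data Builder 1" published, via a
# governed access function that checks grants and records usage.
from datetime import datetime, timezone

# Which consumers have been granted access to which data products (assumed).
ACCESS_GRANTS = {
    "data_consumer_1": {"claims.approved_claims"},
}

USAGE_LOG: list[dict] = []  # stand-in for centralized audit logging


def read_product(consumer: str, product: str) -> str:
    """Return a handle (here just the storage URI) if access is granted."""
    if product not in ACCESS_GRANTS.get(consumer, set()):
        raise PermissionError(f"{consumer} is not granted access to {product}")
    USAGE_LOG.append({
        "consumer": consumer,
        "product": product,
        "at": datetime.now(timezone.utc).isoformat(),
    })
    # In a real platform this would return a table or dataframe via a data API.
    return "s3://lake-claims-domain/products/approved_claims/"


print(read_product("data_consumer_1", "claims.approved_claims"))
print(len(USAGE_LOG), "access event(s) recorded")
```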


Many thanks!

Rosane Ricciardi

CDAO at Amil Group | 2024 Global Top100 Innovators in Data & Analytics by #Corinium | 2022 Global Top 100 Leading Enterprise Data Leaders by #CDOMagazine

2y

Excellent, Arvin. I still see the challenge of keeping the catalog up to date versus the speed at which we ingest into the data lake. I also still see companies with a lot of demand for a centralized semantic layer to have a single version of the numbers… but I see demand for self-service growing… and once again, the semantic layer and the catalog are the key actors in making this work. Kisses, Ro

Ed Carter

Data Management, Faithlife, LLC Founder/CEO, CartersFarm.Software - a small software company with small ideas. Aspiring Cartoon Mime Voice Actor.

2y

I think it is important to note that the reference to "business context" needs itself to be managed within an overarching ontology so semantic meaning can be established consistently for all of the self-service actors. In practical terms, lack of collaborative attention in this area has been the weak spot in many of the data architecture projects I've seen. Serious work using Knowledge Graphs or other semantic disciplines is essential to actually implementing data lakes, lake houses, or other approaches to data as a product. #ontology #knowledgegraph #semantic

Tiago Gorjon

Coordenador de Sistemas @ Vivo | DEVOPS TEAM

2y
Alberto Cardoso

Solutions Architect at Databricks

2y

Can we start a "pagodata" band called Data Mesh e Remesh??
