Governed or Federated Data Platforms using Data Mesh
Kamal Singh
Strategic Enterprise Architect | Cloud Visionary | Expert in Business-IT Alignment & Program Management ???
The architecture of data platforms is essential for optimizing data assets in modern enterprises. The traditional monolithic data warehouse model is giving way to more flexible, distributed approaches, with the Data Mesh emerging as a prominent alternative. This evolving framework decentralizes data ownership and governance, enhancing scalability and adaptability. This article will explore three distinct variations of Data Mesh architectures, each offering unique strategies for integration and governance in contemporary data environments.
1. Full Integration with Centralized Governance
The first flavor of Data Mesh-based data platforms is the Full Integration with Centralized Governance model. This approach aims to create a single data platform for the entire organization, incorporating globally distributed data pots and sources. It closely resembles the traditional data warehouse concept but with a modern twist. The ultimate goal is to develop an integrated data model that facilitates seamless data access and analysis across the organization.
In this model, centralized governance ensures consistency, security, and compliance. All data sources and datasets are managed under a unified framework, providing a holistic view of the organization's data landscape. This approach benefits organizations seeking to maintain strict control over their data assets while enabling broad data accessibility.
Pros
Cons
2. Loose Integration with Delegated Governance
The second flavor, Loose Integration with Delegated Governance, offers a more flexible approach. Here, different data pots function as individual data platforms, but key datasets are consolidated into a central data platform. The data model is owned by the data source, allowing for the creation of various persistent views tailored to specific needs.
In this setup, governance is delegated, meaning the responsibility for managing data models and mappings lies with the data sources themselves. The central data platform serves as a hub, linking to these decentralized data sources. This reduces the overhead associated with traditional ETL processes, enabling quicker and more efficient data integration. Organizations adopting this model can achieve a balance between control and flexibility, ensuring that data is managed and utilized effectively.
领英推荐
Pros
Cons
3. Locally Loose and Globally Integrated
The third flavor is particularly beneficial for globally distributed organizations that require both local flexibility and centralized control. This model, Loose for Local and Full for Group with Delegated Governance, allows local centers to create data platforms tailored to their specific business use cases and technology preferences. These local platforms operate independently but interface with a central data platform for key datasets related to risk, finance, and compliance reporting.
In this approach, local data models are maintained by the respective centers, ensuring they are optimized for local requirements. However, when it comes to sharing data centrally, a consistent data model is used. Data is copied to the central platform, where it is governed and managed according to organizational standards. This model provides the best of both worlds: local operational flexibility and centralized oversight for critical datasets.
Pros
Cons
Head Data Operations and Strategy
3 个月Good summary of the different data mesh variants. It shows that there is not the one Implementation but that it should be designed to suit to the company's setup.
Interesting!