Data Architecture-as-a-Service: Liberation for Data Users
ABSTRACT: Data architecture-as-a-service (DaaS) is a new self-service paradigm that empowers local data owners to create architecturally compliant data repositories.
We are poised to take a giant leap forward with self-service data and analytics. We’ve developed self-service tools for reporting, analysis, dashboarding, data set preparation, and even data science (e.g., autoML). But what we haven’t delivered yet are no-code tools that enable business users to create their own data repositories without IT assistance.
Versus data silos. Of course, we’ve always had data silos. Business users create them all the time in Excel, easy-to-use databases like Microsoft Access or SQL Server, or data preparation tools like Alteryx or Tableau. What we need are “modern data silos” that enforce architectural integrity and data consistency using common dimensions, definitions, and logic. These self-service structures provide the speed and agility of data silos without the harmful consequences. These “non-siloed data silos” are the essence of what I call “data architecture-as-a-service” or DaaS.
Business-built data domains. Data architecture-as-a-service enables business users to build local data domains or repositories without undermining enterprise data consistency and trustworthiness. It is the culmination of self-service, where business units liberate themselves almost entirely from enterprise IT. If done right, DaaS reduces data bottlenecks, eases the burden on enterprise data teams, and empowers local domains to service their own data needs. It’s also a key ingredient in the data mesh, an emerging distributed architecture for data ownership and management.
Approaches
How is this possible? Here’s the challenge: we can’t expect data analysts to do the work of data architects or data engineers. They don’t know how to design, model, and implement robust, scalable data environments or build data pipelines that reuse standard data flows and naming conventions. We’ve seen what happens when they try: they create brittle, high-risk data silos and pipelines that don’t scale or perform well. But with DaaS, we bake architectural requirements into self-service data engineering tools so business users can create their own repositories without undermining data consistency and trustworthiness.
Software building blocks. In our consulting practice, we’ve seen enterprise data architects create data “building blocks” that departmental analysts use to create extensions to an enterprise data warehouse. The blocks contain governance guardrails that enable analysts to create their own data marts without deep knowledge of SQL, data structures, query logic, or schemas.
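To make the building-block idea concrete, here is a minimal sketch of how architect-approved blocks might be composed by an analyst. Everything here is illustrative, not drawn from any real product: the block names, SQL fragments, and table names are assumptions. The guardrail is simply that the analyst can only reference blocks the enterprise data team has registered.

```python
# Hypothetical sketch: an enterprise data architect registers approved
# "building blocks" (conformed dimension joins, standard metrics), and an
# analyst composes a data mart query from them without writing SQL.
# All names and SQL fragments below are illustrative assumptions.

APPROVED_BLOCKS = {
    # block name -> SQL fragment maintained by the enterprise data team
    "dim_customer": "JOIN dim_customer c ON f.customer_key = c.customer_key",
    "dim_date": "JOIN dim_date d ON f.date_key = d.date_key",
    "net_revenue": "SUM(f.gross_amount - f.discount_amount) AS net_revenue",
}

def build_mart_sql(fact_table: str, metrics: list[str], dimensions: list[str]) -> str:
    """Compose a data mart query from approved blocks only."""
    for block in metrics + dimensions:
        if block not in APPROVED_BLOCKS:  # the governance guardrail
            raise ValueError(f"'{block}' is not an approved building block")
    select = ", ".join(APPROVED_BLOCKS[m] for m in metrics)
    joins = "\n".join(APPROVED_BLOCKS[d] for d in dimensions)
    return f"SELECT {select}\nFROM {fact_table} f\n{joins}"

sql = build_mart_sql("fact_sales", ["net_revenue"], ["dim_customer"])
```

A request for an unregistered block fails loudly, which is the point: the analyst gets self-service speed, but only inside the architect’s guardrails.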
Unfortunately, it’s a heavy lift for most enterprise data teams to create a self-service data infrastructure given competing demands for their time. Fortunately, some vendors have recognized an opportunity and now offer data architecture-as-a-service tools. These products come in a variety of shapes and forms.
DaaS Products
Extensible data models. For instance, cloud data analytics vendors, such as Domo and Infor Birst, provide multi-tenant data environments with extensible data models. This enables primary tenants to propagate a global model to sub-tenants, who can extend that model by adding new columns and tables to support local requirements. The global model rolls down to sub-tenants, while local data and model extensions stay local. This hub-and-spoke approach is ideal for supporting retail and manufacturing distribution networks but can be applied in almost any data environment.
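The roll-down behavior can be sketched in a few lines. This is a hedged illustration of the pattern, not how Domo or Infor Birst actually implement it; the table names, columns, and `tenant_model` function are all hypothetical.

```python
# Minimal sketch of an extensible multi-tenant data model: the global
# model propagates to every sub-tenant, while each sub-tenant's own
# extensions stay local. Table and column names are illustrative.

GLOBAL_MODEL = {
    "orders": ["order_id", "customer_id", "order_date", "amount"],
}

def tenant_model(local_extensions: dict[str, list[str]]) -> dict[str, list[str]]:
    """Return the effective model for one sub-tenant: global columns
    first, then local additions. The global model is never mutated."""
    model = {table: list(cols) for table, cols in GLOBAL_MODEL.items()}
    for table, extra_cols in local_extensions.items():
        model.setdefault(table, [])
        model[table] += [c for c in extra_cols if c not in model[table]]
    return model

# An EMEA sub-tenant adds a column and a purely local table.
emea = tenant_model({"orders": ["vat_rate"], "local_promos": ["promo_id"]})
```

The key property is asymmetry: changes to `GLOBAL_MODEL` reach every tenant on the next roll-down, but `vat_rate` and `local_promos` never flow back up to the hub.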
Self-service data engineering. More recently, data engineering vendors, such as Coalesce and Fivetran, offer multi-code, template-driven toolkits that make it easy for data analysts or domain data owners to create repositories that align with enterprise governance and schema requirements. Most of these tools are cloud-based variants of data integration, data transformation, or data warehouse automation tools.
For example, Coalesce, which launched last month, is a data transformation vendor that offers a more modern version of dbt, a popular, open source data transformation toolkit. Founded by ex-WhereScape employees, Coalesce offers both GUI- and code-based development environments, a column-aware architecture that supports full data lineage, and built-in automation functions. However, what I like best about this new product is that it allows data architects to build architectural guardrails into the GUI-based development environment via templates and other techniques so that business analysts can build architecturally compliant data repositories and pipelines.
Similarly, Fivetran is a data integration vendor that offers a more automated approach to centralizing cloud application data. This makes it possible for a data analyst, rather than a data engineer, to build architecturally compliant data pipelines that move data from a single cloud application into a target database and run pre-built transformation processes to harmonize that data into a common schema. Both Coalesce and Fivetran are harbingers of a booming market for DaaS tools.
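The extract-then-harmonize pattern described above can be sketched as follows. This is not Fivetran's API; the common schema, the field mapping, and the `harmonize` function are assumptions made for illustration. The point is that the mapping is pre-built and architect-maintained, so the analyst only chooses the source and target.

```python
# Hedged sketch of the pattern above: rows extracted from a cloud
# application are harmonized into a common target schema by a pre-built,
# architect-maintained field mapping. All names are hypothetical.

COMMON_SCHEMA = ["customer_id", "email", "signup_date"]

# Pre-built mapping: source application field -> common schema field
FIELD_MAP = {"id": "customer_id", "email_address": "email", "created": "signup_date"}

def harmonize(raw_rows: list[dict]) -> list[dict]:
    """Rename mapped source fields to the common schema, drop the rest,
    and guarantee every common-schema column exists in every row."""
    out = []
    for row in raw_rows:
        mapped = {FIELD_MAP[k]: v for k, v in row.items() if k in FIELD_MAP}
        out.append({col: mapped.get(col) for col in COMMON_SCHEMA})
    return out

rows = harmonize(
    [{"id": 7, "email_address": "a@b.co", "created": "2024-01-02", "internal": "x"}]
)
```

Because unmapped fields like `internal` are dropped and missing columns default to `None`, every source lands in the same shape, which is what makes downstream consolidation trustworthy.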
In transition. Today, a highly motivated data analyst might use any number of GUI-based data engineering tools to build a data pipeline, but there is little chance the result will comply with architectural guidelines or governance standards. That takes a trained data engineer whose work is reviewed by an enterprise data architect. In a data architecture-as-a-service paradigm, by contrast, a data architect configures a DaaS-ready tool to adhere to enterprise data standards and structures, so that a data analyst, rather than a data engineer, can build compliant data pipelines.
Conclusion. Data architecture-as-a-service is a verbal twist on cloud computing monikers such as software-as-a-service and platform-as-a-service. The name conveys that it’s possible to abstract architecture and build it into easy-to-use, customer-facing tools. When we abstract data architecture, we solve the most enduring data pain point: the proliferation of data silos that wreak havoc on data consistency and trustworthiness.