In their 2020 paper, “From Ad-Hoc Data Analytics to DataOps,” Aiswarya Raj Munappy, David Issa Mattos, Jan Bosch, Helena Holmström Olsson, and Anas Dakkak define DataOps, explore its core elements, and introduce a five-phase maturity model. The paper is the result of a collaboration between researchers from Chalmers University of Technology, Malmö University, and Ericsson, who combined insights from academic literature with expert interviews to ground their conceptual work in real-world experience.
One outcome is a clear understanding of why organizations invest in DataOps. They aim to “achieve more insights/value cheaper and faster while still keeping the quality.” Researchers and practitioners tend to approach the topic of DataOps from one or more of the following four perspectives:
- The activities of DataOps, i.e., what engineers actually do
- The goals organizations aim for with DataOps
- The technologies used for implementation
- The organizational structures and working methods in this domain
Understanding these facets provides valuable context for navigating the many projects, programs, and sales pitches prevalent in today’s organizations.
The paper also compares DevOps and DataOps, noting that both emphasize agility and collaboration, though there are distinct differences. DevOps integrates development and operations, whereas DataOps combines value pipelines—such as data warehouses or AI systems that quickly analyze camera data from an assembly line to detect irregularities—with innovation pipelines that deliver new analytics ideas.
Based on interviews with Ericsson specialists, the authors outline five DataOps phases, which effectively function as maturity levels:
- Ad-Hoc Data Analytics: At this initial level, engineers perform on-demand queries to answer specific business questions quickly. Customers may select data sources and fields themselves. Reuse of reports and queries is rare.
- Semi-Automated Data Analysis: Here, data pipelines streamline data collection, ingestion, preparation, and visualization (a minimal pipeline sketch follows this list).
- Agile Data Science: The focus shifts to delivering continuous business value through frequent updates. Sprints and a central code repository are the core concepts of this level.
- Continuous Testing and Monitoring: This phase emphasizes ongoing testing and monitoring to keep data pipelines robust. Automated unit and high-level tests, together with monitoring and automatic alerting, maintain the reliability and stability of the pipelines (see the second sketch after this list).
- Full DataOps: The final level reads like the collected Christmas wishes of an entire village. The abstract goal of managing data and code together to shorten delivery times translates into two quite inspiring actions, one organizational and one technological. Organizationally, the authors suggest uniting all data-related specialists into groups aligned with the company’s value stream. Technologically, the focus shifts from data pipelines to data products, which requires orchestrating the delivery of insights and technical changes and benefits from close collaboration among teams. Instead of a large set of separate reports, the result is an ecosystem of interconnected data products whose dependencies form a directed acyclic graph (the final sketch below illustrates this): a powerful way to align DataOps with the concept of data products!
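To make the pipeline idea of the Semi-Automated phase concrete, here is a minimal Python sketch; the function names and sample data are illustrative assumptions, not details from the paper. It chains collection, ingestion, preparation, and visualization into one reusable flow:

```python
# Minimal sketch of the Semi-Automated phase: one pipeline that chains
# collection, ingestion, preparation, and visualization.
# All names and data below are illustrative, not taken from the paper.
from statistics import mean

def collect():
    # Stand-in for pulling raw records from a source system.
    return [{"station": "A", "defects": 3},
            {"station": "B", "defects": None},
            {"station": "A", "defects": 1}]

def ingest(raw):
    # Drop records that cannot be used downstream.
    return [r for r in raw if r["defects"] is not None]

def prepare(rows):
    # Aggregate the average defect count per station.
    stations = {r["station"] for r in rows}
    return {s: mean(r["defects"] for r in rows if r["station"] == s)
            for s in stations}

def visualize(summary):
    # Text "chart" standing in for a dashboard.
    for station, avg in sorted(summary.items()):
        print(f"{station}: {'#' * round(avg)} ({avg:.1f})")

visualize(prepare(ingest(collect())))
```

Compared with ad-hoc queries, every step is now a named, reusable stage, so a new business question mostly means reconfiguring rather than rebuilding.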
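The Continuous Testing and Monitoring phase can likewise be sketched as small checks wrapped around such a pipeline; the helper names and the one-hour staleness threshold below are assumptions for illustration only:

```python
# Illustrative checks for the Continuous Testing and Monitoring phase.
# prepare() mirrors the aggregation step from the pipeline sketch above;
# the alert threshold is an assumption, not taken from the paper.
import time

def prepare(rows):
    # Average defects per station, as in the earlier sketch.
    grouped = {}
    for r in rows:
        grouped.setdefault(r["station"], []).append(r["defects"])
    return {s: sum(v) / len(v) for s, v in grouped.items()}

def test_prepare_averages_per_station():
    # Unit-style test guarding the preparation logic.
    assert prepare([{"station": "A", "defects": 2},
                    {"station": "A", "defects": 4}]) == {"A": 3.0}

def alert_if_stale(last_run_ts, max_age_s=3600):
    # Monitoring hook: flag the pipeline if it has not run for an hour.
    age = time.time() - last_run_ts
    if age > max_age_s:
        print(f"ALERT: last pipeline run was {age:.0f} seconds ago")

test_prepare_averages_per_station()
alert_if_stale(time.time())  # fresh run, so no alert is printed
```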
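Finally, the ecosystem of interconnected data products can be pictured as a directed acyclic graph of dependencies. The sketch below uses invented product names and Python’s standard-library topological sort to derive a refresh order:

```python
# Hypothetical dependency graph of data products: each product lists the
# products it consumes. A topological sort yields a valid refresh order
# and fails loudly if someone accidentally introduces a cycle.
from graphlib import TopologicalSorter  # standard library, Python 3.9+

dependencies = {
    "raw_camera_events": set(),
    "defect_rates": {"raw_camera_events"},
    "line_dashboard": {"defect_rates"},
    "quality_report": {"defect_rates", "raw_camera_events"},
}

refresh_order = list(TopologicalSorter(dependencies).static_order())
print(refresh_order)
# e.g. ['raw_camera_events', 'defect_rates', 'line_dashboard', 'quality_report']
```

Because each product depends only on products upstream of it, the graph stays acyclic and each insight can be refreshed without rebuilding unrelated reports.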