The Pain of Research

At the end of last year, I asked one of our customers, “What was your aha moment when using Artemis?”

Without skipping a beat, he said, “The moment I combed through the insights and realized I no longer needed to keep all this context in the back of my mind. Artemis does the research and then surfaces issues for me.”

This was not what I was expecting. I thought he would highlight all the cool AI agent tech we had built that takes the insights we surface and then resolves the tasks for him. Instead, what won him over was that we did the research for him.

In this context, research means the work needed to understand what problems were occurring in his data platform, why they were happening, and what fix would resolve them.

This pain of research runs deep, and the more we dig into it, the clearer it becomes that it is a huge cost driver for data teams. A dbt Labs survey in 2024 found that data teams spend 26% of their week fixing and maintaining their data platform. This is in addition to the fact that engineers spend 55% of their time maintaining or organizing data sets; combined, roughly 80% of their time is spent on some form of maintenance.

If teams spend roughly 80% of their week on maintenance, a considerable share of that goes to research. Why are teams spending all this time on research?

A Few Possible Reasons for So Much Research

  • You didn’t write it: Most practitioners do not build the platform they hold the keys to. Data engineers spend their time understanding, reverse engineering, and dissecting the work of others to fix, maintain, and improve data platforms.
  • Fragmentation means no one is a master: When building and maintaining a platform of 8-12 tools, each with its own quirks and its own ways of being used efficiently, there is much to learn. As an engineer, you won’t know every tool and won’t have the time to understand the intricacies of each. So, you learn enough to get by and take little refresher courses when you need to solve a specific problem.

This convoluted web of work leads to tech debt, but it's also the environment most data practitioners operate in daily.

The other side of this is the mental drain on you. We like the path of least resistance, and it’s discouraging when a ticket that should take 15 minutes to resolve ends up taking three or more hours because of the added complexity and research needed.

The emotional side of this problem is deep-rooted. It can crush momentum and motivation for a team performing at a high level and seriously derail progress.

Research is necessary for solving problems, and data is no different. However, in the long run, will we continue to do the research ourselves as practitioners, or will AI do it for us?
