Going from 0-1 in Data Operations
Imagine you are starting a new venture and need to describe all the data tasks that need to happen to get you from “nothing” to “something” in Data Operations.
These are the basic building blocks for understanding the work we typically do in a Data Ops team, and a good reminder for organizing the ongoing work and function of data in an early-stage company.
Let’s start by stating that Data Operations is not rocket science. It is a structured way of working with data to meet the everyday needs of the business and provide a framework for asking and answering data questions.
Here’s a list of the systems you’ll want to build or identify to go from zero to one in Data Operations.
Eventing and Hooks and Workflow, oh my!
Some of the most important data you want to know about signals a change that needs attention. For example, when a customer signs up for a new account, numerous systems need to be updated, starting with the customer’s status. To do this, you need a system that sends information to a specific back-end URL via an API call: a webhook. By providing a specific hook for that event, you can trigger other systems in near real time.
Think of eventing as the part of the system that lets other software know when “something important” takes place. It requires a listener that is ready to receive information, a payload of expected information, and a series of steps in a workflow that get executed when the payload is received. Whether you are running this on a schedule or just in time, a tool like Pipedream helps you respond creatively.
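To make this concrete, here is a minimal sketch of a webhook listener in Python using Flask. The endpoint path and payload fields are hypothetical, and a real deployment would enqueue the downstream work rather than handle it inline:

```python
# A minimal webhook listener sketch; the endpoint path and payload
# fields (account_id) are illustrative assumptions.
from flask import Flask, request, jsonify

app = Flask(__name__)

@app.route("/hooks/account-created", methods=["POST"])
def account_created():
    payload = request.get_json(force=True)  # the expected payload from the sender
    # Basic validation: make sure the fields our workflow depends on are present.
    if not payload or "account_id" not in payload:
        return jsonify({"error": "missing account_id"}), 400

    # Kick off the downstream workflow steps (CRM update, warehouse insert, etc.).
    # In practice this would enqueue a job rather than do the work inline.
    print(f"New account signed up: {payload['account_id']}")
    return jsonify({"status": "received"}), 200

if __name__ == "__main__":
    app.run(port=8000)
```

A tool like Pipedream hosts this kind of listener for you, so you can focus on configuring the workflow steps instead of running your own server.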
A place to store that information
Changing this customer data (or inserting a record when they are brand new) implies that you have a place to store information separate from your operational database for your application. Whether you are on Team Database, Team Data Lake, or Team Data Warehouse, you need to store transactional data, rolled-up data, and transformed data to share with other applications in your system or visualize in a reporting layer.
Snowflake is a great option for this, and by no means the only one. You might pick it over BigQuery or Postgres because it scales nicely and combines the concepts of a database and a warehouse. (If you have a lot of data – meaning billions or trillions of rows – you probably want to spend a bit more time on your infrastructure, but this guide is intended for the “get started” crowd.)
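As a hedged sketch of the storage step, here is how a raw event row might land in Snowflake using the snowflake-connector-python package; the table, columns, and credentials are placeholders, not a prescribed schema:

```python
# Landing a raw event in Snowflake; raw_events and its columns are
# hypothetical, and the connection values are placeholders.
import snowflake.connector

conn = snowflake.connector.connect(
    user="DATA_OPS",            # placeholder credentials
    password="...",
    account="your_account_id",
    warehouse="COMPUTE_WH",
    database="ANALYTICS",
    schema="RAW",
)

def record_event(event_type: str, account_id: str, payload: str) -> None:
    """Insert one raw event; downstream models roll these up later."""
    cur = conn.cursor()
    try:
        cur.execute(
            "INSERT INTO raw_events (event_type, account_id, payload, received_at) "
            "VALUES (%s, %s, %s, CURRENT_TIMESTAMP())",
            (event_type, account_id, payload),
        )
    finally:
        cur.close()

record_event("account.created", "acct_123", '{"plan": "starter"}')
```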
Transforming Data into Models
Operationally speaking, we often talk about “models” to describe the information in the system. A model is the shape we expect data to take for a particular record in a table, including which fields to bring together. We use one or more queries to produce or assemble the fields for the model, using systems like dbt or another data pipeline tool.
Whether you use dbt or another solution, the goal is to take the raw material (transactional data, attributes in tables, time-series data) and assemble it into a model that standardizes the representation of information about that thing.
An account model might tell you basic information like the name of a company and its canonical ID value. It might also show you the number of logins in the last 48 hours or the status of that company so that you can make business decisions on that information without having to run multiple other queries.
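Here is a sketch of the query that might sit behind such an account model. The accounts and login_events tables are assumptions for illustration; in dbt, the SELECT would live in a model file rather than in Python:

```python
# A sketch of an "account model" query; table and column names
# (accounts, login_events) are hypothetical. In dbt, this SELECT would
# live in a model file such as models/dim_accounts.sql.
import snowflake.connector

conn = snowflake.connector.connect(
    user="DATA_OPS", password="...", account="your_account_id",
    warehouse="COMPUTE_WH", database="ANALYTICS", schema="MARTS",
)

ACCOUNT_MODEL_SQL = """
SELECT
    a.account_id,                                  -- canonical company ID
    a.company_name,
    a.status,                                      -- e.g. trial, active, churned
    COUNT_IF(l.logged_in_at >= DATEADD('hour', -48, CURRENT_TIMESTAMP()))
        AS logins_last_48h                         -- rollup for quick decisions
FROM accounts a
LEFT JOIN login_events l ON l.account_id = a.account_id
GROUP BY a.account_id, a.company_name, a.status
"""

cur = conn.cursor()
try:
    for row in cur.execute(ACCOUNT_MODEL_SQL):
        print(row)
finally:
    cur.close()
```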
Sourcing and Sending Information
What about the raw material that we need to populate our data warehouse? It’s going to come from sources – the ETL (extract, transform, and load) process starts with copying data from line-of-business systems like Salesforce and Zendesk.
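As an illustration of the extract step, here is a minimal sketch using the simple-salesforce package to copy account records out of Salesforce; the credentials are placeholders, and in practice a managed ETL tool usually handles this:

```python
# Extracting account records from Salesforce with simple-salesforce;
# credentials are placeholders. A production pipeline would typically
# use a managed ETL tool rather than a hand-rolled script.
from simple_salesforce import Salesforce

sf = Salesforce(
    username="ops@example.com",
    password="...",
    security_token="...",
)

# SOQL query: pull the raw fields we want to land in the warehouse.
results = sf.query("SELECT Id, Name, Type FROM Account")
for record in results["records"]:
    print(record["Id"], record["Name"])
```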
You’ll also want to send important events and transformed data to some of these same systems, for example when you have workflows in your marketing automation or CRM tools that depend on changes in operational data.
When customers upgrade their service, they may move into a different marketing or sales segment, so your customer data platform or your CRM needs to receive this broadcast. We commonly call this feature “Reverse ETL” because it takes data from the warehouse and sends it to the systems that need to know that information.
Keep in mind that the reverse ETL process also serves as an eventing loop, sending messages to collaboration systems like Slack or email, or kicking off the workflow glue we described earlier.
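Here is a hedged sketch of that reverse ETL loop in Python: it pushes changed account records to a hypothetical CRM endpoint and announces upgrades in Slack. The CRM URL is invented for illustration; the Slack incoming-webhook payload format is standard:

```python
# A reverse ETL sketch: push changed accounts to a CRM and notify Slack.
# CRM_CONTACT_URL is a hypothetical API; the Slack webhook URL is a placeholder.
import requests

SLACK_WEBHOOK_URL = "https://hooks.slack.com/services/..."  # placeholder
CRM_CONTACT_URL = "https://crm.example.com/api/contacts"    # hypothetical API

def broadcast_changes(changed_accounts):
    for account in changed_accounts:
        # Update the CRM record so sales and marketing segments stay current.
        requests.post(CRM_CONTACT_URL, json=account, timeout=10)

        # Treat the sync as an eventing loop: announce upgrades to the team.
        if account.get("status") == "upgraded":
            requests.post(
                SLACK_WEBHOOK_URL,
                json={"text": f"{account['company_name']} upgraded their plan"},
                timeout=10,
            )

broadcast_changes([
    {"account_id": "acct_123", "company_name": "Acme", "status": "upgraded"},
])
```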
Asking and answering business questions
Now that you have a modeled set of data in your database and know that it’s getting updated on a schedule and at important events, it’s time to visualize that data to enable other teams in your business.
Start by making a list of key business metrics. If you’re not sure where to start, here are some examples.
The goal here is to build dashboards in a tool like Sigma to provide daily value, be updated on a schedule, and highlight significant events like a customer addition or a customer churn. If you’re tracking when leads fail to become qualified, then you can analyze those cohorts and find out why.
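As one example of a metric that could back a dashboard tile, here is a sketch of a weekly cohort query for leads that never became qualified; the leads table and its columns are assumptions:

```python
# One metric behind a dashboard tile: weekly lead cohorts and how many
# never became qualified. The leads table is hypothetical; a BI tool
# like Sigma would chart the result on a schedule.
UNQUALIFIED_LEADS_SQL = """
SELECT
    DATE_TRUNC('week', created_at) AS cohort_week,
    COUNT(*)                       AS leads_created,
    COUNT_IF(qualified_at IS NULL) AS never_qualified
FROM leads
GROUP BY cohort_week
ORDER BY cohort_week
"""
```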
How do all of these pieces work together?
Before a Data Operations system is in place, you will find some of the data you need scattered across some of your systems. Much of it will be immediately stale, making it hard for you to enable team members outside your team who are working in their own operational systems.
After a Data Operations strategy is in place, imagine this scenario: a customer upgrades their service, an event fires a webhook, the account model in the warehouse is refreshed, and reverse ETL broadcasts the change to your CRM, marketing, and support tools.
The beauty of this process is that every operational system now has the potential to get updates on what’s happening to the customer. And that’s the big picture: engaging with customers works much better when there is an updated customer record showing what’s going on. Data Operations helps make that happen.
What’s the takeaway?
Building a Data Operations practice involves tools to move information from operational sources through a data warehouse and out to destinations, but the real benefit of this work is to broadcast what’s going on with the customer. By focusing on the customer, we make it easier for teams to respond accurately, effectively, and quickly. And for the business, we enable the ability to pose and answer important questions using data.