登录查看更多内容

?? Reading list: The top 5 must-read data blogs from 2022

Prukalpa ?

Co-Founder at Atlan –?Home for Data Teams | Forbes30 & Fortune40 lists | TED Speaker

发布日期: 2022年12月21日

Just like that, we’re at the end of 2022! And what a rollercoaster ride it has been with major changes and uncertainty across every industry. (Especially for the bird app users ??)

A lot happened in the world of the modern data stack this year. We talked about?job titles, thought about saying?goodbye to data science, debated?centralized vs. embedded data teams?and?bundling vs. unbundling, kickstarted important discussions like the?technical pay gap, and so much more.

Continuing our tradition from last year, we’re sharing the top blogs from 2022 along with some follow-up reading to keep you thinking. Happy reading!

P.S. Metadata Weekly is going on holiday next week. We’ll see you back here in 2023!

??? On data as a product

Data product in changing environments: rethinking and updating investments?by Eric Weber

“The last few years have been full of ‘here’s what we need to do next’ or ‘once we have this team, we can do this’. We plan how we’d support more personas and areas of the business with more investment, but we don’t think about what we’d do if we had to cut support. I get it. That doesn’t feel very comfortable. But just like succession planning for people, we need to have a plan for what we’d do in hard situations. In some cases, you might drop support for particular personas on a product. In others, you might drop support for a product altogether. It isn’t easy to say what the ‘right answer’ is. But spending time thinking about your answer is important.”

More follow-up reading:

Making data actionable: the immense challenge of good data products?by Eric Weber
What’s the big deal about data products??by Willem Koenders
Building more effective data teams using the JTBD framework?by Emilie Schario
Types of data products?by Luke Lin

????On working with data

Should we be grateful for the modern data stack??by Benn Stancil

“That’s?the paradox we need to solve. Why has data technology advanced so much further than value a data team provides? Does all of this new tooling actually hurt, by causing us to lose focus on the most important problems (e.g., the data in Salesforce) in favor of the shiny new things that don’t actually matter (e.g., the data in our twenty-fifth SaaS app)? Has the industry’s talent not caught up with the capacity of its tools, and we just need to be patient? Is the problem more?fundamental? I’m not sure. But if our 2032 selves want to be as grateful for 2020s as we should be for the 2010s, those are the next questions we need to answer.”

More follow-up reading:

How to design your data stack for curiosity?by Amit Prakash
Data management is context management?by Randy Au
Build or buy: how we developed a platform for A/B tests?by Olga Berezovsky
Data systems tend towards production?by Ian Macomber
Not all data requests are urgent, so start by asking these 5 questions?by Marie Lefevre

???On data contracts

The rise of data contracts?by Chad Sanderson

“Data Contracts are API-like agreements between Software Engineers who own services and Data Consumers that understand how the business works in order to generate well-modeled, high-quality, trusted, real-time data.

Instead of data teams passively accepting dumps of data from production systems that were never designed for the purpose of analytics or Machine Learning, Data Consumers can design contracts that reflect the semantic nature of the world composed of Entities, events, attributes, and the relationships between each object.

This abstraction allows Software Engineers to decouple their databases/services from analytical and ML-based requirements. Engineers no longer have to worry about causing production-breaking incidents when modifying their databases, and data teams can focus on describing the data they need instead of attempting to stitch the world together retroactively through SQL.”

More follow-up reading:

An engineer's guide to data contracts - pt. 1?by Chad Sanderson and Adrian Kreuziger
An engineer's guide to data contracts - pt. 2?by Chad Sanderson and Adrian Kreuziger
Why data contracts are obviously a good idea?by Yali Sassoon

领英推荐

5 Telltale Signs You Don't Understand Big Data

Bernard Marr 8 年前

Big Data: The All-Important 90/10 Rule

Bernard Marr 9 年前

5 common biases in big data

Naveen Joshi 7 年前

???On building and leading a data team

Growing data teams from reactive to influential?by Emily Thompson

“Data teams tend to be a fairly scrappy bunch, and often default to rolling up their sleeves and building what they need in order to get unblocked. But there is an opportunity here to start influencing roadmaps on?other?teams. Rather than filling in the technology gaps themselves with messy workarounds, my team’s charter also prescribed that they make technical recommendations to the teams we depended on.

Because the data team was now required to proactively drive the conversation, they made the time to work with partners and propose cross-functional solutions. Foundational work was considered part of the backlog of ‘impact-driving’ work, which led to specific quarterly goals, and progress was tracked just as every other initiative owned by the data team.”

More follow-up reading:

Good data citizenship doesn’t work?by Benn Stancil
Managing the first year?by Alex K Gold
How I learned to stop worrying and love being a manager?by Brittany Bennett
Executing a data strategy with OKRs?by Chris Brown
Dealing with difficult stakeholders?by Oscar Baruffa
Leaders show their work?by Ben Balter

BONUS:?We talked with four amazing data leaders —?Stephen Bailey?(Data Engineer at?Whatnot),?Erica Louie?(Head of Data at dbt labs),?Taylor Murphy?(Head of Data at Meltano), and?Gordon Wong?(Founder of?Wong Decision Intelligence; formerly Senior Leader of Business Intelligence at?Hubspot) — about what it takes to succeed in your first 365 days as a data leader.?Download the Secrets of a Modern Data Leader ebook here.

?? On metrics, data catalogs, active metadata, and more

People-first data stacks?by Ilan Man

“The problem is your stakeholders, while giving you the thumbs up the whole time and claiming they’d love an easier way to discover data, are no longer using the tools you’ve painstakingly researched and implemented. They fall into their old habits and inevitably you see an incorrectly defined metric on a Powerpoint slide somewhere.

We need to ensure stakeholders adopt data tools in the ways they should. Reading documentation and taking a training is not enough. We need to reinforce good data-tooling hygiene. I’ve seen many instances of folks starting out in a BI tool, and a few months later they’re back in Excel, pivoting a CSV and pasting it into a presentation. There should always be room for creative solutions and serendipity, but the Data team needs to keep an eye on how stakeholders use the tools they implement. Data models and BI tools need to adapt to business changes.”

More follow-up reading:

Data's trillion dollar question mark?by Benn Stancil
How to measure data quality?by Mikkel Dengs?e
The many layers of data lineage?by Borja Vazquez
The future of data catalogs?by Prukalpa Sankar (aka me!)

??Bonus picks ?

The important purple people outside the data team?by Mikkel Dengs?e
A framework for embedding decision intelligence into your organization?by Erik Balodis
AI is not coming for analyst jobs anytime soon?by Amit Prakash
Manifesto for the data-informed?by Julie Zhuo
Why are we still struggling to answer how many active customers we have??by Seattle Data Guy
Data teams: break out of your bubble?by Mary MacCarthy
The future history of data engineering?by Matt Arderne
Why it matters where you randomize users in A/B experiments?by Adam Stone

Special shoutout to everyone who shared their data experiences, learnings, views, and observations this year! Now’s the time to have more open conversations about what we want for the future of data, and we’re so thankful for all the data practitioners who give their time to share insights, spark debate, and keep our industry moving forward.

???Last week in Atlan: Supercharged automation for your data estate

In last week’s?Atlan Activate, our quarterly product webinar, we launched a ton of new automation features to superpower your data and reduce the manual work that slows data teams down. ICYMI, here are the five new features in Atlan you should know about:

Metadata Playbooks for rule-based actions: Like Zapier for data, this is the first low-code/no-code metadata automation for data teams.
Atlan + AWS EventBridge event-based actions: Create production-grade, event-driven automations for the world of metadata, such as alerts when ownership changes or auto-tagged classifications.
Profiling and Popularity Insights: Use new column-level profiling, popularity, and usage metrics to assess data’s quality, find the most widely used queries, identify top users, and more.
Atlan to GitHub integration: Bring metadata right to GitHub to minimize risk and increase transparency before any changes are made to your data.
Trident AI powered by GPT-3: Say goodbye to manual documentation with increasingly intelligent automated descriptions, business terms, READMEs, and more.

Learn more here???

Have any interesting articles that especially got you thinking? I’d love to read them! Just send them my way on?LinkedIn.

P.S. Liked reading this edition of the newsletter? I would love it if you could take a moment and share it with your friends on social.

Metadata Weekly

9,851 位关注者

Chris P.

Cyber Planner, US Cyber Command

2 年

Many good data lessons

brittany bennett

tech and data person

2 年

thanks for giving me a shout out (:

1 次回应

Oscar Baruffa

Data Professional

2 年

Thanks for including my article in your list, it's got some good company!

1 次回应

查看更多评论

要查看或添加评论，请登录

Prukalpa ?的更多文章

How to craft the ultimate business case for data governance - Part 2

2024年11月1日

How to craft the ultimate business case for data governance - Part 2

As a data leader, you’ve probably faced the challenge of keeping stakeholders on board with a data governance project…

5 条评论
How to craft the ultimate business case for data governance - Part 1

2024年9月12日

How to craft the ultimate business case for data governance - Part 1

Selling data governance can feel like an uphill battle. It’s a big investment that often gets turned down because the…

25 条评论
How companies are making Forrester’s idea of modern data cataloging a reality

2024年8月30日

How companies are making Forrester’s idea of modern data cataloging a reality

The unified control plane in action Last week, I explored a major shift in the data world — a transformation that…

2 条评论
What the recent Forrester Wave means for data catalogs

2024年8月14日

What the recent Forrester Wave means for data catalogs

A massive transformation — data cataloging now includes governance, quality, security, monitoring, and more Quick…

4 条评论
The War of the Catalogs

2024年8月2日

The War of the Catalogs

Databricks Unity Catalog, Snowflake Polaris, and the future of cataloging Apparently this summer is the “War of the…

13 条评论
3-step framework for scaling data quality in the age of generative AI

2024年7月18日

3-step framework for scaling data quality in the age of generative AI

Apply what we've learned from healthcare to data quality I’ve found that data quality isn’t really about cleanliness or…

4 条评论
4 practical lessons from data governance leaders at Dropbox, General Motors, and Patagonia

2024年5月30日

4 practical lessons from data governance leaders at Dropbox, General Motors, and Patagonia

I think anyone working in data today would agree that governance is tough. I talked recently about why it fails and my…

4 条评论
Why data governance fails in today’s AI world

2024年5月13日

Why data governance fails in today’s AI world

Welcome back to this cozy corner of the internet where I share my (meta ??) thoughts on everything metadata. You may…

3 条评论
A Shared Language for Enterprise Data ?

2023年8月4日

A Shared Language for Enterprise Data ?

It’s 1993 and you’ve just graduated from college. You’re going job fair to job fair, looking through alumni…

1 条评论
Modernizing Data Stack ?

2023年6月29日

Modernizing Data Stack ?

Austin Capital Bank, a fast-growing community bank, sought to modernize its data stack to support its evolution into a…

See all articles

?? Reading list: The top 5 must-read data blogs from 2022

Prukalpa ?

Co-Founder at Atlan –?Home for Data Teams | Forbes30 & Fortune40 lists | TED Speaker

??? On data as a product

????On working with data

???On data contracts

领英推荐

???On building and leading a data team

?? On metrics, data catalogs, active metadata, and more

??Bonus picks ?

???Last week in Atlan: Supercharged automation for your data estate

Metadata Weekly

9,851 位关注者

Prukalpa ?的更多文章

社区洞察

其他会员也浏览了

There is no single “single pane of glass” in the data world, WTF happened at Data Council, and more

"In God we trust. All others must bring data"

Data as a product - Data discovery

Data Nugget November 2023

Big Data – An Elephant in the Room?

A Practitioner's Guide to Data Strategy

Data Won. The History of Data Transformation

We’re talking Data. Data, means Data means Nothing... Talk Information

Have You Heard about SMALL Data?

Impact of Big Data: Be Relevant or Be Redundant - Ronald van Loon Interview

??? On data as a product

????On working with data

???On data contracts

领英推荐

???On building and leading a data team

?? On metrics, data catalogs, active metadata, and more

??Bonus picks ?

???Last week in Atlan: Supercharged automation for your data estate

Metadata Weekly

9,851 位关注者

Prukalpa ?的更多文章

How to craft the ultimate business case for data governance - Part 2

How to craft the ultimate business case for data governance - Part 1

How companies are making Forrester’s idea of modern data cataloging a reality

What the recent Forrester Wave means for data catalogs

The War of the Catalogs

3-step framework for scaling data quality in the age of generative AI

4 practical lessons from data governance leaders at Dropbox, General Motors, and Patagonia

Why data governance fails in today’s AI world

A Shared Language for Enterprise Data ?

Modernizing Data Stack ?

社区洞察

其他会员也浏览了

There is no single “single pane of glass” in the data world, WTF happened at Data Council, and more

"In God we trust. All others must bring data"

Data as a product - Data discovery

Data Nugget November 2023

Big Data – An Elephant in the Room?

A Practitioner's Guide to Data Strategy

Data Won. The History of Data Transformation

We’re talking Data. Data, means Data means Nothing... Talk Information

Have You Heard about SMALL Data?

Impact of Big Data: Be Relevant or Be Redundant - Ronald van Loon Interview