
I am now a 7(8?) months old data engineer part 2

Ivanna Jurkiv Ditlevsen

Senior Data Engineer | Novo Nordisk Engineering

Published: June 28, 2023

I recently began a series of posts on my learnings in the role of a data engineer.

The focus of my first post was on environments (deployment environments, that is: dev, test, prod).

It's quite a life-changer to go from working in a single environment to suddenly developing in dev, testing in UAT and releasing production-ready features to the production environment.

One more realization that I had is on the topic of data quality.

Here is to number 2 of my data engineer learnings.

Data Quality Chase

One of my biggest motivators for shifting into data engineering was the impact I would have on the quality of analytics data.

Back in the day when I was a data analyst, I was constantly frustrated by inconsistencies I would find in the tables that I used when building reports and dashboards and when doing analyses.

Sometimes data was outdated, and sometimes it was simply wrong and not consistent with the source system.

My thinking was: let me get into data engineering so that I can learn to build robust pipelines and deliver good-quality data for data analysts.

To my surprise however, it takes much more than a motivated data engineer, or a team of data engineers for that matter, to have some sort of impact on data quality.

Data is ingested from somewhere (a source system). So if the data is already of low quality there, data engineers have little to no hope.

Surely one can always build some advanced set of data quality checks and rules to correct bad data upon ingestion into a data lake, or when performing cleaning and transformations.

Although isn't that a bit too much to ask of a data engineer?

(I think it is. But that's a topic for another post)
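
To make "data quality checks and rules" concrete, here is a minimal sketch of what rule-based checks at ingestion could look like. It is an illustration only, not the pipeline described in this post: the record fields (`order_id`, `amount`, `updated_on`), the function names and the 7-day freshness threshold are all assumptions made up for the example.

```python
from datetime import date


def check_row(row, today, max_age_days=7):
    """Return the rules this row violates (an empty list means it is clean)."""
    problems = []
    # Hypothetical rule 1: every record needs a non-empty identifier.
    if not row.get("order_id"):
        problems.append("missing order_id")
    # Hypothetical rule 2: amounts must be numeric and non-negative.
    amount = row.get("amount")
    if not isinstance(amount, (int, float)) or amount < 0:
        problems.append("invalid amount")
    # Hypothetical rule 3: records older than max_age_days count as outdated.
    updated = row.get("updated_on")
    if updated is None or (today - updated).days > max_age_days:
        problems.append("stale or missing updated_on")
    return problems


def split_on_ingestion(rows, today):
    """Separate clean rows from rows that break at least one rule."""
    clean, rejected = [], []
    for row in rows:
        problems = check_row(row, today)
        if problems:
            # Keep the original values and attach the reasons for rejection.
            rejected.append({**row, "problems": problems})
        else:
            clean.append(row)
    return clean, rejected


rows = [
    {"order_id": "A1", "amount": 10.0, "updated_on": date(2023, 6, 27)},
    {"order_id": "", "amount": -5, "updated_on": date(2023, 1, 1)},
]
clean, rejected = split_on_ingestion(rows, today=date(2023, 6, 28))
```

Even in this toy form, every rule encodes a business decision (what counts as stale, which fields are mandatory), which is part of why maintaining such rules is a lot to put on a data engineer alone.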

What now, you might ask? How do I live knowing that even in a data engineer role I have so little impact on data quality?

I live wonderfully, actually, and that is because while I cannot magically correct bad data coming from source systems, I can expose that data to analysts and business users.

This makes data quality problems visible.

The data analysts consuming data to build their analytics solutions, as well as the business users consuming those solutions (reports and dashboards), become aware of and are impacted by poor-quality data. It then becomes in their interest to address and fix bad data at the source.
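
One way to picture "making problems visible" is a small quality summary delivered next to the data, leaving the rows themselves untouched. The sketch below is hypothetical: the `quality_summary` function and the `customer` and `country` fields are invented for illustration and are not taken from any real pipeline.

```python
from collections import Counter


def quality_summary(rows, required_fields):
    """Count missing values per required field without altering the rows."""
    missing = Counter()
    for row in rows:
        for field in required_fields:
            # Treat both None and the empty string as "missing".
            if row.get(field) in (None, ""):
                missing[field] += 1
    total = len(rows)
    return {
        field: {
            "missing": missing[field],
            "share": missing[field] / total if total else 0.0,
        }
        for field in required_fields
    }


rows = [
    {"customer": "Acme", "country": ""},
    {"customer": None, "country": "DK"},
    {"customer": "Beta", "country": "DK"},
    {"customer": "Gamma", "country": None},
]
summary = quality_summary(rows, ["customer", "country"])
```

A summary like this could be published as its own table or shown on a dashboard, so that the people who own the source system see the gaps rather than having them silently patched downstream.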

One could actually say that I am having some sort of indirect and long-term impact on data quality.

I wonder:

Can anyone relate to my motivation to get into data engineering for data quality?
What about the dilemma of fixing bad data upon ingestion or delivery to data analysts?

And a random question:

What does the picture in this article symbolize?

Bogdan Cioată

Platform Engineer at INTELLISHORE

1 year ago

The rust spreads slowly but surely, until it's too late, one can say :) But don't lose hope, there are tools at hand that can help with such issues. Have a look at dbt, I'm sure it will tackle all the concerns and issues you raised in this post. Some good (and motivational) starting points:

* https://www.getdbt.com/blog/data-quality-dimensions/
* https://www.getdbt.com/blog/data-quality-framework/

Have fun and the time will fly :)
