The Future of the Data Scientist's Role

The Future of the Data Scientist's Role

Data scientists are the new "hotness"; everybody wants to be one and every company is trying to hire one. The barrier to entry is big, but once you are "in", you have joined an elite group of professionals that are in high demand. But will there be a time when data science is so easy and common that everyone is doing it? Will it go through the same transformation that web design went through, going from a highly skilled, in-demand job to something that your grandma did last week? Or will it separate itself by employing the scientific method as a foundation?

First, data science has been around for decades and some of the hottest topics like deep learning have been around 20+ years in papers and research. That said, the boom in data science and the speed at which new tools have been released is enough to take a step back and evaluate the field. So what is a data scientist? Is it that bearded guy with a PhD from CMU that talks about algorithms and draws formulas on whiteboards like an artist? Is it the woman in the office that talks about technologies that you've never heard before like Tensorflow, CNTK, scipy, numpy, SVMs, etc., etc.?

I say that a data scientist is someone that knows they can solve certain problems with data, knows 80% of the math and 80% of the coding to solve those problems, knows the various tools that are available, and how to use some of those tools to solve those problems. Why so vague of a definition? I could take any decent software developer, show them how to use Tensorflow APIs, give them a curated dataset, and they could get results. That person is not a data scientist, even though they could do that over and over again. I consider that person a data engineer, someone that takes the work of a data scientist and makes it reusable, production ready. Could the job of a data engineer disappear in the future? Possibly! As new tools, faster & cheaper resources are made available in the cloud, I can definitely see a future where data engineering is automated.

So what about the data scientist, will we automate that job with new tools? Make no mistake new tools are coming out regularly that make the task of doing data science easier and faster. Nobody writes the code to do a linear regression from scratch, they use a prebuilt function in Python, R, or even Excel. Even though it is easier and faster to perform a linear regression, that doesn't mean it is completely automated. A human still has to understand the data, understand that a linear regression is the correct model to build, and understand the results. And in most cases, a lot of data cleansing and feature engineering has to be performed before they can actually get meaningful results. Can those tasks be automated or made so simple that an untrained human could do them? Possibly! Teams are working on AI right now to automate data cleansing, fundamentally neural networks and deep learning automate feature engineering, and there are tools available to help me figure out what and if I chose the correct model.

There is no doubt that we are on the precipice of a revolution in machine learning. Tools have made it easier for data analyst to "do data science" and more are being created every day. I have no doubt that my kids kids will learn data science in middle school. And while tools will make the job faster and building models easier, the human element will never go away. The barrier to entry will become smaller and more approachable, it will still require a scientific mind to be a data scientist. And that is what differentiates data science from other "computer jobs", where those jobs are creative in nature like web design or building apps and we can create tools to make the creative process easier, data science has more in common with physics or chemistry than with software development.

Now to the original question, will there be a time when data science is so easy and common that everyone is doing it? No. There will be more data scientists, performing various degrees of work, but they will remain in demand because of their scientific approach to data. Some of the work that data scientists do now will be done by data analysts because it will be easier and more approachable. However you will still have the demand for that scientific mind in every company.

Agree/disagree, please comment.

Fabrizio Spademan

Change and Transformation Leader

7 年

An interesting take. I suspect (and quietly hope) that interpreting the results will remain beyond the reach of computers for at least a little bit longer. Big Data will certainly become more commonplace in primary education, and I hope that along with it we are able to maintain the 'human' element that makes interpreting data such an art.

Steven Alvarez

Business Administrator

7 年

Nice post, it increases my interest in data science and analysis :)

Tammie Rimon (Smart)

Mortgage Broker | Home Loan Broker | Commercial Loans | Business Loans | Car Finance | Equipment Finance

7 年

Nice insight, thanks for sharing.

Frank Amato

Head of Data Science @ Federated Hermes

7 年

Well said Phil. Could not agree more.

要查看或添加评论,请登录

Philip Coachman的更多文章

  • Lessons from a Year of Running

    Lessons from a Year of Running

    Last December I set a goal for 2024, to run one race per month in order to stay motivated to run and focus on being…

    9 条评论
  • From Computer Repairman to Data Scientist

    From Computer Repairman to Data Scientist

    Having just turned 40 and mentoring several young professionals, I started to reflect on my career. Having a…

    8 条评论
  • Why We Need MLOps

    Why We Need MLOps

    I'm going to start by saying I hate the term MLOps. Applying DevOps to a new practice area doesn't require renaming the…

    2 条评论
  • Change The World Mentality

    Change The World Mentality

    So I've been thinking about this and going through iterations for several years. I wanted to make sure if I wrote this…

    5 条评论
  • Multi-Data Lake Architecture

    Multi-Data Lake Architecture

    Based on recent client conversations, I've been forced to think about multi-data lake architectures and how they work…

    1 条评论
  • Contract Your Way to a Better Career

    Contract Your Way to a Better Career

    I'm going to start by saying that I don't work for a contracting company, I'm not a recruiter, nor have I ever been. I…

    2 条评论
  • What Does a Data Scientist Do

    What Does a Data Scientist Do

    I've recently given some talks on a data scientists role in an organization. While every company is different, I think…

  • Building A Data Driven Culture

    Building A Data Driven Culture

    I've had the unique opportunity to work for a very large corporation while it built a data driven culture. I was there…

    1 条评论

社区洞察

其他会员也浏览了