Am I a Data Analyst or a Data Scientist?
Image source: https://bit.ly/3Pnd1kZ

Am I a Data Analyst or a Data Scientist?

I should know this one, cold. What’s the difference between data analytics and data science? An acquaintance asked me that basic question recently and my response was half hazard and poorly articulated. I didn’t feel great about that. However, spending an hour googling for the distinction helped me realize I don’t have a good answer, in part, because most people don’t have a good answer. Blog A has a different definition than Blog B; Northeastern University has a different one than UVA; and Wikipedia has a different description from all of the above. Similarly, when I asked my LinkedIn community to differentiate this transportation forecasting problem, the majority was not overwhelming.

No alt text provided for this image

So, what is the difference between Data Analytics (Data Analyst, DA) & Data Science (Data Science, DS)? On one hand, you can say that it doesn’t matter and that they both solve problems using data. On the other hand, it matters tremendously if you are trying to figure out organizational design and, more importantly, how to recruit the right talent.

One thing is pretty clear; these groups are tasked with finding patterns in data. While they use a lot of the same tools and techniques (Python, R, regression, classification, clustering, dimensionality reduction, statistics, data visualization, etc.), they do it for different reasons and in different ways. Here are a few initial distinctions that come to mind:

  • DA’s answer questions by finding patterns in data. DS’s generate questions by finding patterns in data.?
  • DA’s thrive with speed. DS’s thrive with exploration.
  • DA’s make you more certain. DS’s make you less certain.
  • DA’s toolbox has breadth. DS’s toolbox has depth.
  • DA’s sit close to the decision makers. DS’s sit close to the DA’s.
  • DA’s describe while DS’s predict. DA’s & DS’s work together to prescribe.

All of these distinctions probably take a very different form depending upon A) the size of the company B) the industry and C) the time frame.

  • Size of company. Relatively larger companies probably separate these groups in an effort to realize economies of scale, but unfortunately this almost certainly increases coordination costs and reduces their effectiveness to prescribe solutions.
  • Industry. Digital companies have relatively more & cleaner data. This means the time needed to put data together is relatively less and they can probably spend more time on exploration & algorithms.
  • Time frame. Less mature data teams, I think, will have significantly less distinction between the analysts and scientists. Longer standing data organizations that have solved for rising coordination costs can have more distinction but maintain a healthy level in effectiveness to prescribe solutions.

What about the pay differential? On payscale.com, the average salaries reported show a >$30k pay difference ($62k versus $97k). On the surface, that’s big enough for me to accept that there is a gap (despite uncertainty in the figures). My experience tells me that the differential of this size is probably unwarranted, but exists because hiring teams make salary decisions about individuals rather than teams. If your goal is to find patterns in data and make decisions with that information, it feels like a great data scientist needs a great data analyst and vice versa.?

All of these features make classifying an individual as a data analyst or data scientist a rather complex, but not intractable, task. Similarly, it makes defining the difference between Data Analytics and Data Science harder than you might think on the surface. I’m feeling better about my response to “what is the difference between Data Analytics and Data Science?” At the same time, I think, write, and publish this publicly so I can upgrade my perspective. Where would you put suggestions if you were editing this article?

Adrian Lindsey

Data Science Professional with a decade of experience building creative Statistical and Machine Learning solutions to address business problems.

2 年

Both titles house such widely varied groups. I’ve known both DAs and DSs who were doing the “quick” exploration and modeling work needed by business stakeholders, and I’ve known others who sat on engineering and IT teams creating and deploying optimization and predictive models. I haven’t known any DSs who worked almost exclusively in excel building reports and answering ad hoc data requests which tends to be the majority of DA job descriptions I’ve read. While there’s a dearth for DAs curating ML algorithms or recommendation systems, I still know of some. This makes me wonder if DS isn’t a subset of DA and not the other way around…they certainly aren’t mutually exclusive. I tend to think of DS roles as more technically and academically demanding, but I recognize this isn’t always the case. However, this is partially why I market myself as a DS despite having only worked as a DA.

Chad Meley

Executive & Professor | Harnessing AI to Fuel Growth and Innovation

2 年

I like point you make around, “DA’s thrive with speed while DS’s thrive with exploration.” In my experience, Data Scientists are tasked with working upstream in the data pipeline, whereas Data Analysts tap into data sets that are curated or semi curated. Data Scientists are coders, be it for data wrangling or analytics using Python, TensorFlow, etc. That skill would explain part of the compensation variance.

回复
Jack Cox

Sr. Vice President @ FBN |Supply Chain, Logistics, Customer Experience | Amazon, Wayfair, Target, Silicon Valley Tech Startup

2 年

I think one of the biggest drivers in payscale is the evolution of what an analysis means, especially among legacy "analog" companies. For many analysts are leveraged as entry level jobs on a career path towards other business functions. You describe the tool kit of a DA and DS being similar but in these cases analyst jobs likely don't have a highly technical tool kit beyond Excel.

要查看或添加评论,请登录

Frank Corrigan的更多文章

  • A Better Model? Or a Smarter Approach to Problem Solving?

    A Better Model? Or a Smarter Approach to Problem Solving?

    I often get lost when reading the mathematical formulations in research papers. Sometimes I quit on the spot entirely.

    4 条评论
  • How to think about ‘prolific’

    How to think about ‘prolific’

    Prolific, by definition, is about producing much or being present in large numbers. When used as an adjective to…

    2 条评论
  • Modern NLP might give us new teamwork metrics

    Modern NLP might give us new teamwork metrics

    In basketball, an assist is attributed to a player who passes the ball to a teammate in a way that leads directly to a…

    1 条评论
  • Accelerating Through the Funnel

    Accelerating Through the Funnel

    Who is the “most successful?” This is a hypothesis; it’s the people that are able to accelerate through the funnel the…

    2 条评论
  • Convincing Your Colleagues: How Data Professionals Can Make a Powerful Impact

    Convincing Your Colleagues: How Data Professionals Can Make a Powerful Impact

    As a data professional (data analyst, data scientist, data engineer), who is responsible for convincing your peers and…

    3 条评论
  • In order to be a great data science leader, I need to be a great data scientist.

    In order to be a great data science leader, I need to be a great data scientist.

    In order to be a great data science leader, I need to be a great data scientist. To whatever degree this is true, I…

    5 条评论
  • Coincidence of Attention

    Coincidence of Attention

    Sometimes, people praise the barter system; a system of exchange in which participants in a transaction directly…

    2 条评论
  • This is a Straw Man

    This is a Straw Man

    The straw man mental model is one I’ve always struggled to grasp. When you Google search, there are two definitions…

    2 条评论
  • Are you an interesting person?

    Are you an interesting person?

    “I’ll probably lose another million dollars or $10 million or something. But directionally, I think that sort of thing…

    2 条评论
  • We don't have context for asking better questions. And therefore we don't.

    We don't have context for asking better questions. And therefore we don't.

    David Turetsky, during his talk at WorldAtWork 2022, said “We don't have context for asking better questions. And…

    2 条评论

社区洞察

其他会员也浏览了