Why So Many ‘Fake’ Data Scientists?

Why So Many ‘Fake’ Data Scientists?

Have you noticed how many people are suddenly calling themselves data scientists? Your neighbour, that gal you met at a cocktail party — even your accountant has had his business cards changed!

There are so many people out there that suddenly call themselves ‘data scientists’ because it is the latest fad. The Harvard Business Review even called it the sexiest job of the 21st century! But in fact, many calling themselves data scientists are lacking the full skill set I would expect were I in charge of hiring a data scientist. 

What I see is many business analysts that haven’t even got any understanding of big data technology or programming languages call themselves data scientists. Then there are programmers from the IT function who understand programming but lack the business skills, analytics skills or creativity needed to be a true data scientist.

Part of the problem here is simple supply and demand economics: There simply aren’t enough true data scientists out there to fill the need, and so less qualified (or not qualified at all!) candidates make it into the ranks.

Second is that the role of a data scientist is often ill-defined within the field and even within a single company.  People throw the term around to mean everything from a data engineer (the person responsible for creating the software “plumbing” that collects and stores the data) to statisticians who merely crunch the numbers.

A true data scientist is so much more. In my experience, a data scientist is:

  • multidisciplinary. I have seen many companies try to narrow their recruiting by searching for only candidates who have a Phd in mathematics, but in truth, a good data scientist could come from a variety of backgrounds — and may not necessarily have an advanced degree in any of them.
  • business savvy.  If a candidate does not have much business experience, the company must compensate by pairing him or her with someone who does.
  • analytical. A good data scientist must be naturally analytical and have a strong ability to spot patterns.
  • good at visual communications. Anyone can make a chart or graph; it takes someone who understands visual communications to create a representation of data that tells the story the audience needs to hear.
  • versed in computer science. Professionals who are familiar with Hadoop, Java, Python, etc. are in high demand. If your candidate is not expert in these tools, he or she should be paired with a data engineer who is.
  • creative. Creativity is vital for a data scientist, who needs to be able to look beyond a particular set of numbers, beyond even the company’s data sets to discover answers to questions — and perhaps even pose new questions.
  • able to add significant value to data. If someone only presents the data, he or she is a statistician, not a data scientist. Data scientists offer great additional value over data through insights and analysis.
  • a storyteller. In the end, data is useless without context. It is the data scientist’s job to provide that context, to tell a story with the data that provides value to the company.

If you can find a candidate with all of these traits — or most of them with the ability and desire to grow — then you’ve found someone who can deliver incredible value to your company, your systems, and your field.

But skimp on any of these traits, and you run the risk of hiring an imposter, someone just hoping to ride the data sciences bubble until it bursts.

What would you add to this list? I’d love to hear your thoughts in the comments below.

Thank you for reading my post. Here at LinkedIn and at Forbes I regularly write about management, technology and the mega-trend that is Big Data. If you would like to read my regular posts then please click 'Follow' and feel free to also connect via TwitterFacebook and The Advanced Performance Institute.

Here are some other posts from my Data Science Central column:

About : Bernard Marr is a globally recognized expert in big data, analytics and enterprise performance. He helps companies improve decision-making and performance using data. His new book is Data: Using Smart Big Data, Analytics and Metrics To Make Better Decisions and Improve PerformanceYou can read a free sample chapter here.

 

Dung Huan

Non-Intrusive Knowledge

4 年

The first link "What is big data" does not work. Try this one: https://www.datasciencecentral.com/profiles/blogs/what-is-big-data-infographics-by-bernard-marr

回复
Nicolas Gakrelidz

Make an Impact with our customers to get value from Data and AI ! Authorized to deliver the BPIFrance Diag Data and AI

8 年

As people and especially managers are confused about datascience there is a large part of jobdesc and people with a "datascientist" position. This is exactly the same with the BigData term. The most important is define exact definition and especially during events where you can find managers. There is exactly 4 profiles in Analytics (no order of value and no opposition = just complementarity !!!) : datascientists, dataminers, statisticians, data analyst (or business analyst).

回复
Cesar Pati?o ???? ????

Data & Analytics Executive, helping companies and people to cross the chasm of Digital Transformation and Innovation

9 年

Bernard Marr great article and I totally agree with you. Actually, a few weeks ago, I did some research on Data Scientists on Linkedin and I realized several ETL programmers suddenly became Data Scientists. Sometimes you’ve used the expression "pairing him or her with someone" and in my point of view, the real value of using data and analytics can only be achieved through a team effort as if it were an orchestra, because it’s impossible for a single person understanding all tools and techniques (Hadoop framework, ETL, statistical, visualization, business, creativity) just as it is impossible a single musician playing well all the instruments. So, the “Data Scientist” should have enough business knowledge and technical experience, in order to act as the Maestro, properly defining the problem, orchestrating the various experts and analyzing the effectiveness of the final solution and the business results they achieve.

Pranav Verma

Co-Founder @ Acuro AI | Founder & CEO @ Busigence

9 年

Very true.

回复
Tom Maiaroto

CTO at Clearstory

9 年

Wouldn't "DevOps" still be #1 there then?

回复

要查看或添加评论,请登录

社区洞察

其他会员也浏览了