Big Data Mythbusters - Part 1

Big Data Mythbusters - Part 1

Sorry guys and girls, I’ve had it. Too many people in this field are confusing things and I need to clarify them for everyone’s sake (including my own). Here’s part 1 of 2 of my Big Data Mythbuster series.

 

Myth 1: “There is Small Data and Big Data”

Not really. There is just data, full stop. The difference lies within the business approach versus the technology approach. If you follow a technology-agnostic business strategy then even the smallest dataset has a “Big” component to it, and therefore merits a closer look in terms of business value. If that business value is substantial then the datasets could and should be expanded to make the mashed-up sample more meaningful.

All of that is data. So no, a seemingly small question (at least at first glance) has a whole load of implications, consequences and considerations, all of which are dependencies, associations and patterns. Even the smallest decision needs a complete dataset to support it. ‘Complete’ may mean a different size, depending on the question, but there is certainly no small, isolated data that exists in a vacuum.

In the business context you have social data, statistical data, more and more historical data, benchmarks, best practices and all the bits and pieces you and your data-universe are creating on a daily basis. Even if you are just a small number of shops selling pizzas, you can still benefit from looking at the bigger picture and drawing informed conclusions based on all the available information.

Myth: Busted

 

Myth 2: “It is easy to find Data Scientists”

Nope, it isn’t. Because you need Coders, who understand maths. You need Mathematicians who understand how to write code. You need business-minded people who understand both, and subject matter experts who understand data, coding, maths and business-minded people who have a grasp of all of the above. Then you can assemble the team which will remotely resemble your idea of a good Data Scientist. Ahh and you need a manager who has done that before, ideally…
These days everyone that has ever used a calculator thinks he is a Data Scientist, but it will be quite some time until these various disciplines come together and create the perfect hybrid – if ever.

One would expect a consultant to state that companies need consultants for that set of tasks and I strive to not disappoint. You need help from companies like ours. Not people flogging software, but actually people that have assembled teams to solve the skills-challenge described above. We’ve been building a few teams like these above and let me tell you: It is quite a journey!

Myth: Busted

 

Myth 3: “Cognitive is needed to do Big Data”

By cognitive I mean artificial intelligence (A.I.) and Machine Learning. These are separate fields that have never actually been applied in a business centric Big Data project as far as we know. There is talk, there is PowerPoint-ware and then there is real life. In real life, over 85% of all analytics challenges are simple statistical computations that can be done on a calculator and don’t even need an algorithm. (Free Buster-Bonus Algorithms are time intensive and difficult to develop – find a list of the common most ones here).

If you want to go all Watson than you need to massage data for a very long time before you can get meaningful insight resulting from cognitive technology. Don’t get me wrong, I love tools such as Watson as they stir the inner-geek in me, but in terms of real-life applications for the average company, it will be a while until technologies converge. Right now, A.I. needs data, but big data does not need A.I.

Myth: Busted

Next week I’ll release another 3 common big data myths to bust. For now, if any of these ring true and you’d like to discuss in more detail just get in touch.

Stay tuned.

Originally posted on www.fintricity.com 

Doctor Sekzi

Sexuality Research Author at DoctorSekzi.com

8 年

You are absolutely right about finding the right team. You also need a person who can translate between sales and marketing, the stakeholders, the actual clients and the coders.

回复
Steve Dumbrell

Domain Architect @ Transport for NSW | Data Architecture

8 年

Well done, succinct and pithy.

回复
Robert Taylor

Regenerative | Circular | Sustainable knowledge management

8 年

Yes. Very often some quite modest data sets and simple statistical analysis can give really good results provided you actually apply them. On the other hand, increasingly complex data analysis could be irrelevant if no decision or action is taken as a result. I've done all my performance improvement work with modest data and analysis - enough to be valid and useful - and always what matters more is the implementation, business change and benefits realisation.

Dave Clark

Re-thinking everything!

8 年

Really enjoyed this Florian thanks for taking the time to do this

Michael Gell

Multi-disciplinary innovation

8 年

An excellent article. There are many examples of words commonly being used to convey an aura of there being something special ... when underneath the verbal posturing there really isn't. Words such as 'big', 'strategic', 'deep' are in that group and usually are best dealt with a delete button. Of course, there are instances where the words do convey something special, something different, something that stands out and has significance. It is refreshing to come across those nowadays-too-rare instances; they speak for themselves.

回复

要查看或添加评论,请登录

Florian Krueger的更多文章

  • How to get Funding - Scale-Up

    How to get Funding - Scale-Up

    If you are in the market for growth capital, then you know that you are not the only one looking. Or if you don’t know…

    1 条评论
  • Where have all the ICO, STO & Advisors gone?

    Where have all the ICO, STO & Advisors gone?

    Why it is time for #1society The world has come to a screeching halt. Nothing is the way it used to be and the only…

  • Blockchain vs. Business?

    Blockchain vs. Business?

    Introduction In a space that makes the Wild West look like a kid’s playground, semantics are extremely important…

  • 3 Ideas - How to avoid costly Blockchain/DLT Experiments

    3 Ideas - How to avoid costly Blockchain/DLT Experiments

    This is more for business folks, but not only. If you are a technology leader you may want to support your business…

    8 条评论
  • (Data) Power to the People!

    (Data) Power to the People!

    “How can we earn more money with data? How can we make more sense of it and gain more insight” These were the questions…

    2 条评论
  • Big Data Mythbusters: Part 2

    Big Data Mythbusters: Part 2

    Our two-part Data series continues with 3 more big data myths. Myth 4: “Big Data is a technology challenge” Rubbish, it…

    1 条评论
  • Human 2 Data & Data 2 Human

    Human 2 Data & Data 2 Human

    The Big Data/Business Intelligence Conundrum needs to be solved before the IoT takes off Intro Chicken and Egg, or is…

    1 条评论
  • 10 Tips on "How to Spot a Capable Manager"

    10 Tips on "How to Spot a Capable Manager"

    Without wanting to brag, I've seen many managers over the past 25 years and there are a few commonalities that stand…

    4 条评论
  • The (un-)negotiable value of IT

    The (un-)negotiable value of IT

    Have a laugh. Have everyone around you define the value of IT in one sentence.

社区洞察