DITCH THE JUNK FOOD DATA DIET

DITCH THE JUNK FOOD DATA DIET

Many organisations are at the point that they want to do a lot more with the data they generate, or that they can acquire from customers and suppliers. They might want to invest in industry ecosystems for mutual benefit, standardising and sharing with peers and partners. They may be looking to introduce automation with RPA tools, exploiting the possibilities of scale and efficiency that can be realised. Or they could be thinking of how they can train AI models to accelerate their growth or reduce their cost base. These are just some of the technologies that are tempting business leaders to invest right now.

Whilst the returns on investments in these areas can be phenomenal, the reality is that they require time, effort and crucially “good” data to work effectively. So, what is good data? The traditional view is that it has five key elements – accuracy, completeness, relevance, reliability, and timeliness. This is a very good list, but it is the relevance trait that I am considering today.

I am lucky enough to have worked in technology for quite a long time. Those who remember the days of big ERP, systems developed slowly, as monoliths, from enormous tomes detailing requirements and specifications, know why it took so long to change those systems. Data feeds were scarce, and outputs were likewise limited. If data went in, it went in via a keyboard operator or through primitive EDI. Changes to systems required slow, expensive feedback loops between “users” and “IT” and these groups rarely interacted in meaningful ways.

I realise that for some people who might read this, it sounds horrendous – you work in an agile, collaborative way deploying changes monthly, weekly, daily or even more often than that, to complex production systems. For others I am not describing the past, but something recognisable even now. But with agility comes temptation – today’s information technology landscape has the vast proliferation of data to deal with. We can ingest data from unlimited sources, internally generated and externally sourced. Where data processing and storage was once a bottleneck, it is now the least of our concerns. So we can be tempted to gather and keep data that adds to the sum of our knowledge in only the most marginal of ways. We are in danger of living on junk food data.

When you live on junk food data it’s cheap and easy, but we are in danger of piling on excess weight and taking what is convenient rather than what is most suitable, and sustainable. When we come to test ourselves with new applications that require quality data in order to produce good results, we can find that we are not in a fit state to work with them. 

At IDC we are seeing more use of data catalogues for data quality, governance and self-service access. Where these exist and are combined with thoughtful KPIs for things like the number of data terms, definitions and growth it is possible to provide a high-quality service to the organisation as well as being more properly informed of the sustainability of your estate – if you like, the nutritional value of your data. Aligned with this approach is a continuing effort to improve data literacy within the organisation – these go hand in hand, as you improve your data intelligence through better management you will improve data literacy, and vice-versa better data literacy contributes to more thoughtful data management.

Getting ready for the challenge of automation, artificial intelligence and advanced analytics and predictive tools means choosing your data sources carefully, trimming the fat where it exists, and maintaining as lean a data profile as is sensible. It might seem like a hard challenge now, but you will be thankful for it in future.

----------------------------------------------------------

I'm Chris Weston, and I'm part of the CIO Advisory team at IDC. We work with our clients across Europe to help them make the best decisions around their technology selection, strategy and delivery. Please contact me if you would like to join our invitation-only CIO peer connection network.



Freddie Quek

Digital & Technology Leader | Board Member | Researcher | Advisor | Mentor

4 年

Thanks Chris Weston. Important to be able to identify what good questions you want to ask, instead of thinking if you throw everything together somehow you will gain insights. It is like finding a needle in a dump/haystack, except that you don't even know that you are looking for one!

David J. Harding

Transformation Advisor | Cxx Effectiveness Expert

4 年

Interesting post Chris Weston , reminds me of a question I onced asked a client who had sunk £m into a #datalake project..when told about the amount of data that would be captured, I asked ‘why?’ ..to be told that it would enable them to develop ‘insights’..still a bit lost I asked ‘on what?’...I didn’t get a reply..my point? A more logical approach might have been to develop the questions (or hypotheses) that need answering BEFORE investing in data storage...that might help focus effective data investment decisions and reduce the chances of storing junk data..?

要查看或添加评论,请登录

Chris Weston的更多文章

  • Six Key Considerations around Agentic AI

    Six Key Considerations around Agentic AI

    Since ChatGPT was released to the world a little over two years ago, it's changed our perceptions of how our technology…

    11 条评论
  • Can Generative AI replace coders?

    Can Generative AI replace coders?

    "GenAI can write code, so we won't need developers any more". There have been many excitable takes on the potential of…

    1 条评论
  • In the shadow of AI, Quantum Computing is quietly progressing

    In the shadow of AI, Quantum Computing is quietly progressing

    Back in 2017, I wrote a LinkedIn article attempting to explain Quantum Computing for the general businessperson. At the…

    7 条评论
  • What Mark Twain has to teach us about AI

    What Mark Twain has to teach us about AI

    This morning I was listening to the excellent Mystery AI Hype Theater 3000 podcast, in which Emily Bender and Alex…

    3 条评论
  • Sustainable IT, one project at a time.

    Sustainable IT, one project at a time.

    Today, 22nd April 2024, is Earth Day. It’s 54 years since the first Earth Day, launched in the USA before that country…

    5 条评论
  • Sustainability today, not tomorrow.

    Sustainability today, not tomorrow.

    As someone who has been involved in carbon reduction and sustainability efforts throughout my career, it was a joy to…

    3 条评论
  • London Tech Week - Worth the visit?

    London Tech Week - Worth the visit?

    Earlier this month, Jumar participated in London Tech Week through our industry association techUK, which is a…

  • 3 reasons you might still have Azure Classic VMs in your estate

    3 reasons you might still have Azure Classic VMs in your estate

    Back in 2020, Microsoft announced that their classic Virtual Machine Service Manager would be switched off in September…

    1 条评论
  • Why 2023 will be the year of Tactical Automation in business

    Why 2023 will be the year of Tactical Automation in business

    Automation tools have seen a great deal of investment in the past few years with the bigger vendors leading the charge…

    4 条评论
  • Wow. Jumar and me, one month in.

    Wow. Jumar and me, one month in.

    Remarkably, it’s been a month since I joined Jumar as Chief Digital and Information Officer. Time goes very quickly…

    3 条评论

社区洞察

其他会员也浏览了