The ExcElephant in the Room

The ExcElephant in the Room

First of all apologies for the pun – it’s terrible.

Now it has been well documented in the news how through the (mis)use of Excel in their data processing solution Public Health England (PHE) underreported around 16,000 COVID-19 cases. PHE has taken a bit of a pounding in the press and been ripped to shreds on LinkedIn by data professionals.

No alt text provided for this image

The use of Excel as an enterprise reporting tool or within in a data ecosystem has been the bane of my working career since the day I started my first job back in 2008 and here are some of the reasons why:

  • I have seen lives lost to the upkeep of VBA macros written by someone who has disappeared into the ether without leaving documentation.
  • I have had to manually move multiple excel files to prep for my ETLs because “that’s the way we’ve always done it”.
  • I have seen critical reports with no auditability of changes, no data quality checks and no documentation/meta data of how the data is transformed within the 70-tab Excel rats’ nest.
  • These reports have contained assumptions or hard coded logic from years ago that has led to incorrect information being reported for who knows how long.
  • I have been on the receiving end of having no other choice but to accept csv/xls/xlsx as a data source for parts of my warehouse solutions that change shape more often than my stress ball when reading articles about Excel data clangers.
  • “We don’t need a data warehouse or BI solution, we use Excel” arrgghhhhh

The list could go on and on and I think most data professionals will have their own list, but Excel isn’t the problem here, the problem lies elsewhere:

  • In the lack of data literacy/data awareness of those who rely on Excel for much more than it was ever built for
  • With data professionals who know better but don’t challenge or speak up against its misuse/over reliance
  • With senior leaders who do not listen to those data professionals that do speak up.

Let’s be honest, if you have worked in the data field you will have used Excel for something you ideally wouldn’t have done the term ‘tactical solution’ springs to mind. I am guilty of this too, which is why you won’t catch me joining in with the PHE bashing.

We do this with the intention of putting a fully productionised solution in place once the dust has settled but as we know you don’t always get the time to go and fill the cracks you merely papered over during the delivery of your MVP. 

There are a number of high-profile excel based data ‘mishaps’ such as:

  • In 2003, TransAlta lost $24 million due to Excel copy-and-paste error
  • In 2005, Kodak suffered an $11 million severance error due to Excel typo
  • In 2008, Barclays Capital was forced to spend millions on worthless contracts due to Excel reformatting error
  • In 2010, MI5 bugged the wrong phones due to a spreadsheet formatting error
  • In 2011, AstraZeneca accidentally released confidential information due to an Excel templating error
  • In 2012, JP Morgan was hit with $6 billion trading loss due to Excel copy-and-paste error
  • In 2012, London Olympics were publicly embarrassed by a simple Excel typo
  • With PHE Track and Trace 2020 becoming the latest addition. 

I try and find the positive in these situations and to quote Sun Tzu, “in the midst of chaos, there is also opportunity”. So here is that opportunity.

These events become the aces up our sleeve …

When the time comes to replace that 'tactical solution', are you given the time and resource to do so? When you speak up against any future Excel based shortcuts in your data ecosystem, are you taken seriously? When you see departments and services that are in Excel reporting hell and you speak up and offer a solution that treats the data with the respect it deserves does the conversation go further than “how much will it cost?”

If the answer is no then pull that ace from out of your sleeve and slam it on the table for all to see.

Next time I challenge the business on the overreliance and misuse of Excel then I will be citing all of the above like a lawyer fighting for his client’s life. 

Keep fighting the good fight!

Andrew Mason

Data Leader || Head of Data and Analytics || Data Storytelling and Visualisation Specialist || @thatbidataguy

1 年

Neil McIvor one for you here Neil. ??

Richard Anthony

A senior Data, BI and Analytics professional

4 年

This is a great post, unfortunately as excel is free and at hand to most business users business leaders turn a blind eye untill something goes wrong, and it aways does. Governance and data literacy are much more important that any specific technology to prevent such mishaps.

Good article Andrew and from the replies you can see we all for most part align..I’ve seen 2 extreme examples of excel 1) Company used an old bi tool as a data dump (pseudo etl) running at weekends to dump into a folder massive csvs that were then linked in MS Access and of course subsequent excel pivots and then the weekly global management pack that was issued on the Wed after the 3 days of manipulation - the person who created this wonder was known globally as the goto in-house BI guy and enjoyed the title. Fair enough he did the best he could with excel and was by default Spartacus to the more senior in the global team. A BI Audit was completed, back to SQL, DB and “Governed Data”, after day 1 Audit 4 Million usd discrepancy identified and escalated to global Mgnt. Bye bye Spartacus! Despite the reasonably good job the person had done with what was at hand. = “Governed Data”? 2) Similar Spartacus scenario, but not an excel error, but for the first time in 5 years Spartacus took very sick, unreachable. No one else in the company knew how to follow the well documented process, a small piece of VBA was not documented! Excel Mgnt packs out of service for 2 weeks! = “Governed Data”? Excel = Not Governed Data = Risk ??

Dave Brown

Data Engineer / DBA

4 年

I think you'll find it's an unusual stance to take. I always remember you mentioning the "eradication of excel" comment at an interview a while ago. There are still too many higher level managers, with some level of budget responsibility, that gleefully jump on "use x to save £" and Excel, like the worlds easiest jigsaw puzzle, fits this mantra perfectly. There will be the mirror of this, where the use of Excel has saved companies millions. After 20+ years it's the one puzzle that seems no closer to being solved.

Scott Dawson

Adding value beyond recruitment

4 年

This was a great read! The pun had me shaking my head, though I laughed for way longer than I should have! ??

要查看或添加评论,请登录

Andrew Mason的更多文章

社区洞察

其他会员也浏览了