In search of data quality

In search of data quality

Everybody wants data quality, but experience tells us that data quality can be difficult to define and even more difficult to achieve.

In the mid-1990s I joined the newly-formed HESA in a role focussed on improving data quality. HESA had just completed its first annual data collections and despite the triumph of standing up a new organisation, data specification and collection system in less than two years, the Statutory Customers – on whose patronage HESA depended – were not happy with the data quality.

The journey to data quality is neither quick nor simple. The first stage is all about listening and understanding to establish exactly what stakeholders needed from the data and therefore what quality means to them. This stage can really surface the extent to which data quality is a slippery concept – difficult to define and often varying significantly between different types of stakeholder.

The second stage is to think about how quality can be assessed – or perhaps how poor quality can be identified and trapped. At HESA we built a multi-layered approach because data can fail in many different ways. The tests that we apply to data need to address these different failure modes and building the machinery and processes to undertake these tests can be an epic architectural endeavour.

The final stage is about bringing this whole thing to life. The world evolves; our data needs evolve and so the definition and assessment of quality needs to evolve. This needs to take place within the machine – learning from previous failures – and by looking outwards to the world that the data describes and the uses to which the data are put.

Data quality is not an absolute, nor is it a destination; it is an on-going process that continually reinvents itself in order to achieve and maintain relevance in this ever-changing world.

Sometimes data quality is an issue of hard logic...and other times, one instinctively knows when something is right...


Gideon White

Strategic planning | Operations excellence | Programme assurance | Continuous improvement

10 个月

Good ol' Jeeves! Gosh that takes me back a bit!!

Richard Harrison

Higher education strategy, policy & practice

10 个月

And that’s not only true for data quality; lots of parallels with academic quality.

要查看或添加评论,请登录

Andy Youell的更多文章

  • Why are data standards so hard?

    Why are data standards so hard?

    If you want to smooth the flow of data between systems, reduce cost and complexity, get better information out….and a…

    12 条评论
  • High performing teams...

    High performing teams...

    A lot is written about high-performing teams; the magic recipe; the critical factors; the combination of skills and…

    6 条评论
  • Moving on

    Moving on

    I’ve never been convinced about the idea of having a detailed career plan. Opportunities come and go; you weigh them…

    17 条评论
  • What is a course?

    What is a course?

    In 2011 the HE sector was gearing up for the implementation of the latest big idea in the better-informed-students…

    23 条评论
  • When you think you've got the lot...

    When you think you've got the lot...

    The UK higher education sector is full of surprises. This is my 33rd year in HE and I think I finally know it.

    9 条评论
  • Three big problems with Digital Transformation Programmes

    Three big problems with Digital Transformation Programmes

    As aspirations around the use of technology continue to grow, many organisations launch themselves into a Digital…

    13 条评论
  • On Jürgen Klopp and the importance of leaving well

    On Jürgen Klopp and the importance of leaving well

    I’m not sure what it was that first latched me onto Liverpool FC but ever since the days of Kevin Keegan and Emlyn…

    8 条评论
  • HE data in 2023

    HE data in 2023

    Those awfully nice people at Wonkhe have asked me to deliver the 2023 State of the Data Nation at the Festival of…

    3 条评论
  • What drives data burden in HE?

    What drives data burden in HE?

    I've been thinking a lot about data burden recently. It's been an issue in HE for decades and I think it is increasing.

    4 条评论
  • Tomorrow is a blank page

    Tomorrow is a blank page

    They say that life is like a novel; every day is a blank page to fill as best we can. As pages become chapters the plot…

    49 条评论

社区洞察

其他会员也浏览了