Expect More Sparse Data

One of the arguments for NoSQL databases, along with their ability to handle Big Data, is their ability to handle sparse data. Sparse data is data that, if loaded into a relational DB, will contain too many nulls – for any record, most of the columns will have no values. In a situation like that, representing the data as a set of XML/JSON documents makes more sense than a relational representation.

Data are naturally sparse. Say I have a Strava profile (which says that I’ve ridden 0 km this month). A boxer would not have Strava data but would have a boxing record. Other people will have something else. Most of the information associated with us is naturally sparse.
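To make the contrast concrete, here is a minimal Python sketch of the same two people stored both ways. The field names (`strava`, `km_this_month`, `boxing_record`) and the boxer's win/loss numbers are made up for illustration; only the 0 km figure comes from the example above. It flattens the documents into one fixed-width table and measures how many cells end up empty.

```python
# Hypothetical profiles: each document carries only the attributes that apply.
documents = [
    {"name": "cyclist", "strava": {"km_this_month": 0}},
    {"name": "boxer", "boxing_record": {"wins": 12, "losses": 2}},  # invented numbers
]


def to_wide_rows(docs):
    """Flatten documents into one fixed-width table, padding gaps with None."""
    flat = []
    for doc in docs:
        row = {}
        for key, value in doc.items():
            if isinstance(value, dict):
                # Nested attributes become "parent.child" columns.
                for sub_key, sub_value in value.items():
                    row[f"{key}.{sub_key}"] = sub_value
            else:
                row[key] = value
        flat.append(row)
    # The table needs the union of all columns seen in any document.
    columns = sorted({col for row in flat for col in row})
    return columns, [[row.get(col) for col in columns] for row in flat]


def null_fraction(rows):
    """Share of cells that hold no value (None), as opposed to a real 0."""
    cells = [cell for row in rows for cell in row]
    return sum(cell is None for cell in cells) / len(cells)


columns, rows = to_wide_rows(documents)
print(columns)
print(rows)
print(null_fraction(rows))
```

With only two people and two kinds of information, more than a third of the cells are already empty. Every new kind of information adds columns that are null for almost everyone, while each document stays as small as the person it describes.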

The reason we haven’t been talking about sparse data for the last three decades is that we decided to ignore it. We only use information we really, obviously need – and we create additional tables to offload information that is uncommon.

However, the new era of Big Data has clearly taught us the value of knowing more. Expect public and private enterprises to try to source more data from a huge variety of sources: more information than the number of tables we are prepared to create and maintain. Therefore, we will want to work with sparse data in a document-oriented DB.

Originally published at https://www.businessabstraction.com/2014/07/expect-more-sparse-data/

Kostya Burlutsky

Principal Consultant Power BI

10 years ago

Hence there is a whole range of techniques for filling empty data with "what this data should have been".

