登录查看更多内容

Expect More Sparse Data

Alex Jouravlev

Data and Enterprise Architecture veteran and practitioner with up to date strategic knowlege and hands-on skills in AI. Proponent and enabled of Data-Driven Enterprise. Everything Graph and Metadata

发布日期: 2014年7月17日

One of the arguments for NoSQL Databases, along their ability to handle Big Data, is their ability to handle sparse data. Sparse data is data that if loaded into a relational DB will contain too many nulls – for any record, most of the columns will have no values. In a situation like that, representing data as a set of ML/JSON documents makes more sense than relational representation.

Data are naturally sparse. Say, I have a Strava profile (that says that I’ve ridden 0km this month). A boxer would not have Strava data but will have a boxing record. Other people will have something else. Most of the information associated with us is naturally sparse.

We reason we haven’t been talking about sparse data for that last three decades is because we decided to ignore it. We only use information we really, obviously need – and we create additional tables to unload information that is uncommon.

However the new era of Big Data clearly taught us the value of knowing more. Expect public and private enterprises to try to source more data from huge variety of sources. More information than the number of tables we are prepared to create and maintain. Therefor we will want to work with sparse data in a document-oriented DB.

Was originally published on https://www.businessabstraction.com/2014/07/expect-more-sparse-data/

Kostya Burlutsky

Principal Consultant Power BI

10 年

Hence there is a whole range of techniques of filling empty data with "what this data should have been".

要查看或添加评论，请登录

Alex Jouravlev的更多文章

You have your Business Architecture. Do you use it?

2019年1月6日

You have your Business Architecture. Do you use it?

As a Consultant, I often ask prospective clients about Business Architecture. The usual answer is that someone already…
The True Face of Level 4 Process Mapping

2018年12月27日

The True Face of Level 4 Process Mapping

We need to have a serious conversation about Process Centricity vs Data Centricity in the face of Digital…
Agile, Simplified

2018年12月23日

Agile, Simplified

It doesn’t look like there is a good working definition of what constitutes Agile. The Agile Manifesto is supposed to…

6 条评论
As-is Modelling, the Sweet Wasteland of Enterprise Architecture

2018年12月22日

As-is Modelling, the Sweet Wasteland of Enterprise Architecture

Enterprise Architecture is under attack. On one side, the Service Design people are “planning and organizing people…

81 条评论
Agile Expectations Board

2018年12月10日

Agile Expectations Board

An Agile Expectations Board seeks to prevent an Agile project from successfully delivering Iterations on the way to…
Understanding Semantic and Property Graphs

2018年10月24日

Understanding Semantic and Property Graphs

Executive Summary As enterprises increasingly adopt Graph Databases, to better reflect the nature of the data, or as…
The Cost of the Right to be Different

2017年5月28日

The Cost of the Right to be Different

It is a high season for IT contracts here in Canberra, so the “Let the Hundred Flowers Bloom” anti-pattern is in full…
COTS or CRHMS? Understanding Full Stack of a Core Enterprise Software.

2016年6月7日

COTS or CRHMS? Understanding Full Stack of a Core Enterprise Software.

Commercial Off-the-shelf Software, or COTS, seems as, although expensive, a way to avoid risks and challenges…

1 条评论
Some Inconvenient Thoughts about Architecture

2015年11月15日

Some Inconvenient Thoughts about Architecture

Enterprise Architecture should include Diagrams understood by the highest executive level to be useful. If you don’t…
Enterprise is the Data: Are Processes Overrated?

2014年12月21日

Enterprise is the Data: Are Processes Overrated?

The first thing I noticed when started to transition some of my clients from UML to OWL Ontology Modelling was that…

6 条评论

See all articles

Alex Jouravlev的更多文章

You have your Business Architecture. Do you use it?

The True Face of Level 4 Process Mapping

Agile, Simplified

As-is Modelling, the Sweet Wasteland of Enterprise Architecture

Agile Expectations Board

Understanding Semantic and Property Graphs

The Cost of the Right to be Different

COTS or CRHMS? Understanding Full Stack of a Core Enterprise Software.

Some Inconvenient Thoughts about Architecture

Enterprise is the Data: Are Processes Overrated?