Data Entropy
Data Entropy

Data Entropy

Data Entropy, a term must be known by Data Scientists but not by general data folks.

Let’s decode it…..

Entropy term is normally used in Data Science domain.

·??????Where finding unexpected outcomes is the aim

·??????Where surprises are welcomed

·??????Where results are analyzed based on probability ratio, lower the probability the better it is

·??????Where informative information is something which wasn’t known

Types

·??????High Entropy means, more surprises, more unexpected values are considered more informative

·??????Low Entropy means, less surprises, less unexpected values are considered less informative

‘Entropy is also called as Extreme Disorder of values.’

Image: https://towardsdatascience.com/entropy-how-decision-trees-make-decisions-2946b9c18c8

Referring to the image, we can see at the starting point all the signs are MINUS, then in the middle there are 50/50 signs of PLUS & MINUS and right at the end all the signs are PLUS. At those extremes left, middle and right corners, the Entropy is at the lowest, so no surprises are expected so nothing much for Data Scientists to predict right. But in 2nd, 4th, and 5th circles, it's difficult to say how many PLUS(s) and MINUS(s), if values are not visible, at this points Entropy is at the highest.

Yes, it’s a confusing topic but this is how Data Scientists find unknown values and tries to predict unknown values.

Cheers.

Maarten van der Heijden

Data Architect at Tata Steel BV, designer of data constructs.

1 年

I tend to use data entropy in a different way: The loss of validity relative to the real world of now over time. A company has to react in the real world and due to that the company and the real world change, leading to a decline in value of the older data for the prediction of what happens now. This is an aspect of data that is hardly to never included in models and the maintenance of the models.

要查看或添加评论,请登录

Mustafa Qizilbash的更多文章

  • DATA PLATFORM GRAVITY

    DATA PLATFORM GRAVITY

    A Unified Approach to Data Ecosystem Management The concept of Data Platform Gravity revolves around a holistic…

    1 条评论
  • DATA PRODUCT OWNERSHIP

    DATA PRODUCT OWNERSHIP

    Data Product Ownership refers to the responsibility of managing and overseeing a data product from creation through to…

    12 条评论
  • DATA PRODUCT OWNER

    DATA PRODUCT OWNER

    A Data Product Owner (DPO) is the person responsible for overseeing the development, maintenance, and evolution of a…

    1 条评论
  • Data As A Product

    Data As A Product

    The transformation of data into a strategic asset has spurred the evolution of data products into what’s known as Data…

  • DATA PRODUCT

    DATA PRODUCT

    A data product is a tool or service that uses data to solve a problem, answer a question, or improve decision-making…

    1 条评论
  • DAC Virtual Layer Integration

    DAC Virtual Layer Integration

    In our comprehensive data architecture, the Virtualization Layer serves as a cornerstone for enhancing data…

    1 条评论
  • Understanding HR Taxonomy and Ontology for Organizational Structure and Functionality

    Understanding HR Taxonomy and Ontology for Organizational Structure and Functionality

    As we all know, what Taxonomy and Ontology are but at the same time it becomes must easier to understand with example…

    6 条评论
  • POTENTIAL COST COMPARISON CHART BETWEEN TRADITIONAL VS PVP APPROACHES

    POTENTIAL COST COMPARISON CHART BETWEEN TRADITIONAL VS PVP APPROACHES

    Hardware Costs Traditional Approaches: Requires separate hardware setups for each environment, increasing hardware…

  • CHALLENGES IN TRADITIONAL APPROACHES ADDRESSED BY PVP

    CHALLENGES IN TRADITIONAL APPROACHES ADDRESSED BY PVP

    Complexity Descriptions: Increased complexity due to managing multiple environments separately. Resolutions in PVP:…

  • HOW TO USE THE PVP APPROACH

    HOW TO USE THE PVP APPROACH

    The Productionizable Viable Product (PVP) executes the new initiative/solution ONLY IN SINGLE PRODUCTION ENVIRONMENT…

社区洞察

其他会员也浏览了