Data & Business Analytics Series: Basis Statistics (1/n)
Jayadev Palai
Business Excellence & Strategy | Capital Budgeting | PMO | Consulting | Business & Data Analytics | TQM Champion
In this series of posts, I'll try to cover tools and techniques required for analysis, visualization, and presentation of data & business insights from basic to advance level.
"Data analysis is a combination of statistical mathematics and the art of organizing, analyzing, and interpreting the data to generate meaningful insights for taking right action at right time."
Basic Statistical terminologies and its usage:
1. Mean, Median & Mode
Each of these measures gives us different insights and they are used based on the nature and distribution of the data.
Tip #1: Mean is a good option when the data distribution is symmetrical, while the median is often a better choice when the distribution is skewed because it isn’t affected by extreme values. The mode, on the other hand, can be useful for categorical data.
Tip #2: In a simple mean, all numbers are treated equally, while in a weighted mean, some numbers contribute more to the final average than others. The weighted mean is particularly useful when we want to calculate an average that is not skewed by a few outliers or when the data elements in the set are not equally important. We generally give more weights to recent data sets or relevant data sets without ignoring older / earlier data trends.
2. Variance & Standard Deviation:
Variance and Standard Deviation are both statistical measurements that describe the spread of data points in a data set.
领英推荐
3. Central Limit Theorem of a Normal Distribution & 68–95–99.7 rule:
The popular alternate name of Central Limit Theorem is 'Bell Curve' due to its shape (which has a infamous use during annual appraisal cycle of employees by People Managers)
Stay tuned for coming parts of this series. Do let me know your thoughts through comment / DM and share if found useful.