8 Building Blocks of Statistical Thinking
The fundamentals of statistical thinking are crucial for success and remain the same whether you're dealing with small or big data.
In their article [1], Roger Hoerl and Ronald Snee discussed 8 building blocks of statistical thinking.
Here's a summary of the practical tips, mistakes, and relevant case studies.
1) Clear Problem Statement
Tip: Clearly define the problem and intended uses of the model upfront. Don't get distracted by the data.
Mistake: Jumping into data analysis without clear objectives. This often leads to models that don't generalize.
Case Studies:
2) Process Understanding
Tip: Thoroughly understand the process that generated the data, including how measurements were obtained.
Mistake: Assuming the data is perfect without evaluating its pedigree. Garbage in, garbage out.
Case Studies:
3) Analysis Strategy
Tip: Develop an iterative, phased analysis strategy rather than jumping straight to modeling.
Mistake: Attempting to solve the problem in one pass based on model fit statistics.
Case Studies:
4) Variation Sources
Tip: Seek to understand sources of variation. Reducing variation improves processes.
Mistake: Assuming all variation is noise instead of a signal pointing to root causes.
Case Studies:
5) Quality Data
Tip: Assess data pedigree including origin, collection methods, and measurement process.
Mistake: Using flawed data and assuming algorithms can compensate.
领英推荐
Case Studies:
6) Domain Knowledge
Tip: Leverage domain expertise in data selection, variable choice, model interpretation, and more.
Mistake: Ignoring theory and expertise already established, and relying solely on data.
Case Studies:
7) Sequential Approach
Tip: Take an iterative, multi-phase approach to dig into a problem versus one pass at the data.
Mistake: Assuming a problem can be solved fully with one data set and analysis.
Case Studies:
8) Modeling Process
Tip: Choose a modeling process that aligns with the problem and avoids overfitting the data.
Mistake: Focusing too much on complex algorithms without sound statistical thinking principles.
Case Studies:
Successful data analytics integrates sound statistical thinking, domain expertise, and business understanding with modeling techniques.
This holistic approach is more likely to produce actionable insights that stand the test of time. Just throwing algorithms at data is prone to errors, overfitting, and results that cannot be replicated.
Take the time to do thoughtful analysis.
References:
[1] Hoerl, R. W., Snee, R. D., & De Veaux, R. D. (2014). Applying statistical thinking to?‘Big?Data’ problems. WIREs Computational Statistics, 6(4), 222–232.
What’s Next?
?If you’re an industrial leader and want to know about Statistical Thinking Applications, check my previous Newsletter last week here.
?? Subscribe here and Join 6800+ leaders getting actionable advice to solve problems, make informed decisions, and lead with influence - every Friday.
Sales & Business Development, Process Engineering & Simulation Specialist, Digital Transformation Leader, Training Courses & eLearning Designer
1 年Thanks for sharing, all the best :)