登录查看更多内容

Tuning a Statistical Forecast Part 2: Methodology

Simon Joiner

Preparing you for Lift-Off with o9 Solutions, Inc.

发布日期: 2021年3月31日

Forecast Tuning Methodology.

In this second article I will explain the methods that can be used to tune a statistical forecast. The first of this series of articles presented the various resource options to tuning a Statistical Forecast and subsequent articles will cover the strategic elements, business decisions, prioritisation, impact and ability and actual procedures.

Methodology is the activities that are grouped together in rough order of difficulty from simple to complex. Tuning generally requires many of the methods to be applied but this can be a very difficult task to achieve in a blended manner (as in, multiple methods at the same time). It is easier to pursue tuning by individual methods first, then blend various methods together as your understanding of tuning effectivity grows.

The 9 Steps

The Methodology of Engine Tuning for accuracy consists of the following 9 steps:

History: Clean & adjust to create the best forecast. Difficulty Factor = Easy
Decision Process: Lifecycle / Segments / On or Off. Difficulty Factor = Easy
Hierarchy Levels: Edit to drive better usage. Difficulty Factor = Moderate
Engine & Models: Selection in engines or plans. Difficulty Factor = Moderate
Engine Settings: Tweak for optimal settings. Difficulty Factor = Complex
Causals: Add, remove, edit to obtain better results. Difficulty Factor = Moderate
Proport: Assess allocation & aggregation methods. Difficulty Factor = Moderate
Nodal: Refine above per individual combinations. Difficulty Factor = Complex
Strategy: Approach to tuning measure & adjust.

1.History: Clean & adjust to create the best forecast. Difficulty Factor = Easy

Since most Statistical Forecasts use History (Orders, Shipments, Invoices etc.) as the source of the statistical calculations, the first and easiest tuning step of all is to adjust your history. Yes, this means that you are already tuning your forecast!

You should retain the original History for reference and then have adjustment data streams (absolute and/or percentage) that then feed into a Final History data stream. The Final History should show Actual History unless there are adjustments in which case use the absolute adjustments and then the percentage changes.

Why would you want to perform changes? There are a large number of reasons why the history is either incorrect or inappropriate for the Statistical Forecast you would like to have. Some examples are:

Combinations that need to be inactivated (remove any order history)
Stock-Out (add the order history that would have been there)
Competitor Stock-Out (reduce order history that would not have been received)
New Product (add history that to create the forecast shape you want),
Data error (remove / add as needed)

2. Decision Process: Lifecycle / Segments / On or Off. Difficulty Factor = Easy

A 'Node' is the combination of Organisation, Product and Location (and could other Dimensions). Node Processing is the decision of whether or not to forecast that node or combination. What does this actually mean? It means determining how your data should react inside your planning system. You control what gets forecasted and how.

Turning the nodes 'Off' means you don't want a Statistical Forecast to run and turning them 'On' means you do. Artificial Intelligence and Machine Learning Engines can automate some of this activity once you set parameters for the System too react to. Typical practical examples of why you might adjust the node settings are:

The Combination is not statistically forecastable
The Product is obsolete, and no more forecast is wanted.
Customer on hold and their demand needs to be removed.
Dummy Combination: created for testing and no forecast is wanted.
Data Loading Error: A duplicated combination to deactivate.

Node Processing can also be a crucial exception management pivot. Do you have Forecast for a Customer who is on hold? Do you have no Forecast where a Node has history and is set to active?

3.Hierarchy Levels: Edit to drive better usage. Difficulty Factor = Moderate

What level is your Forecast generated at? Some solutions are set at a particular level (say, Organisation, Item & Customer Channel) while others may use hierarchies flexibly using Automation and Machine Learning to select the 'right' level as the data demands.

Tuning the engine to create better forecasts will involve assessing these hierarchies for appropriateness. This approach may not be easy to perform (especially where data is fixed within an Integrated Solution) but it will be worth analysing to confirm if there is a problem or not. If there is a challenge, at least you are aware of it and it can be added to the list of future improvements.

A Flexible hierarchy solution should be analysed regularly and indeed, used for exception managements since forecasts generated higher in the Forecast Tree will be due to lack of lower-level data. The higher a forecast is created, the less accurate the prediction will be and these combinations should be reviewed using the other options in this list.

4. Engine & Models: In Engines or Plans. Difficulty Factor = Moderate

Solutions vary of course, but it should be possible to select and deselect the models used by your statistical engine. Time Series, Exponential, Intermittent and Regression Models will all create quite different forecasts from the same set of historical data.

工程关注我们，每天学习?? 9 个月前

Decision Tree Algorithm

Harry Thapa 10 个月前

What Is The Difference Between Parametric And…

Ze Learning Labb 3 个月前

Assess the different models that are available to determine the best model selections. Best Fit solutions will offer a one model per combination while more sophisticated systems will use Machine Learning to mix and match.

If there are conflicts with model settings, consider creating independent sets of data where the best models can be set per data segment. For example, you could build 2 engines: one for intermittent data and one for smooth or perhaps one for B2B and one for B2C.

5. Engine Settings: Find the optimal settings. Difficulty Factor = Complex

Engine Parameters define how the models react to data in the system. Parameters will define the length of your forests horizon, the significance applied by the engine to recent history, what to do with null history, the definition of the moving average and so on. There could be many hundreds of parameters.

Complex solutions should have their parameters properly assessed and tuned for project go live but how long ago was that? Extract the parameters, assess them and create a plan to change and validate.

6.Causals: Maintain to obtain better results. Difficulty Factor = Moderate

Causal Factors are elements that can be defined as?having an effect upon demand and can used to improve the accuracy of the forecast.??Causals need to be defined, added into history and also loaded into the future. Broadly speaking, there are two types of causal factor:?

Global?and?Local: Global Factors?affect the entire dataset (Spring, Summer, Autumn, Winter, Christmas, New Year, Easter, Price).
Local Factors?affect subsets of the data (Regionalisation, Promotions, Weather, Temperature, Price)

Too many casuals can create a lot of noise for statistical forecast engines to assess. As a starting point, fewer is better. Only include causals when you know they add value. Verify that the casual data is complete and as correct as possible. For example, if price change is a casual, validate that price data is not missing anywhere in the dataset. If Promotions exist - are post promotional reviews conducted to validate and correct assumptions?

7.Proport Function?(Allocation & Aggregation) Difficulty Factor = Moderate

Proport defines how data is rolled up and down your demand planning hierarchies in the past and the future. An example of proportionality; if one item-location combination has four times as many sales as another, the former combination should receive four times as much of the statistical forecast. Examples of where proport can impact demand planning data:

When the Statistical Engine generates a forecast at an aggregated level.
When data is imported at an aggregated level.
When users perform chaining at an aggregated level.
When History data is overridden
When Forecast data is overridden

Typically, historical data will use itself as the proport mechanism, but the Forecast can use itself, or previous approved forecast or last year or annual budget... Select the least compromising weighting method. Try to make adjustments as low as possible.

8.Nodal: Refine individual combinations. Difficulty Factor = Complex

Nodal tuning is a term used to describe the maintenance of the previous 7 options per individual data intersection. This feature, if it is available to you, can transform forecast accuracy since each combination can be optimally tuned. The downside of this local tuning approach is that the cost of maintenance can be extremely high.

Nodal tuning should be used when it is proven that a particular set of data performs better with a unique combination of settings and where this data cannot be removed and managed in a separate plan.

9.Strategy & Procedures?(Approach to Tuning) - Measure & Adjust

This step really should be the first one, but you need to know the impact and difficulty of the 8 steps before you can really set a strategy and procedure. A strategy defines the purpose of tuning (better accuracy, more trust, greater efficiency etc.) and the plan to achieve the strategic purpose. A procedure defines the way that the methods described above will be applied and assessed in order to deliver the against the strategy. Some basic question to ask:

Which, if any steps do not exist in your business solution and/or cannot be performed?
Which steps require Training, Systems and Data?
What recording & analysis methods to use?
How to decide 'what is good' before stopping a tuning step and moving on?

I expect that existing Demand Planners will be able to work with Methods 1 & 2 immediately as these steps are naturally performed by planners, but are they formally captured and analysed for impact?

A good place to start engine tuning is to capture baseline data (settings and forecast results) and then to try and record changes applied and the results achieved. It will take some time before the correct spreadsheet structure and reporting mechanism will be found to manage procedures. Test it out before embarking on a more complex and thorough tuning journey.

Tuning a Statistical Forecast Part 1 (Resources

Clive Goodman CIMA ACMA MAPM

Project Finance Business Partner at HS2 (High Speed Two) Ltd, Mega Project Cost & Financial Control Specialist

3 年

Big share for this one Simon ??

1 次回应

Simon Joiner

Preparing you for Lift-Off with o9 Solutions, Inc.

3 年

It's not really 7 minutes. I blame the images.

查看更多评论

要查看或添加评论，请登录

查看全部

Tuning a Statistical Forecast Part 2: Methodology

Simon Joiner

Preparing you for Lift-Off with o9 Solutions, Inc.

Forecast Tuning Methodology.

The 9 Steps

1.History: Clean & adjust to create the best forecast. Difficulty Factor = Easy

2. Decision Process: Lifecycle / Segments / On or Off. Difficulty Factor = Easy

3.Hierarchy Levels: Edit to drive better usage. Difficulty Factor = Moderate

4. Engine & Models: In Engines or Plans. Difficulty Factor = Moderate

领英推荐

5. Engine Settings: Find the optimal settings. Difficulty Factor = Complex

6.Causals: Maintain to obtain better results. Difficulty Factor = Moderate

7.Proport Function?(Allocation & Aggregation) Difficulty Factor = Moderate

8.Nodal: Refine individual combinations. Difficulty Factor = Complex

9.Strategy & Procedures?(Approach to Tuning) - Measure & Adjust

更多精彩文章

社区洞察

其他会员也浏览了

TIQ Part 4 – Being Time intelligent

Data Analysis Mistake: Common sense may result in 180-degree wrong causation judgment

Addressing Normality in Latent Profile Analysis (LPA) and Latent Class Analysis (LCA)

Dive into the World of Robust Statistical Methods: More Than Just Data Analysis (1/5) ????

Essentials of Time Series Forecasting: Key Components, Challenges, and Algorithms

The most powerful tool of your mind is not its capacity to know, but its ability to question.

Understanding the Minimum Description Length Principle: A Balance Between Model Complexity and Data Fit

Nowcasting with MIDAS regressions

Discover about the Shewhart Chart/ Statistical Process Control Graph

Why Most Reports Fail - and How to Make Yours Essential for Decision-Makers

Forecast Tuning Methodology.

The 9 Steps

1.History: Clean & adjust to create the best forecast. Difficulty Factor = Easy

2. Decision Process: Lifecycle / Segments / On or Off. Difficulty Factor = Easy

3.Hierarchy Levels: Edit to drive better usage. Difficulty Factor = Moderate

4. Engine & Models: In Engines or Plans. Difficulty Factor = Moderate

领英推荐

5. Engine Settings: Find the optimal settings. Difficulty Factor = Complex

6.Causals: Maintain to obtain better results. Difficulty Factor = Moderate

7.Proport Function?(Allocation & Aggregation) Difficulty Factor = Moderate

8.Nodal: Refine individual combinations. Difficulty Factor = Complex

9.Strategy & Procedures?(Approach to Tuning) - Measure & Adjust

Still Planning in Spreadsheets?

2022年1月22日

Cars & Forecasting: Speed or Accuracy?

2021年4月16日

Tuning a Statistical Forecast Part 3: Strategic Elements

2021年4月12日

Buy-In & Sell-Out

2021年3月30日

Forecasting Strategy

2021年3月27日

Tuning a Statistical Forecast Part 1: Resources

2021年3月24日

Demand Planning Panaceas

2021年3月23日

Oracle Demantra v Demand Management Cloud: Structures.

2021年3月12日

Demand Planning Pain Points

2021年3月6日

Oracle Demand Management Cloud. A quick introduction and comparison to Demantra.

2021年3月1日

社区洞察

其他会员也浏览了

TIQ Part 4 – Being Time intelligent

Data Analysis Mistake: Common sense may result in 180-degree wrong causation judgment

Addressing Normality in Latent Profile Analysis (LPA) and Latent Class Analysis (LCA)

Dive into the World of Robust Statistical Methods: More Than Just Data Analysis (1/5) ????

Essentials of Time Series Forecasting: Key Components, Challenges, and Algorithms

The most powerful tool of your mind is not its capacity to know, but its ability to question.

Understanding the Minimum Description Length Principle: A Balance Between Model Complexity and Data Fit

Nowcasting with MIDAS regressions

Discover about the Shewhart Chart/ Statistical Process Control Graph

Why Most Reports Fail - and How to Make Yours Essential for Decision-Makers