Selecting Forecasting Methods in Data Science
We are dealing with plethora of data and information in the world today and expectation is to predict and forecast how we can gain competitive advantage based on the information that we have, to act in advance. We look forward to define and furnish various methods based on our gut feel, past historical data, simple mathematical averages, and many more to get an incredibly precise prediction. With advanced analytics and data science, we develop “always-on” forecasting models which enable our clients to take their decisions effectively. From intuition to traditional algorithms to machine learning, phases have been evolving over a period.
Key processes that we follow while using forecasting can be outlined as per below:
1. Defining the goal or business objective
2. Getting the required data
3. Explore and visualize series
4. Pre-processing of the data
5. Partitioning the series
6. Applying suitable forecasting methods
7. Evaluate and compare performance
8. Implement forecasts / system
Processes 2 & 3 are iterative and 6 & 7 are iterative.
We try to look for answers to various questions in the process – is the goal / business objective descriptive or predictive in nature? What is the forecast horizon (how far into the future, rolling forward or at a single time point, one time forecasting or ongoing task)? How will forecast(s) be used (who are stakeholders, whether it’s a numerical or event forecast, what is the cost of over-prediction or under-prediction, will forecasts undergo “adjustments”)? What is the forecasting expertise and automation needed to accomplish the goal? etc.
When it comes to data, the quality of data, sample, temporal frequency, balance between signal & noise, series granularity, domain expertise is quintessential. There are many methods that can be used to forecast. Which are relevant to our situation depends upon our objectives and conditions we face. Often, there is no single best method. In fact it is best to use different methods and combine their forecasts.
If we talk about well-accepted methods that should be used to provide benchmark forecasts, the simplest forecasting method for time series for example is the random walk. It assumes that the future values of a time series will be equal to the current value. In other words, one does not have useful information about future changes in the series – it is equally likely to go up or down. Time series components can be categorized in multiple parts.
1. Systematic part
i. Level
ii. Trend
iii. Seasonal patterns
2. Non-systematic part
i. Noise
Additive and multiplicative models can be defined in an equation comprising of these components. i.e.
Additive model: Y(t) = Level + Trend + Seasonality + Noise
Multiplicative model: Y(t) = Level * Trend * Seasonality * Noise
A model which fits the data well, does not necessarily forecast well. A perfect fit can always be obtained by using a model with enough parameters. Over-fitting a model to data is as bad as failing to identify the systematic pattern in the data. Hence as a solution we look forward to data partitioning strategies where we look forward to training, validation, and future aspects. Ideally, validation period depends on the forecast horizon, seasonality, length of series, underlying conditions affecting the series etc.
Typical time series patterns can be looked at in Exhibit 1.
Exhibit 1
When we look at common predictive accuracy measures, then average error, mean absolute error (MAE), mean squared error (MSE), mean absolute percentage error (MAPE) techniques come into mind. Exhibit 2 displays how these measures are different.
Exhibit 2
There are various forecasting methods used based on data and situation. If there is a need for one time forecasting, in-house expertise is available, smaller number of series exist, typically model based methods are used and these are typical “manual”. In the other hand, if there is ongoing forecasting, no in-house expertise available, many series to forecast etc., then typically data driven methods are used and these are “automated” and computationally fast. Exhibit 3 shows various forecasting methods which are either model based or data driven. Ensembles are often used by combining forecasts from different methods.
Exhibit 3
While prediction is concerned with future certainty, forecasting looks at how hidden currents in the present signal changes in direction. Objective of forecasting is to identify the full range of possibilities and not limited to set of illusory certainties. “Forecasting can probably be looked at a subset of prediction” - any time we predict into the future, it is a forecast. All forecasts are predictions, but not all predictions are forecasts, as when we would use regression to explain the relationship between two variable. So, what a forecast need? It requires a logic, an ability for quality assessment of forecasting approaches and few rules for effective forecasting. We need to be pragmatic in terms of defining it in a manner that helps decision maker or stakeholders to exercise strategic judgment, need to identify key patterns and seasonality, need to embrace those items which cannot be classified, need to look at more past or historical data to make sense as fewer data elements would not make any meaningful forecasting etc. We also need to understand when to make a combination of forecasts or forecasting methods by using ensembles and when “not to” forecast at all.
Forecasting and selecting an appropriate method for doing forecasting will always be interesting blend of “Art” and “Science” in addition to our judgement and practicality.
Disclaimer: "The postings on this site are my own from my experiences and don't necessarily represent IBM's positions, strategies or opinions.”
Superintendente Geral, Produtos de Tesouraria
7 年Very interesting article! I'm creating an Algo Trading portal named stratsphera, that will be focused on Latin America traders and academics. I'd like to ask your permission to repost this as it would be much helpful to the community. Thanks!
Vice President & CTO, Data and AI Services at Kyndryl
8 年Very well explained - thank you Kamal !!
Senior Analyst at Westpac Group, Australia
8 年Concise, well written work.....Thnx..
Director Consulting IP at CGI
8 年Good explanation...sometime more the methods...knowing why to forecast, what variable to forecast- keeping in mind the inter dependency of variables under consideration is also a key criteria
Tax Fusion leader for BDO creating innovative digital products to drive business growth
8 年Well explained Kamal.