登录查看更多内容

Out-of-sample forecasting challenges in time series data

Murtaza Haider

Professor … Columnist … Data Scientist

发布日期: 2024年2月20日

Achieving "reliable" out-of-sample forecasts remains a formidable challenge, notwithstanding the significant strides made in forecasting methodologies. We delve into the nuances of forecasting out-of-sample data for housing starts in Canada, scrutinizing a time series that spans annually from 1948 to 2023.

This analysis juxtaposes two distinct methodologies: the traditional econometric approach, embodied by the Autoregressive Integrated Moving Average (ARIMA) models, and the more contemporary Prophet model, initially developed by Facebook (now Meta). The intention behind minimal adjustments to the modelling parameters is to underscore the inherent variability in results that can arise from different modelling choices. Both methodologies are applied to univariate time series data, deliberately excluding exogenous variables to maintain focus on forecasting beyond 2023 without the need to predict external inputs.

Utilizing RStudio for the analysis, we explore the ARIMA model through the Auto ARIMA function within the forecast package. This process automatically recommended an ARIMA(0,1,0) configuration, which, when visualized, yielded an out-of-sample forecast that closely resembled a straight line, underscoring a potential limitation in capturing the dynamics of the series.

Forecast generated using AutoArima function in Forecast package

Seeking a more nuanced forecast, we experimented further with the ARIMA framework, settling on an ARIMA(1,1,1) model augmented with a drift term. This model offered a slightly nuanced and, arguably, more "realistic" forecast compared to a straight, horizontal line extension, illustrating the potential benefits of incorporating a trend component in the model. The question remains: how does one capture the variance observed in the data in out-of-sample forecasts?

领英推荐

Simple Linear Regression in Statistics using Least…

Lean Manufacturing & Six Sigma Worldwide 8 个月前

Difference Between Skewness and Kurtosis in Statistics

Lean Manufacturing & Six Sigma Worldwide 7 个月前

Test Assumptions Only After Running an Initial Model

The Analysis Factor 2 周前

The investigation then shifted towards the Prophet model, a tool designed for intuitively handling seasonal trends and the ability to accommodate holidays and other special events easily. Initially, the default Prophet model settings did not yield insightful forecasts. However, after refining the model specifications, including adjustments for seasonality and trend components, the resultant forecasts for both in-sample and out-of-sample periods appeared considerably more plausible.

Housing starts forecasted with Facebook's Prophet model

The wide confidence interval (shaded in blue) should not give one much confidence in the forecast.

These explorations into time series forecasting for housing starts in Canada highlight the critical role of model selection and parameterization and emphasize the inherent uncertainties associated with predicting future trends. We are particularly eager to engage with the broader community on this topic. We invite you to share your experiences with out-of-sample forecasting in time series data, including any insights on alternative tools or methodologies that have proven effective in your univariate time series forecasting endeavours.

Insights with Data & Analytics

6,577 位关注者

要查看或添加评论，请登录

Murtaza Haider的更多文章

Should you give up cars or burgers to save the environment?

2025年2月16日

Should you give up cars or burgers to save the environment?

Transport environmentalists have a favourite target: automobiles. They aim to limit car usage, so policies designed "to…

2 条评论
How Housing Costs Shape Labour Markets

2025年2月6日

How Housing Costs Shape Labour Markets

The cost of housing plays a crucial role in labour market dynamics. Recent analysis by the Canada Mortgage and Housing…

2 条评论
DeepSeek AI: A Case Study in Ideological Bias

2025年1月30日

DeepSeek AI: A Case Study in Ideological Bias

There is nothing subtle about the ideological roots of the China-based generative AI tool, DeepSeek. It is deeply…

11 条评论
The advantage of being an incumbent in politics

2025年1月26日

The advantage of being an incumbent in politics

Days before journalists reported on an impending snap election in Ontario, residents of Mississauga—and likely other…

1 条评论
Join us in Ottawa for the 2025 Canadian Stata conference

2025年1月22日

Join us in Ottawa for the 2025 Canadian Stata conference

We are thrilled to announce the Canadian Stata Conference, scheduled to take place this autumn on October 3rd. This…
Using AI to Draft a Memo Prohibiting AI Use: A Paradox in Practice

2025年1月21日

Using AI to Draft a Memo Prohibiting AI Use: A Paradox in Practice

Generative Artificial Intelligence (Gen AI) is becoming ubiquitous. With the rollout of Microsoft Office 365 Copilot…

2 条评论
Why Canada's Dependence on US Trade Should Be a National Security Concern

2025年1月20日

Why Canada's Dependence on US Trade Should Be a National Security Concern

January 20, 2025, will be a momentous day for Americans when President-elect Donald Trump will take oath as the 47th US…

4 条评论
Development Charges Make Housing Expensive -- A Lot*

2025年1月19日

Development Charges Make Housing Expensive -- A Lot*

* Reproducing our column that appeared in July 2022 in the Financial Post. https://financialpost.

6 条评论
Amortize Development Charges to make new homes affordable

2025年1月18日

Amortize Development Charges to make new homes affordable

A recent survey by the Ontario Real Estate Association highlights growing concerns about the high development charges…

9 条评论
Congestion Pricing in Manhattan -- For whom the bell tolls

2025年1月9日

Congestion Pricing in Manhattan -- For whom the bell tolls

Transport modellers often exhibit an unshakable faith in their models, championing ideas and interventions with…

2 条评论

See all articles

Out-of-sample forecasting challenges in time series data

Murtaza Haider

Professor … Columnist … Data Scientist

领英推荐

Insights with Data & Analytics

6,577 位关注者

Murtaza Haider的更多文章

社区洞察

其他会员也浏览了

Moving Averages in Time Series Analysis

Decision Tree Algorithm

Simplifying Statistics One Post at a Time!

The Estimation vs. Computation Dilemma

Structural Dynamics Modeling Techniques for Analyzing Markets (Executive Summary)

Data

Harry Potter and the hidden variables.

Taming Time: Mastering Seasonality and Trends in Time Series Forecasting

Exploring the F-Distribution and ANOVA: Keys to Statistical Insights

Concise Basic Stats - Part X: Distribution-free tests (Nonparametric Statistics)

领英推荐

Insights with Data & Analytics

6,577 位关注者

Murtaza Haider的更多文章

Should you give up cars or burgers to save the environment?

How Housing Costs Shape Labour Markets

DeepSeek AI: A Case Study in Ideological Bias

The advantage of being an incumbent in politics

Join us in Ottawa for the 2025 Canadian Stata conference

Using AI to Draft a Memo Prohibiting AI Use: A Paradox in Practice

Why Canada's Dependence on US Trade Should Be a National Security Concern

Development Charges Make Housing Expensive -- A Lot*

Amortize Development Charges to make new homes affordable

Congestion Pricing in Manhattan -- For whom the bell tolls

社区洞察

其他会员也浏览了

Moving Averages in Time Series Analysis

Decision Tree Algorithm

Simplifying Statistics One Post at a Time!

The Estimation vs. Computation Dilemma

Structural Dynamics Modeling Techniques for Analyzing Markets (Executive Summary)

Data

Harry Potter and the hidden variables.

Taming Time: Mastering Seasonality and Trends in Time Series Forecasting

Exploring the F-Distribution and ANOVA: Keys to Statistical Insights

Concise Basic Stats - Part X: Distribution-free tests (Nonparametric Statistics)