登录查看更多内容

Milestone Project: Time series forecasting in TensorFlow (BitPredict ????)

Marius Poskus

Cybersecurity Executive @ Fintech | Cybersecurity Leader | Board Advisor | AI Security | mpcybersecurity.co.uk

发布日期: 2024年8月9日

Time Series you might ask? Time series deals with data over a period of time. It can be anything from number of employees over a 10 year period, sales of computers over period of 5 years or electricity usage over 50 years.

The timeline can be short (seconds/minutes) or long (years/decades). And the problems you might investigate using can usually be broken down into two categories:

Classification - Anomaly detection, time series identification (where did this time series come from?)
Forecasting - Predicting stock market prices, forecasting future demand for a product, stocking inventory requirements

What we will cover during this project

Get time series data (the historical price of Bitcoin)Load in time series data using pandas/Python's CSV module
Format data for a time series problemCreating training and test sets (the wrong way)Creating training and test sets (the right way)Visualizing time series dataTurning time series data into a supervised learning problem (windowing)Preparing univariate and multivariate (more than one variable) data
Evaluating a time series forecasting model
Setting up a series of deep learning modelling experiments, Dense (fully-connected) networks, Sequence models (LSTM and 1D CNN), Ensembling (combining multiple models together)Multivariate models, Replicating the N-BEATS algorithm using TensorFlow layer subclassing
Creating a modelling checkpoint to save the best performing model during training
Making predictions (forecasts) with a time series model
Creating prediction intervals for time series model forecasts
Discussing two different types of uncertainty in machine learning (data uncertainty and model uncertainty)
Demonstrating why forecasting in an open system is BS (the turkey problem)

Types of Time Series

?? Note: The frequency at which a time series value is collected is often referred to as seasonality. This is usually mesaured in number of samples per year. For example, collecting the price of Bitcoin once per day would result in a time series with a seasonality of 365. Time series data collected with different seasonality values often exhibit seasonal patterns (e.g. electricity demand behing higher in Summer months for air conditioning than Winter months).

Creating Test and Training Data Sets

We will also explore the differences between univariate and multivariate data, which might be useful in our case because Bitcoin halvening effect might alter our price predictions, as you can see in the image below:

There is an important problem that we needed to address in creating working data sets, because we are working with prediction models, we can't any longer use random data split models as we used in computer vision or language processing modelling, we have to split the data based on the dates, for example, first 80% of date range of prices will be training data and the last 20% will be the test data. This problem is perfectly summed up in the image below:

So as mentioned when we use the data split according to the time, the split looks like in the image below:

During our modelling experiments we built in total 11 models to try various deep learning architectures and explore different horizon and window sizes. Horizon and Window Sizes? It's not what you might think, Horizon is number of steps we are looking to predict in the future and Window size is the number of data point from the past we will use to predict the horizon. So image below shows all the models we have tried:

领英推荐

Early adopter version of my book - mathematical…

Ajit Jaokar 5 个月前

Supervised Machine Learning in Time Series Forecasting

BI4ALL 2 年前

Cluster bugs using ML (K-Means Clustering Algorithm) –…

Sumon Dey 1 年前

Deep Learning model for time series problems

Let me paint a picture first of all about the Naive model - this is the model that does not need any prediction as it uses very simple formula:

Which basically means that it takes previous day's result as the future forecast, but as you will such a simple model is not that easy to beat with deep learning architectures. Before i showcase the results from each modelling experiments, let sum up what we used to measure the model effectiveness and why we used it:

Mean absolute error shows us how far on average the model prediction is away from the real data.

After running all 11 models with our data, the results can be found in the able below:

The majority of our deep learning models perform on par or only slightly better than the naive model. And for the turkey model, changing a single data point destroys its performance.

?? Note: Just because one type of model performs better here doesn't mean it'll perform the best elsewhere (and vice versa, just because one model performs poorly here, doesn't mean it'll perform poorly elsewhere).

As I said at the start, this is not financial advice.

After what we've gone through, you'll now have some of the skills required to callout BS for any future tutorial or blog post or investment sales guide claiming to have model which is able to predict the future.

Mark Saroufim's Tweet sums this up nicely (stock market forecasting with a machine learning model is just as reliable as palm reading).

As covered by model 10 - regarding a Turkey problem, just looking at the historic market data, it is almost impossible to predict future prices because we are not looking at potential market conditions and Turkey events - such as market crash, which some of you might be aware we are experiencing this week.

Cyber Secrets Unveiled

3,435 位关注者

要查看或添加评论，请登录

Marius Poskus的更多文章

Navigating the Risks of AI Adoption: Crucial Security Controls and Open-Source Tools You Need Today

2025年3月28日

Navigating the Risks of AI Adoption: Crucial Security Controls and Open-Source Tools You Need Today

As organizations increasingly integrate Artificial Intelligence (AI) into their operations, the landscape of…

2 条评论
Navigating the AI Revolution: Managing Security Risks in Machine Learning and Generative AI

2024年12月19日

Navigating the AI Revolution: Managing Security Risks in Machine Learning and Generative AI

In today's rapidly evolving technological landscape, Artificial Intelligence (AI) and Machine Learning (ML) are no…

1 条评论
The Evolution of the CISO Role: From Tech Expert to Business Leader

2024年10月29日

The Evolution of the CISO Role: From Tech Expert to Business Leader

In today's dynamic business landscape, the role of the Chief Information Security Officer (CISO) has undergone a…

10 条评论
Beyond the Tools: Redefining CISO Job Requirements

2024年10月23日

Beyond the Tools: Redefining CISO Job Requirements

In the ever-evolving landscape of cybersecurity, organizations are increasingly recognizing the critical role of the…
Your CISO is Not a Swiss Army Knife: Building a Balanced Security Team

2024年10月15日

Your CISO is Not a Swiss Army Knife: Building a Balanced Security Team

In the complex and ever-evolving landscape of cybersecurity, there's a dangerous misconception that persists in many…

2 条评论
Why Bargain-Hunting for a CISO is a Recipe for Disaster

2024年10月8日

Why Bargain-Hunting for a CISO is a Recipe for Disaster

In today's digital landscape, where cyber threats loom large and data breaches can cost millions, the role of the Chief…

21 条评论
The True Cost of Misunderstanding the CISO Role

2024年9月30日

The True Cost of Misunderstanding the CISO Role

In today's rapidly evolving digital landscape, the role of the Chief Information Security Officer (CISO) has become…
Cybersecurity: From IT Afterthought to Boardroom Priority

2024年9月24日

Cybersecurity: From IT Afterthought to Boardroom Priority

In today's rapidly evolving digital landscape, cybersecurity has undergone a dramatic transformation. Once relegated to…
The Cybersecurity ROI Puzzle: Cracking the Code

2024年9月13日

The Cybersecurity ROI Puzzle: Cracking the Code

In the boardrooms of companies around the world, a challenging question often arises: "What's the return on investment…

6 条评论
Proactive Cybersecurity: Reshaping Business Processes

2024年9月6日

Proactive Cybersecurity: Reshaping Business Processes

In an era where cyber threats are constantly evolving and becoming increasingly sophisticated, organizations can no…

See all articles

Milestone Project: Time series forecasting in TensorFlow (BitPredict ????)