登录查看更多内容

Comparing two univariate noisy time series of same length and sampling time

Gustavo Sánchez Hurtado

Award-Winning Engineer, Researcher & Educator | Digital Transformation: Control Systems, IoT, and Machine Learning | PLC/SCADA programmer | Python/MATLAB | Node Red | Global Speaker, Author & Podcaster

发布日期: 2022年7月24日

A very common problem in practice is this: given two univariate noisy time series of same length and sampling time, we need two decide if they come from the same stochastic process or not. For example, consider the two series showed in the figure above. They correspond to 100 values, from t = 0 to t = 10s, so the sampling frequency is 10Hz. They seem to have the same average, near 0. Series y2 seems to reach higher max and lower min values. Let's take a look of their basic statistics.

We confirm that both series have mean near 0, but y2 has a greater variance. Now let's take a look of their distribution.

Again we can see that y2 seems to have a broader distribution compared to y1. Although their distribution does not look Gaussian, we may perform an F-test, to compare the variances.

Given the p-values, we have more clues that these two series do not come from the same stochastic process.

Let's take a look of they spectrum.

We can see that y2 is having a peak which is greater in frequency and amplitude compared to y1.

The R script for this example is available at:

https://github.com/multiopti/MYWAI/blob/main/comparing2series.R

Do you know a better approach to solve this problem? Do you have a counterexample in which this method does not work? Do you have any general comment about this article?

I would be happy to receive your comments to: [email protected]

At?MYWAI?we promote agile, explainable, reliable and affordable ML at the edge.

Jyoti R. Nair

Risk Analyst

2 年

Hi Prof...It depends on what you want to study from the two series.. IMHO 1. First of all I presume, the both series are stationary. In that case you can find the optimum ARMA model (individually) by using the auto.arima function in R. Or you can write a short program in R, iterating through to AR=12, MA=12 to find the best combination, while minimising the information ratio (either AIC or BIC). Conduct the residual test for serial correlation. ARCH may be present, but that is not an issue (because volatility is not studied here). The residuals should be stationary. Once you test the robustness, you can conduct forecasting on these models. 2. Second, if they are not stationary, but their first difference (both) is stationary, you can conduct cointegration as they are integrated at I(1). Essentially that means that there is a unit root, hence cointegration test can be done. 3. If their first differences are stationary, you can look at Vector Autoregression (VAR) to model the two series to understand causality. 4. If one of the series is stationary whereas the other is not, you can use ARDL . 5. You can use Multivariate GARCH models to study the interdependencies with respect to volatilities. Hope the above helps.

1 次回应

Jay Laramore

Product Marketing Director - FICO Xpress Optimization | Decision Scientist

2 年

Interesting article and approach, thanks for sharing. You might also consider running a cointegration test (e.g., Engle-Granger, Johansen, etc.)

1 次回应

Kanwardeep Singh Gahlot

BITS RMIT Cotutelle Ph.D. Researcher | Ex - Intel Corporation | DTU'23

2 年

Very good to see time series result! You may expand your results by using Singular Spectrum Analysis

1 次回应

查看更多评论

要查看或添加评论，请登录

Gustavo Sánchez Hurtado的更多文章

Training Restricted Coulomb Energy (RCE) classifiers in Python

2023年5月21日

Training Restricted Coulomb Energy (RCE) classifiers in Python

The RCE (Restricted Coulomb Energy) classifiers rely on the identification of nearest training examples, based on the…

3 条评论
How does the Fourier transform look in 2D?

2023年5月6日

How does the Fourier transform look in 2D?

The Fourier transform can be difficult to understand, especially for those who are not familiar with advanced…
Change point detection based on spectral residual and CNNs

2023年4月30日

Change point detection based on spectral residual and CNNs

In some applications we need to identify instants where the statistical properties of a time series (e.g mean…
Anomaly detection using the Minimum Covariance Determinant (MCD) method

2023年4月23日

Anomaly detection using the Minimum Covariance Determinant (MCD) method

Assume we need to detect anomalies in Gaussian-distributed data or at least with an unimodal, symmetric distribution…
Trajectory prediction using Extended Kalman Filter (EKF) training

2023年4月16日

Trajectory prediction using Extended Kalman Filter (EKF) training

Trajectory prediction is one the classic problems in estimation and control theory. In this note we follow the approach…
Time series classification using LibSVM

2023年4月9日

Time series classification using LibSVM

It is possible to use LibSVM for time series classification, based on the raw previous values or on some set of…

1 条评论
How long should be the sliding window for time series classification?

2023年4月1日

How long should be the sliding window for time series classification?

It is well-known that Sliding Window Length (SWL) directly affects classification performance. However, it is difficult…
Time series anomaly detection based on ARMA model in C#

2023年3月18日

Time series anomaly detection based on ARMA model in C#

In some cases, it can be advantageous to use languages such as C++ or C# for numerical computing, as assigning data…

5 条评论
Bode-like plot for NN classifiers

2023年3月11日

Bode-like plot for NN classifiers

Inspired by papers like this one: I decided to run the following experiment: 1)Train an NN (Sklearn - MLPRegressor)…

1 条评论
Handling missing values in time series

2023年2月25日

Handling missing values in time series

In this note, we explore different methods to handle missing values in time series, represented in this example as…

See all articles

Comparing two univariate noisy time series of same length and sampling time

Gustavo Sánchez Hurtado

Award-Winning Engineer, Researcher & Educator | Digital Transformation: Control Systems, IoT, and Machine Learning | PLC/SCADA programmer | Python/MATLAB | Node Red | Global Speaker, Author & Podcaster

Gustavo Sánchez Hurtado的更多文章

社区洞察

其他会员也浏览了

Markov Decision Process - (with a logistics example)

A benchmark for signal classification inspired by the "two moons" problem

Embracing Machine Learning for Predictable Portfolios

Anomaly detection based on linear filtering

?? Day 134 of 365: Introduction to Feature Selection ??

The Stochastic Assumptions made simple

Predicting the hailstone sequence using a Temporal Fusion Transformer (Pytorch)

SVMs versus Logistic Regression

Linear Regression Model Assumption

A benchmark for signal classification - part 2

Gustavo Sánchez Hurtado的更多文章

Training Restricted Coulomb Energy (RCE) classifiers in Python

How does the Fourier transform look in 2D?

Change point detection based on spectral residual and CNNs

Anomaly detection using the Minimum Covariance Determinant (MCD) method

Trajectory prediction using Extended Kalman Filter (EKF) training

Time series classification using LibSVM

How long should be the sliding window for time series classification?

Time series anomaly detection based on ARMA model in C#

Bode-like plot for NN classifiers

Handling missing values in time series

社区洞察

其他会员也浏览了

Markov Decision Process - (with a logistics example)

A benchmark for signal classification inspired by the "two moons" problem

Embracing Machine Learning for Predictable Portfolios

Anomaly detection based on linear filtering

?? Day 134 of 365: Introduction to Feature Selection ??

The Stochastic Assumptions made simple

Predicting the hailstone sequence using a Temporal Fusion Transformer (Pytorch)

SVMs versus Logistic Regression

Linear Regression Model Assumption

A benchmark for signal classification - part 2