Exploring Returns: Beyond Normal with Generalized Error Distribution
Jakub Polec
20+ yrs in Tech & Finance & Quant | ex-Microsoft/Oracle/CERN | IT / Cloud Architecture Leader | AI/ML Data Scientist | SaaS & Fintech
Recently, I engaged with an intriguing article by Graham Giller, which critically examines a prevalent assumption in financial analysis: the 'null hypothesis' that daily returns of major market indices, such as S&P500, conform to a normal distribution. This exploration led me to reconsider and explore various fundamental concepts in financial statistics, particularly the potential of alternative distributions like the generalized error distribution in describing market price changes
In both theory and practice, the rate of return is a pivotal concept in finance, playing a crucial role in investment decision-making. Typically, investors' decisions to buy or sell securities are based on the probability of achieving a certain profit level or incurring a loss. In practice, it's common to assume that rates of return follow a normal distribution. This assumption is not merely theoretical; it's embedded in various financial models and decision-support tools, such as the Black-Scholes stock option valuation method, the Capital Asset Pricing Model (CAPM), and certain Value at Risk (VaR) methodologies.
What's fascinating is that moving away from this assumption puts into question the reliance on two sigma as a significant deviation from the mean, as well as the rationality of the commonly used 95% confidence level criteria in financial analysis.
My Analysis Using the Generalized Error Distribution (GED):
In my analysis, following Graham Giller's article, I decided to fit the index and stocks data to the Generalized Error Distribution (GED), which is a statistical distribution that generalizes the normal (Gaussian) distribution.
The GED offers a more flexible approach for modeling financial returns, particularly for data that exhibit 'fat tails' - a common characteristic in financial markets.
Why use GED for Financial Returns?
Here, the parameters s, ν, and λ are defined as follows:
This flexibility is crucial for capturing the fat tails and varying levels of kurtosis observed in real-world financial data. For risk management, accurately modeling the tails of the distribution is vital, and the GED's ability to capture tail risk more effectively than the normal distribution is particularly valuable.
The GED application:
The easiest way to utilize the GED is through the scipy.stats package and its gennorm function, which includes just one more parameter than the Normal distribution (be careful with a slightly different naming).
领英推荐
Testing for Normality:
To investigate the normality of rates of return, I set up two hypotheses: the null hypothesis (conformity with a normal distribution) and the alternative hypothesis (non-conformity). To perform parametric tests for normality on stock data, you can use either the Wald test or other tests such as Shapiro-Wilk, Anderson-Darling, or Kolmogorov-Smirnov. These tests have different assumptions and methodologies. The K-S and A-D tests are used to test the goodness of fit of a distribution (e.g., testing for normality), while the Wald test is used for hypothesis testing in statistical models.
The Wald Test:
The Wald test relies on the asymptotic properties of maximum likelihood estimators. It tests whether a parameter of the population distribution is equal to a specified value. The Wald Test Statistic is a measure of how much the estimated parameter (like the mean) of your data deviates from a hypothesized value under the null hypothesis (typically the population parameter).?
A low Wald test P-Value typically indicates strong evidence against the null hypothesis. In the context of normality testing, it suggests that the data does not follow a normal distribution. A p-value close to 0 (usually below a threshold like 0.05) suggests that the observed data significantly deviate from the hypothesized distribution—in this case, a normal distribution.
The Kolmogorov-Smirnov (K-S) test:
The Kolmogorov-Smirnov (K-S) test is a non-parametric test used to determine if a sample comes from a specific distribution. In the context of testing for normality in financial returns, the K-S test can be applied to assess whether the observed data deviates significantly from a normal distribution. The test statistics measures the maximum difference between the empirical distribution function of the data and the cumulative distribution function of a normal distribution. A higher value of the test statistic indicates a greater deviation from normality. Similarly, a low p-value resulting from the K-S test indicates a rejection of the null hypothesis of normality, providing evidence that the observed data significantly deviates from a normal distribution.
The results from the analysis:
Here are the plots of S&P500 (^GSPC) and some growth stocks:
From the plots, one can see that the results seem quite convincing. Rejecting the null hypothesis is not enough; as usual, we need a suitable alternative. The GED, while not perfect, provided a much better fit for describing the data, especially during periods of market stress or volatility.
It's important to remember that it's not enough to reject a Null Hypothesis; we also have to have something to replace it with. The GED, while not perfect, clearly does a much better job of describing the data.
Therefore, it's crucial to consider significant changes in the values of the parameters describing the GED, depending on the period under analysis and its length. The analysis of changes in developing parameters of GED may be used as an element of technical analysis, within the scope of predicting changes in stock market cycles.
Observations in Market Trends:
I have read other scientific papers and observed the GED applicability for different market regimes, noting discrepancies. During bear market periods, the level of the p-value was significantly higher than in bull market periods. This observation could be crucial for future market predictions and risk assessments.
The learning and future use:
Join?quantjourney.substack.com?for more.
La Sorbonne
1 年Returns during the day go up and down. If so, they follow cycles, circles and PI. It also means the market is not random nor complex, just orderly while following PI and there is no binary event. That comes from a Penn Station Homeless linked to 3B. More will come later.
Econometrician (The original data science)
1 年José Donizetti de Lima