The Mirage of Algorithmic Trading: Deciphering Genuine Strategies from Randomness
With the rise of online resources, there's no shortage of software tools that promise easy algorithmic trading strategy design. Many of these platforms employ advanced techniques like Genetic Algorithms and Data Mining, luring beginners into a deceptive world of seemingly perfect trading strategies.
Imagine a novice, thrilled by their initial dives into such platforms, discovering an equity curve that's almost linear—what they might perceive as the elusive 'holy grail' of trading. But here's the catch: due to inherent data-mining biases, distinguishing genuinely intelligent strategies from random ones becomes a challenging endeavor.
The concept of randomness, especially in the domain of algorithmic trading and data mining, might seem a bit abstract at first. But let's try to decode this using a classic analogy involving our simian friends.
Imagine you've assembled a colossal troop of 1 billion monkeys. Now, hand each of these monkeys a keyboard and allow them to jump and play on it for an hour. With this vast number of trials, it's statistically probable that at least one monkey might inadvertently type out the phrase "to be or not to be."
Now, the essential question arises: Does this isolated occurrence imply that the monkey possesses the literary prowess of Shakespeare? The logical answer is a resounding no. The monkey's typing is purely a product of chance, not intent or intelligence.
This scenario is a vivid representation of the phenomenon known as 'multiple selection bias.' When given a large enough sample size or enough iterations, there's a higher likelihood of random occurrences aligning with what we perceive as significant or meaningful patterns, even if they arise purely from chance.
In the realm of trading and algorithmic strategies, this bias emerges when numerous strategies or models are tested. Among countless variations, one or a few might appear exceptionally successful purely by coincidence, not because they inherently possess superior intelligence or effectiveness. Just as our hypothetical monkey isn't a budding Shakespearean playwright, these seemingly promising strategies might be nothing more than fortunate anomalies in a sea of randomness. Recognizing and accounting for this bias is crucial to avoid being misled by such deceptive appearances.
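The monkey experiment is easy to reproduce numerically. The sketch below (illustrative parameters, not a real backtest) simulates thousands of purely random "strategies" and shows that the best of them still ends up with an impressive-looking final equity, roughly in line with a crude extreme-value estimate:

```python
import numpy as np

rng = np.random.default_rng(42)

# Simulate 10,000 "strategies", each a sequence of 1,000 random +1/-1 trades.
n_strategies, n_trades = 10_000, 1_000
trades = rng.choice([-1, 1], size=(n_strategies, n_trades))
equity = trades.cumsum(axis=1)

# The best curve by final equity looks impressive, yet every strategy is noise.
best = equity[:, -1].max()
# Rough extreme-value estimate: sigma * sqrt(2 * ln N), with sigma = sqrt(n_trades)
expected_best = np.sqrt(2 * n_trades * np.log(n_strategies))
print(f"best final equity: {best:.0f} (typical extreme ~ {expected_best:.0f})")
```

The point is that the winner was not selected for skill; it was selected for luck, and with enough candidates, luck is guaranteed to show up.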
Let's take a hypothetical scenario where you're using such a tool to devise a trading strategy for BTCUSD. After numerous iterations, both manual and automated, you're presented with a promising equity curve based on roughly 10,000 trades in both in-sample and out-of-sample datasets.
However, this apparent success comes after sifting through myriad combinations of entry and exit strategies—ranging from hundreds to even trillions. While the resultant equity curve might seem ideal, there's an underlying suspicion: Is this strategy a mere product of randomness, exacerbated by the bias from countless comparisons?
For instance, consider a situation where a developer believes that ramping up the number of trades—say to 20,000—diminishes randomness. If both the in-sample and out-of-sample performances appear robust, does it genuinely indicate a lower likelihood of randomness?
Surprisingly, the truth might be starkly different. Both these equity curves could merely be the results of a simple coin toss experiment, with outcomes of +1 for heads and -1 for tails. Even the impressive second curve might be a product of a few random simulations. The lesson here? Sometimes, sheer luck can conjure up compelling equity curves.
```python
import numpy as np
import matplotlib.pyplot as plt

# Keep tossing a fair coin 20,000 times (+1 heads, -1 tails) until the
# cumulative "equity" ends at least 500 up -- luck alone gets us there.
while True:
    rand = np.random.choice([-1, 1], size=20_000).cumsum()
    if rand[-1] >= 500:
        break

plt.figure(figsize=(12, 7.5))
plt.plot(rand)
plt.axvline(x=16_000, color='red')  # notional in-sample / out-of-sample split
plt.show()
```
The challenge lies in discerning: Can an attractive equity curve, especially when juxtaposed against a sea of mediocre ones, genuinely indicate a smart underlying algorithm, or is it just another random occurrence? Answering this counterintuitive question isn't straightforward.
This coin toss analogy underlines a critical pitfall: the propensity to be misled by randomness when assessing numerous equity curves. To combat biases stemming from overfitting, data snooping, and selection, one needs a depth of understanding that often eludes the average developer. In fact, methods to scrutinize strategies for potential biases are as crucial, if not more so, than the techniques used to create them. Recognizing these biases and their implications is central to developing a genuine trading edge.
Strategies to Discern Genuine Trading Systems from Randomness
When diving deep into trading strategies, the sheer volume of data and parameters can blur the lines between genuine intelligence and randomness. To decipher this, it's crucial to follow some guidelines:
1. Consistency in Equity Curves:
When faced with multiple equity curves, inquire into the process that generated them. If it’s random and produces different 'best' curves each time, it might just be a fluke. Remember, a market cannot have an infinite number of genuine edges.
2. Test Exit Logic:
Strip the strategy of its exit logic and apply a basic profit target and stop-loss. If the system flounders, its entry-timing intelligence is probably lacking. Overly complex exit strategies, like trailing stops, might just be overfitting past data, making them vulnerable to future market changes.
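One minimal way to run this check is to replay only the strategy's entries under a fixed profit target and stop-loss. The sketch below is a toy illustration: the `fixed_exit_returns` helper, the random-walk price series, the placeholder entry signal, and the 2%/1% levels are all assumptions, not the article's system.

```python
import numpy as np

def fixed_exit_returns(prices, entries, target=0.02, stop=0.01):
    """Replay long entries with a fixed profit target and stop-loss.

    `prices` is a 1-D array of closes; `entries` holds entry bar indices.
    The 2% target / 1% stop levels are illustrative choices.
    """
    returns = []
    for i in entries:
        entry = prices[i]
        for p in prices[i + 1:]:
            change = (p - entry) / entry
            if change >= target:      # profit target hit
                returns.append(target)
                break
            if change <= -stop:       # stop-loss hit
                returns.append(-stop)
                break
    return np.array(returns)

# Example with a random walk standing in for price data.
rng = np.random.default_rng(0)
prices = 100 * np.exp(rng.normal(0, 0.01, size=5_000).cumsum())
entries = np.arange(0, 4_000, 50)     # placeholder entry signal
rets = fixed_exit_returns(prices, entries)
print(f"mean return per trade: {rets.mean():.4%}")
```

If the strategy's real entries cannot beat this blunt exit scheme, the apparent edge likely lived in a curve-fit exit rather than in entry timing.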
3. Beware of Over-Optimized Indicators:
A strategy that leans heavily on indicators with meticulously tuned parameters to achieve a desired equity curve might be a red flag. The more parameters it relies on, the more it's susceptible to data-mining bias, potentially making the strategy nothing more than randomness masquerading as intelligence.
4. Single vs. Multiple Runs:
If the software employed spits out the final result after several runs on the in-sample data, be cautious. Using in-sample data repetitively with various indicator and rule combinations can lead to data-mining bias. The strategy may look good on paper, but it might just be good luck.
5. Avoid Data-Snooping:
If after an out-of-sample test you find yourself adjusting strategy parameters, you're introducing potential overfitting, selection bias, and data-snooping bias. If this out-of-sample data is then used for further tweaking, it essentially becomes in-sample data, rendering any further out-of-sample tests moot.
6. System Parameter Permutation (SPP) Check:
Always verify if the median return of the System Parameter Permutation is positive or negative. This can be an additional layer of confirmation regarding the strategy's viability.
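A bare-bones version of this check evaluates the strategy across every parameter combination and inspects the median of the resulting distribution. The moving-average crossover below is a stand-in strategy, and the parameter grids are arbitrary; they only illustrate the mechanics of the permutation sweep.

```python
import itertools
import numpy as np

rng = np.random.default_rng(1)
prices = 100 * np.exp(rng.normal(0.0002, 0.01, size=3_000).cumsum())

def ma_cross_return(prices, fast, slow):
    """Total log return of a toy moving-average crossover (illustrative only)."""
    log_p = np.log(prices)
    fast_ma = np.convolve(prices, np.ones(fast) / fast, mode='valid')
    slow_ma = np.convolve(prices, np.ones(slow) / slow, mode='valid')
    n = min(len(fast_ma), len(slow_ma))
    signal = (fast_ma[-n:] > slow_ma[-n:]).astype(float)  # long when fast > slow
    daily = np.diff(log_p[-n:])
    return (signal[:-1] * daily).sum()

# Evaluate every permutation of the two parameters, then inspect the median.
fast_grid, slow_grid = range(5, 30, 5), range(40, 120, 20)
results = [ma_cross_return(prices, f, s)
           for f, s in itertools.product(fast_grid, slow_grid)]
median = np.median(results)
print(f"SPP median return: {median:+.4f}")
```

If only a sliver of the parameter space is profitable and the median is negative, the "best" combination is likely a data-mining artifact rather than a robust edge.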
7. Noise Testing:
When validating your strategy, it's pivotal to test its resilience against noise. By deliberately introducing noise into your original OHLC (Open, High, Low, Close) data and observing the subsequent return differences, you can gauge the system's stability. A significant deviation between the noisy data returns and the original equity returns might indicate an underlying data-mining bias. Essentially, a robust system should be relatively immune to minor data perturbations.
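A simple form of this test perturbs the price series many times and re-runs the strategy on each copy. In the sketch below, the `strategy_return` placeholder, the synthetic close series, and the 0.1% noise level are all illustrative assumptions; in practice you would perturb the full OHLC data of the system under test.

```python
import numpy as np

rng = np.random.default_rng(2)

def strategy_return(close):
    """Placeholder strategy: long when the close is above its 20-bar mean."""
    ma = np.convolve(close, np.ones(20) / 20, mode='valid')
    daily = np.diff(np.log(close[19:]))
    signal = (close[19:-1] > ma[:-1]).astype(float)
    return (signal * daily).sum()

close = 100 * np.exp(rng.normal(0.0003, 0.01, size=2_000).cumsum())
base = strategy_return(close)

# Re-run the strategy on 200 noise-perturbed copies of the close series;
# the 0.1% noise level is an illustrative choice.
noisy = [strategy_return(close * (1 + rng.normal(0, 0.001, size=close.size)))
         for _ in range(200)]
spread = np.std(noisy)
print(f"original: {base:+.3f}, noisy mean: {np.mean(noisy):+.3f}, std: {spread:.3f}")
```

If the original return sits far outside the distribution of noisy returns, the system is probably keyed to incidental features of the historical data.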
8. Variance in Trade Returns:
For strategies that employ a fixed take-profit/stop-loss mechanism, the distribution of trade returns tends to be binary, straying from the norm. To assess the strategy's adaptability, inject variance into the final trades' returns: center the distribution around the average trade, or, if entries and exits are condition-based, model the returns with a normal distribution and inject, for instance, a 5% variance. On resampling, a healthy strategy should depict an upward-trending 'spaghetti' chart. If not, the strategy's future may be less promising than presumed.
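The resampling check can be sketched as follows. The per-trade returns here are synthetic stand-ins for a real backtest's trade list, and the 5% variance injection and 100 paths are illustrative choices:

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(3)

# Suppose the backtest produced these per-trade returns (illustrative data).
trade_returns = rng.normal(0.001, 0.02, size=500)

# Resample trades with replacement and inject a 5% relative variance,
# then plot the resulting "spaghetti" of equity curves.
n_paths = 100
plt.figure(figsize=(12, 7.5))
curves = []
for _ in range(n_paths):
    sample = rng.choice(trade_returns, size=trade_returns.size, replace=True)
    sample *= 1 + rng.normal(0, 0.05, size=sample.size)  # 5% variance injection
    curves.append(sample.cumsum())
    plt.plot(curves[-1], alpha=0.3)

share_up = np.mean([c[-1] > 0 for c in curves])
print(f"fraction of resampled paths ending positive: {share_up:.0%}")
```

A strategy whose edge survives this shuffling will show most paths trending up; one whose equity curve depends on a lucky ordering of trades will not.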