
Stat-sig on a Shoestring Traffic Budget

Alexey Komissarouk

Growth Engineering Advisor

Published: July 18, 2023

Status: Early Draft

There comes a time in every company’s life when the CEO says “it’s growth time.” Now, you’re getting ready to run lots of experiments, until you realize “actually, we don’t quite have the numbers.”

“Given our traffic, what kind of a win could we reasonably detect within a couple of weeks?”

[Image: A/B test calculator, https://cxl.com/ab-test-calculator/]

20% minimum? That is a crazy high win. Even 10% would be a huge winner. Finding a 20% win is very rare.
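To get a feel for where a number like that comes from, here is a rough sketch of the calculation, not the calculator’s actual implementation. It uses the standard normal approximation for a two-sided test on two conversion rates. The traffic (1,000 visitors a day), the 5% baseline conversion rate, and the 80% power are hypothetical values picked for illustration.

```python
from statistics import NormalDist


def relative_mde(baseline_rate, visitors_per_variant, alpha=0.05, power=0.8):
    """Approximate smallest relative lift a two-variant test can detect.

    Normal approximation, two-sided test, and the baseline variance is
    assumed for both arms (a simplification).
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_power = NormalDist().inv_cdf(power)
    # Standard error of the difference between two conversion rates.
    se = (2 * baseline_rate * (1 - baseline_rate) / visitors_per_variant) ** 0.5
    absolute_mde = (z_alpha + z_power) * se
    return absolute_mde / baseline_rate


if __name__ == "__main__":
    daily_visitors = 1000   # hypothetical traffic
    baseline_rate = 0.05    # hypothetical conversion rate
    for days in (7, 14, 21):
        per_variant = daily_visitors * days / 2  # 50/50 split
        mde = relative_mde(baseline_rate, per_variant)
        print(f"{days} days: detectable relative lift is about {mde:.1%}")
```

Plug in your own traffic and baseline; the point is only that the detectable lift shrinks with the square root of the sample size, so doubling traffic does not halve it.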

So, what do you do - do you give up? Probably not. Instead, you find ways to run the experiments you’re planning for anyway, but with an update in methodology.

Here’s how.

Part 1: Squint Hard Enough: If you’re only missing your stat-sig target by a bit

Take bigger swings & batch by theme

Say you have a hypothesis that positioning your product more like X is likely to help with conversion. You come up with 4 potential changes to experiment with & see if they help.

Great. Now combine them into one mega-experiment & run that instead.

If you can only detect a 10% win, only take 10% swings. When prioritizing your experiment backlog, de-prioritize (or batch) ideas that wouldn’t have a big impact if they were successful.

However, when productionizing a batched win, it’s harder to tell which part of the change was actually effective. In the near term, this is tolerable - when the company gets to a larger scale and you revisit this theme, you can always tease out the underlying causes through follow-up experiments.

Use Fewer Variants: A/B/C/D become A/B

You need a certain number of visitors to both your “winning variant” and “control” to determine the result. Every variant you add reduces the oxygen flow to your winning variant.

If you’re tight on traffic, stick to two variants.
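A back-of-the-envelope sketch of that oxygen problem: with an even split, the visitors each variant needs stays the same, so the test duration grows with the number of variants. The function names and the numbers (1,000 visitors a day, 5% baseline, a 10% relative lift) are hypothetical, and the sketch ignores any multiple-comparison correction, which would make extra variants even more expensive.

```python
from math import ceil
from statistics import NormalDist


def visitors_per_variant(baseline_rate, relative_lift, alpha=0.05, power=0.8):
    """Visitors needed in each arm (normal approximation, two-sided test)."""
    z = NormalDist().inv_cdf(1 - alpha / 2) + NormalDist().inv_cdf(power)
    delta = baseline_rate * relative_lift          # absolute lift to detect
    variance = 2 * baseline_rate * (1 - baseline_rate)
    return ceil(z ** 2 * variance / delta ** 2)


def days_needed(num_variants, daily_visitors, baseline_rate, relative_lift):
    """Days to fill every arm when traffic is split evenly across variants."""
    per_variant = visitors_per_variant(baseline_rate, relative_lift)
    return ceil(per_variant * num_variants / daily_visitors)


if __name__ == "__main__":
    for variants in (2, 3, 4):
        days = days_needed(variants, daily_visitors=1000,
                           baseline_rate=0.05, relative_lift=0.10)
        print(f"{variants} variants: about {days} days")
```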


However, this might slow down your pace of learning, until you realize you can also:

Run in Parallel: A/B/C/D becomes A/B, A/C, and A/D

If you can isolate the assumption behind each variant, and the variants are independent, you can run multiple A/B tests at once, each with two variants. This approach won’t let you try 4 versions of a hero header, but you could factor your header, subheader, and call-to-action changes out into separate experiments. This way, every hypothesis gets 50% of traffic, enough to see how it performs relative to control.

However, you might run into experiment interactions (i.e., C only wins when B is also on). This happens from time to time; the easiest solution is to analyze the experiments together and identify interactions at that point. After all, there’s nothing wrong with shipping both B and C.
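One simple way to “analyze the experiments together” is to break results out by the four combinations of two parallel tests and check whether C’s lift differs depending on whether B is on. The sketch below is one possible approach rather than a prescribed method; the function name, the cell layout, and the counts are all made up for illustration.

```python
from math import sqrt
from statistics import NormalDist


def interaction_z_test(cells):
    """Check whether C's lift depends on B.

    cells maps (in_b, in_c) -> (visitors, conversions), where in_b and
    in_c are 0 for control and 1 for the treatment of each experiment.
    Returns the interaction estimate and a two-sided p-value.
    """
    rates = {key: conv / n for key, (n, conv) in cells.items()}
    # Lift of C when B is on, minus lift of C when B is off.
    interaction = (rates[(1, 1)] - rates[(1, 0)]) - (rates[(0, 1)] - rates[(0, 0)])
    # The four cells are independent, so their variances add.
    se = sqrt(sum(rates[k] * (1 - rates[k]) / cells[k][0] for k in cells))
    z = interaction / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return interaction, p_value


if __name__ == "__main__":
    # Hypothetical counts: (visitors, conversions) per combination.
    cells = {
        (0, 0): (2500, 125),
        (0, 1): (2500, 130),
        (1, 0): (2500, 135),
        (1, 1): (2500, 170),
    }
    estimate, p = interaction_z_test(cells)
    print(f"interaction: {estimate:+.3f}, p-value: {p:.2f}")
```

Keep in mind that each cell only holds a quarter of the traffic, so on a shoestring budget this check will only catch large interactions.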

Run tests for longer: 7 days becomes 2-3 weeks

This is perhaps the most obvious strategy; if you run a test for longer, you collect more traffic and get a larger sample size, which means you can detect a smaller win with stat-sig.

However, especially at earlier stage companies, I rarely see the discipline to actually leave an experiment running longer than a couple of weeks. Inevitably, an executive will pop in and demand an update. “Trending positive?” they’ll say, “Great, let’s just ship it, it’s fine.” And it is fine, except now you’ve committed the cardinal sin of peeking, your likelihood of a false positive has gone up, and you have not truly “learned” anything with confidence.
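If the cost of peeking feels abstract, a small simulation makes it concrete. The sketch below runs A/A tests (both arms identical, so every “win” is a false positive) and compares checking once at the end with checking after every batch of traffic and stopping at the first significant result. The batch size, number of looks, and 10% conversion rate are arbitrary assumptions.

```python
import random
from math import sqrt
from statistics import NormalDist


def is_significant(conv_a, conv_b, n, alpha=0.05):
    """Two-sided pooled z-test for two arms with n visitors each."""
    pooled = (conv_a + conv_b) / (2 * n)
    se = sqrt(2 * pooled * (1 - pooled) / n)
    if se == 0:  # no conversions (or all conversions) so far
        return False
    z = (conv_b - conv_a) / n / se
    return abs(z) > NormalDist().inv_cdf(1 - alpha / 2)


def simulate_aa_tests(num_tests=500, looks=10, visitors_per_look=100,
                      rate=0.1, seed=0):
    """Return (false positive rate with peeking, rate with one final look)."""
    rng = random.Random(seed)
    peeking_hits = fixed_hits = 0
    for _ in range(num_tests):
        conv_a = conv_b = 0
        ever_significant = significant = False
        for look in range(1, looks + 1):
            # Both arms share the same true rate: any difference is noise.
            conv_a += sum(rng.random() < rate for _ in range(visitors_per_look))
            conv_b += sum(rng.random() < rate for _ in range(visitors_per_look))
            significant = is_significant(conv_a, conv_b, look * visitors_per_look)
            ever_significant = ever_significant or significant
        peeking_hits += ever_significant   # shipped at the first "win"
        fixed_hits += significant          # only looked at the end
    return peeking_hits / num_tests, fixed_hits / num_tests


if __name__ == "__main__":
    peeking, fixed = simulate_aa_tests()
    print(f"false positives when peeking at every look: {peeking:.1%}")
    print(f"false positives when looking once at the end: {fixed:.1%}")
```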

Aside: If you often find yourself shipping tests earlier than planned, consider switching from a fixed-sample frequentist methodology to one designed for interim looks, such as Bayesian or sequential statistics.

Get Comfortable with False Positives: p<0.05 becomes p<0.2

Another tolerable trade-off: in a world where a false positive is harmless, get comfortable raising your Type 1 error tolerance (i.e., how often you can live with a false positive) from its traditional α = 0.05 (a 5% false positive rate) to as high as α = 0.2 (a 20% false positive rate).

However, don’t use this approach for any significant changes, such as a new pricing strategy. Also, realize that your ability to trust your learnings from experiments is decreased, since there’s a greater chance your “insights” are now coming from randomness and not reality.
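To see what that trade buys you, here is a rough sketch of how the required sample size scales with α, holding the detectable effect and power fixed. It relies on the same normal approximation as most online calculators; the 80% power default and the function name are assumptions for illustration.

```python
from statistics import NormalDist


def relative_sample_size(alpha, power=0.8, reference_alpha=0.05):
    """Traffic needed at `alpha` as a fraction of traffic at `reference_alpha`.

    Two-sided test, normal approximation, same effect size and power.
    """
    def z_sum(a):
        return NormalDist().inv_cdf(1 - a / 2) + NormalDist().inv_cdf(power)

    # Required sample size is proportional to (z_alpha + z_power) squared.
    return (z_sum(alpha) / z_sum(reference_alpha)) ** 2


if __name__ == "__main__":
    for alpha in (0.05, 0.1, 0.2):
        share = relative_sample_size(alpha)
        print(f"alpha={alpha}: {share:.0%} of the traffic needed at alpha=0.05")
```

Under these assumptions, moving from α = 0.05 to α = 0.2 cuts the traffic you need to a bit over half, which is real but nowhere near an order of magnitude.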

All of the above work so long as the traffic you have is close to what you need. Other times, your traffic is off by an order of magnitude. What desperate measures do you take during desperate times?

Part 2: Desperate Measures is coming soon. Sign up at https://tinyletter.com/engineering-growth to be notified.

