登录查看更多内容

What’s a Control Group & Why Do I Need One?

Momchil Kyurkchiev ?? GDC

Chief Strategy Officer at CleverTap / Co-founder of Leanplum (acquired by CleverTap)

发布日期: 2016年11月7日

Control groups are at the heart of mobile A/B testing, but their importance is often understated. Failure to use control groups correctly could cost you revenue.

When you set up an A/B test, usually you’re trying to answer a question: “should I do A or B?”

Let’s say you want to increase user retention by sending a push notification to people who have been dormant for a few days. You’re considering personalizing the message by reminding the person of how long it’s been since they signed in. Your hypothesis is that personalizing will increase open rates, but you want to test it against the less-personalized message?—?to confirm your theory will have the intended effect.

But there’s always a third option. The question isn’t “should I do A or B?”?—?it’s “should I do A, B, or nothing at all?”

That’s where control groups come in. Doing nothing is a decision in itself?—?and sometimes it’s the right decision. The only way to be sure that your changes aren’t doing more harm than good is to test everything against a control group. Let’s go over a few examples to see how control groups can be used for more accurate A/B testing.

Control Groups in an A/B Test

To begin, let’s take a look at Leanplum’s A/B testing screen.

A control group and a variant in Leanplum’s A/B testing dashboard.

As you can see, the first group is called “Control” by default, and its variables are grayed out. You can tweak the first variant as much as you’d like, but the control group’s variables are pre-set based on your app’s code. It represents the current version of your app.

Now, if you’re in a rush to send a message and you want to test two versions of it, it’s tempting to work around this restriction. You could always adjust your app variables so that your control group represents one version of the proposed message, and your first variant represents the other version. But the results of that test would be unreliable.

It might be slower, but control groups are the only way to guarantee that you’re not doing more harm than good. For an A/B test, that means you can only test one potential change at a time, because the A will be the control group.

Control Groups in a Multivariate Test

What about multivariate tests? If you’re looking to cover more ground with each test, multivariate tests (MVTs) aren’t a bad option. Here’s what a MVT would look like in Leanplum:

In addition to the control group, which in this example won’t receive a message, we’re now testing three different versions of the same message.

This can feel like a best-of-both-worlds scenario. You’re maintaining your data’s accuracy by including a control group, but you’re also working quickly by testing several variants at once.

Theoretically, multivariate tests allow for faster iterations. But note the difference under the “Time” heading. While this MVT is expected to take eight days to reach statistical significance, the A/B test up above is only expected to take four days! What does this mean?

An experiment isn’t deemed complete until it identifies a given degree of change?—?say, five percent?—?with a given degree of certainty. A smaller change requires a smaller margin of error to detect, so the test will take longer to complete.

The longer the experiment runs, the more samples it collects. To pick an extreme example, what would happen if our push notification were only sent to four users?

If two recipients clicked on variant A while only one recipient clicked on variant B, the results would show that the first version had a 2x higher open rate.

Of course, this statistic is misleading?—?the sample size is so small that the difference in open rates may have been due to random chance. We would need to collect more samples before comparing the two open rates.

Leanplum’s time estimator calculates roughly how long an A/B test will take to complete.

Furthermore, the degree of change affects the completion time.

If a metric grows from 20 percent to 25 percent, both the control and the variant are allowed just under a 2.5 percent margin of error. This means that even if the actual results were off by a couple of percentage points, the two ranges would not overlap, so the difference was not solely the result of random change.

Whereas a change from 20 percent to 21 percent would demand a less than0.5 percent margin of error, so more samples would be required.

In the following example from the Leanplum dashboard, the margin of error is represented by the shaded area around each line. Since the ranges rarely touch, we can conclude that, on average, the “New Message” variant offers a statistically significant improvement over the control group.

The more variants you add to an experiment, the longer it will take to complete. There’s a time and a place for multivariate tests, but it is incorrect to assume that fitting more variants into a single test is inherently faster than testing one variant at a time.

For tests on small segments, you might not have enough users to feasibly perform a MVT. In these situations, an A/B test will provide quicker and more stable results.

The Only Times Control Groups Are a Bad Idea

Control groups are a bad idea when the content is urgent. Statistical accuracy is important, but what if there’s an important app or account update that users need to know about? Using a control group means that some users won’t receive this urgent update until later.

In situations where you have to send a message, such as a terms of service update, the third option of not doing anything isn’t really an option. Therefore, there’s no use in testing message copy against a control group?—?you’ll have to send the message anyway, even if the control group wins.

Overview of the A/B test interface.

The only thing worse than not A/B testing at all is drawing the wrong conclusions from your tests. With this information in mind, you can skip the beginner’s mistake of ignoring control groups.

Of course, there’s more to A/B testing and multivariate testing than control groups. For a more holistic view, you can read up on Leanplum’s mobile A/B testing feature, or contact us for a full demo.

—

Leanplum is the most complete mobile marketing platform, designed for intelligent action. Our integrated solution delivers meaningful engagement across messaging and the in-app experience. We work with top brands such as Expedia, Tesco, and Lyft. Schedule your personalized demo here.

要查看或添加评论，请登录

Momchil Kyurkchiev ?? GDC的更多文章

AI in Gaming – the Future of Player Engagement?

2024年3月28日

AI in Gaming – the Future of Player Engagement?

These days, you can't walk down the street without hearing about AI. It’s the latest buzzword but for good reason.

1 条评论
Three Years Later: The Mobile Gaming Industry Post-ATT

2024年3月11日

Three Years Later: The Mobile Gaming Industry Post-ATT

With the third release anniversary of Apple's App Tracking Transparency (ATT) coming up in April, it seems fitting to…

3 条评论
[TEDxVitosha Recap] A Corporate Culture of Happiness

2018年4月18日

[TEDxVitosha Recap] A Corporate Culture of Happiness

Back in January, I fulfilled a lifelong dream and delivered my first TED talk at TEDxVitosha in Sofia, Bulgaria. My…

3 条评论
Learning From The Past: Personalization In A Digital World

2018年4月3日

Learning From The Past: Personalization In A Digital World

Marketing has always been about connecting the best product or service to the right person. Once upon a time — before…
Here Are the Biggest Black Friday Learnings?—?And Strategies to Make More $$$

2017年11月27日

Here Are the Biggest Black Friday Learnings?—?And Strategies to Make More $$$

This year’s Black Friday shattered all the records. But nothing made a bigger wave than mobile, where almost two-thirds…
Series D, Growth & the Next-Gen Marketing Cloud

2017年11月8日

Series D, Growth & the Next-Gen Marketing Cloud

Today, Leanplum is absolutely thrilled to announce $47 million in Series D funding led by our new investor, Norwest…

11 条评论
3 Facts About Mobile App Revenue You Need to Know

2017年8月28日

3 Facts About Mobile App Revenue You Need to Know

Did you know that on average, 90 percent of mobile shopping carts are abandoned? For retailers around the world, the…
5 Ways to Leverage In-App Messages For Your Mobile UX

2017年6月27日

5 Ways to Leverage In-App Messages For Your Mobile UX

There’s little doubt that in-app messages add value for mobile marketers. With consumers spending 85 percent of their…

1 条评论
App Editor: The Code-Free Solution to Optimize Mobile Apps

2017年6月14日

App Editor: The Code-Free Solution to Optimize Mobile Apps

In mobile app businesses, multiple teams touch the core product. Marketing, design, product, and engineering all have a…

1 条评论
5 Mobile App Strategies That Still Work in 2017

2017年5月9日

5 Mobile App Strategies That Still Work in 2017

The mobile app strategies that work in 2017 are not the ones that dominated in years past. Mobile devices have already…

1 条评论

See all articles

What’s a Control Group & Why Do I Need One?

Momchil Kyurkchiev ?? GDC

Chief Strategy Officer at CleverTap / Co-founder of Leanplum (acquired by CleverTap)

Control Groups in an A/B Test

Control Groups in a Multivariate Test

The Only Times Control Groups Are a Bad Idea

Momchil Kyurkchiev ?? GDC的更多文章

社区洞察

其他会员也浏览了

16 Major Challanges Faced By Testers While Testing a Web Application

27 Ways for ROI Using Mobile Application Test Automation, Part 2

Why am I being asked to pay for development work to keep A/B testing?

Testing Mobile Applications

Mobile Application Testing

Testing scenarios of Admin Panel syncing with an app

Best Practices In Mobile Application Testing

7 Criteria for Selecting Mobile Application Testing Tools For Your Business

Debugging Google Tag Manager on mobile, oh my!

Control Groups in an A/B Test

Control Groups in a Multivariate Test

The Only Times Control Groups Are a Bad Idea

Momchil Kyurkchiev ?? GDC的更多文章

AI in Gaming – the Future of Player Engagement?

Three Years Later: The Mobile Gaming Industry Post-ATT

[TEDxVitosha Recap] A Corporate Culture of Happiness

Learning From The Past: Personalization In A Digital World

Here Are the Biggest Black Friday Learnings?—?And Strategies to Make More $$$

Series D, Growth & the Next-Gen Marketing Cloud

3 Facts About Mobile App Revenue You Need to Know

5 Ways to Leverage In-App Messages For Your Mobile UX

App Editor: The Code-Free Solution to Optimize Mobile Apps

5 Mobile App Strategies That Still Work in 2017

社区洞察

其他会员也浏览了

16 Major Challanges Faced By Testers While Testing a Web Application

27 Ways for ROI Using Mobile Application Test Automation, Part 2

Why am I being asked to pay for development work to keep A/B testing?

Testing Mobile Applications

Mobile Application Testing

Testing scenarios of Admin Panel syncing with an app

Best Practices In Mobile Application Testing

7 Criteria for Selecting Mobile Application Testing Tools For Your Business

Debugging Google Tag Manager on mobile, oh my!