Explained: Hypothesis Testing to a Marketer (Inferential Statistics - Part II)

Chandralekha Ghosh

General Manager Accountability at Omnicom Media Group | Cross-Media Measurement & Audit | Proficient in Data Analytics & Statistical Modeling | Passionate About Reducing Media Waste and Enhancing Client Satisfaction

Published: April 2, 2023

A new financial year has started. It's time we look at a hypothetical scenario.

Suppose, as a digital marketer, generating quality leads is one of your media KPIs and Cost Per Quality Leads (CPQL) is the metric to measure your success. You're responsible for generating a healthy CPQL (say below the industry average).

So here you are, presenting the CPQL reports for each sub-brand to your client. You have done a great job: the CPQLs for each sub-brand have gone down by 15%-20% compared to last year. As you can imagine, you're on cloud nine and proud of your achievement. Your client is really happy with the performance. So it's a happy ending then?

Nope, a bit is still left. The client has a third-party auditor who'd validate these numbers. They'll compare your CPQLs against the market average. They're using a pool of prices paid to generate similar quality leads for selected brands that have similar features to your brands. To save time and money, they calculated the market average on sample data. However, there's nothing to worry about: they ensured that all the data points are independent and the sample size is large enough to apply the Central Limit Theorem (CLT).

The auditors too are happy with your performance, as for most of the sub-brands, your CPQL is lower than the market average. However, the results differ for one sub-brand, where you paid a higher price to generate a quality lead compared to the market average. This is for the most expensive product which was launched last year and is under the scanner of your client. The imaginary numbers are:

Your CPQL for sub-brand-X- INR 1500/-

Pool Market Average CPQL for similar brands to brand X - INR 1450/-

So, you delivered costlier CPQL, INR 50 more than your competitors to generate a single quality lead.

The client was ok with the auditor's results, as this was only for one brand and overall you did a great job. However, you're not happy, and with your statistical knowledge you decided to challenge the status quo. You decided to perform a test.

So you have a question of interest: is the market average CPQL of all similar brands less than 1500/-? In other words, we're trying to figure out whether the market average CPQL for the entire population (the population mean) is greater than or equal to, or less than, the CPQL you achieved (1500/-).

Let's denote the market average CPQL for all similar brands i.e., our parameter of interest by μ.

Let's figure this out Step-by-Step.

Step 1- Forming Null and Alternative Hypothesis

Our null hypothesis is that the market average CPQL for the whole population is greater than or equal to 1500/-. In other words, under the null we're delivering a CPQL that is as good as or better than the market average. Because the alternative points in only one direction, this is a one-tailed test.

H0 : μ >=1500

Here goes the Alternative:

H1 : μ < 1500

The alternative states that we're less efficient than the market, because the market average CPQL is lower than our CPQL (1500/-).

Failing to reject the null theory would mean that the results are in our favor, that the market average CPQL is NOT lower than our CPQL.

Before we look at the data, we need to establish how much evidence we require against the null theory before we reject it. It's time to set the significance level for the test. The standard significance level is 5% and your client agrees to it.

What is the 5% significance level? It means we reject the null only when the results are so unusual that, if the null were true, we would see them (or something more extreme) no more than 5% of the time. In other words, it is the risk we're willing to tolerate of rejecting the null when it is in fact true.
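To make that 5% tangible, here's a small simulation sketch. It assumes a world where the null is exactly true (market average of 1500/-) and where CPQLs are normally distributed, and it borrows the sample size (25) and spread (180) that appear in Step 2 below. The normality assumption, the seed, the number of trials, and the cutoff of -1.711 (read from a t-table for 24 degrees of freedom) are illustrative choices, not part of the auditor's method.

```python
import math
import random

random.seed(42)  # fixed seed so the run is repeatable

MU_NULL = 1500    # boundary value of the null hypothesis (H0: mu >= 1500)
SIGMA = 180       # assumed population spread, borrowed from the sample SD in Step 2
N = 25            # sample size used in Step 2
T_CRIT = -1.711   # t-table cutoff for a 5% lower tail, 24 degrees of freedom
TRIALS = 10_000   # number of simulated audits

false_rejections = 0
for _ in range(TRIALS):
    # Draw one audit sample from a market where the null is true
    sample = [random.gauss(MU_NULL, SIGMA) for _ in range(N)]
    mean = sum(sample) / N
    sd = math.sqrt(sum((x - mean) ** 2 for x in sample) / (N - 1))
    t_stat = (mean - MU_NULL) / (sd / math.sqrt(N))
    # Count the samples that would wrongly lead us to reject the null
    if t_stat < T_CRIT:
        false_rejections += 1

false_rejection_rate = false_rejections / TRIALS
print(f"Rejected a true null in {false_rejection_rate:.1%} of simulated samples")
```

The printed rate should land close to 5%: that's the share of perfectly "innocent" samples that would still look unusual enough to reject the null.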

Step 2- Checking Assumptions, Summarizing Data with Test Statistics

Here you have a problem. You don't have access to the sample data points that the auditor used. However, they assured you that the samples are random and independent. Also, you're happy to know that the sample size is large enough for the Central Limit Theorem to apply, so the normality condition for the sample mean is met.

You do know the sample mean CPQL as well, 1450/-. This is also called the Best Estimate. It's lower than your CPQL by 50/-. But is that significantly low? Because this is a sample mean, the number might vary with a different set of samples. So is this difference of INR 50 significant? Or is the difference just because of the variability in the sample mean?

Ok, it all boils down to variability, or spread, or standard deviation. Now, you don't have the standard deviation for the population data. But you can ask the auditor for the standard deviation of the sample data. Say it's 180. Now you can calculate the Standard Error (SE).

Sample Size=25

Estimated Sample Mean=1450

Sample Standard Deviation=180

SE= Sample Standard Deviation/SQRT(Sample Size)=36

Let's calculate the Test Statistic (T):

T = (Best Estimate - Hypothesized Mean) / Standard Error

T = (1450 - 1500) / 36 = -1.39

What does it mean? It tells us how our sample mean compares to our hypothesized mean in terms of the estimated standard error. Our sample mean is lower than 1500/-, but it's only 1.39 standard errors below it. Is that distance significant? To decide, let's convert the test statistic into a probability value.
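If you'd like to reproduce the arithmetic of Step 2, here's a minimal Python sketch. The variable names are made up for illustration; the numbers are the ones quoted above.

```python
import math

sample_size = 25
sample_mean = 1450        # best estimate of the market average CPQL
sample_sd = 180           # sample standard deviation supplied by the auditor
hypothesized_mean = 1500  # our own CPQL, the boundary of the null hypothesis

# Standard error: how much the sample mean is expected to wobble between samples
standard_error = sample_sd / math.sqrt(sample_size)

# Test statistic: distance from the hypothesized mean, measured in standard errors
t_stat = (sample_mean - hypothesized_mean) / standard_error

print(f"SE = {standard_error:.0f}")  # SE = 36
print(f"T  = {t_stat:.2f}")          # T  = -1.39
```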

Step 3- How unusual are the results? Determine p-value

What is the p-value? It's the probability of seeing a test statistic like our result, -1.39, or something more extreme, assuming the null hypothesis is true. How likely is it to get -1.39 or a more extreme value? Well, to calculate that we need to know which distribution our test statistic follows. We don't know the population variance and are estimating it from the sample, hence we use the t-distribution (with 25 - 1 = 24 degrees of freedom).

Ok. We calculated the p-value with the help of Python SciPy and it's 0.09. This means there's a 9% probability of obtaining a test statistic equal to or more extreme than our result under the null theory. So our result is not that unusual under the null theory; it's fairly plausible.
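The p-value above came from SciPy. As a dependency-free cross-check, the sketch below integrates the t-distribution's density numerically with Simpson's rule. The helper names `t_pdf` and `t_cdf` are hypothetical, written just for this illustration; if you have SciPy installed, `scipy.stats.t.cdf(t_stat, df)` should give you the same lower-tail probability in one line.

```python
import math

def t_pdf(x, df):
    """Density of Student's t-distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def t_cdf(x, df, steps=20_000):
    """P(T <= x), via Simpson's rule on [0, |x|] and symmetry around 0."""
    a = abs(x)
    h = a / steps  # steps must be even for Simpson's rule
    total = t_pdf(0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3  # P(0 <= T <= |x|)
    return 0.5 + area if x >= 0 else 0.5 - area

n = 25
t_stat = (1450 - 1500) / (180 / math.sqrt(n))
df = n - 1

# Lower tail only, because the alternative is mu < 1500
p_value = t_cdf(t_stat, df)
print(f"t = {t_stat:.2f}, df = {df}, one-sided p-value = {p_value:.2f}")
```

We take the lower tail because "more extreme" here means an even more negative test statistic, in line with the one-tailed alternative from Step 1.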

Step 4- Making Decisions with Sufficient Statistical Evidence

Now let's look at the fun rules of p-value to make a decision:

- If p-value > significance level, we don't have enough evidence to reject the null.

- If p-value < significance level, we reject the null.

This table above (source:study.com) represents each of the three possible null & alternative hypotheses that can be tested for in an independent T-test along with the rejection regions.

In our case, since the p-value is higher than the significance level (0.09 > 0.05), we fail to reject the null hypothesis.

BINGO! There's insufficient evidence to conclude in favor of the alternative hypothesis, i.e., the market average CPQL is lower than the CPQL we achieved, 1500/-. In other words, we fail to reject the null theory that states the market average CPQL = or > the CPQL you achieved (1500/-).

Finally, you can construct a confidence interval at the 95% confidence level (the counterpart of a 5% significance level) to further support your statement.
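Here's one way such an interval could be sketched in Python. It's an illustration under stated assumptions rather than the auditor's procedure: the critical values 1.711 and 2.064 are read from a standard t-table for 24 degrees of freedom, and since our test is one-tailed, the one-sided upper bound is the closest match, with the familiar two-sided interval shown alongside for reference.

```python
import math

sample_size = 25
sample_mean = 1450
sample_sd = 180
our_cpql = 1500

standard_error = sample_sd / math.sqrt(sample_size)

# Critical values read from a standard t-table for 24 degrees of freedom
T_ONE_SIDED_95 = 1.711  # 5% in one tail
T_TWO_SIDED_95 = 2.064  # 2.5% in each tail

# One-sided 95% upper confidence bound for the market average
upper_bound = sample_mean + T_ONE_SIDED_95 * standard_error

# Conventional two-sided 95% confidence interval
ci_low = sample_mean - T_TWO_SIDED_95 * standard_error
ci_high = sample_mean + T_TWO_SIDED_95 * standard_error

print(f"One-sided 95% upper bound for the market average: {upper_bound:.1f}")
print(f"Two-sided 95% interval: ({ci_low:.1f}, {ci_high:.1f})")
print("Is 1500 still a plausible market average?", our_cpql <= upper_bound)
```

Either way, 1500/- sits inside the range of plausible values for the market average, which agrees with failing to reject the null.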

One thing to note: the decision might vary with changes in the significance level. If we increased our level of significance from 5% to 10%, we'd conclude in favor of the alternative hypothesis, as then the p-value (0.09) < significance level (0.10). However, since you both agreed to a 5% significance level, you're going to stick with your results at that level.

Almost felt like a lawyer fighting her case in the high court?
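The decision rule from Step 4 is simple enough to write down as a tiny function, which makes the effect of the significance level easy to see. The function name `decide` is made up for this sketch, and the p-value is the rounded 0.09 reported above.

```python
def decide(p_value, alpha):
    """Apply the p-value rule from Step 4 at significance level alpha."""
    if p_value < alpha:
        return "reject the null"
    return "fail to reject the null"

p_value = 0.09  # one-sided p-value reported in Step 3

for alpha in (0.05, 0.10):
    print(f"alpha = {alpha:.2f}: {decide(p_value, alpha)}")
```

This is why the significance level has to be agreed before looking at the data: picking it afterwards would let you choose whichever verdict you prefer.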

Where would you use hypothesis testing? To get actionable insights from:

- A change in the CTA button or ad copy

- Revamping pages like Product/Cart/Checkout, etc., of your e-commerce site

- Email vs. in-app notifications

- and so on... Well, in this world of A/B testing, almost everywhere!

#data #dataanalytics #statistics #python #digital #abtesting
