登录查看更多内容

p-hacking

Satwik Behera

Machine Learning Engineer | Data Scientist

发布日期: 2024年7月26日

p-hacking isn't really a technique to manipulate data, rather a bunch of various practices involving various techniques that aim to somehow achieve significance. This often involves performing multiple tests, adjusting variables, or selectively reporting results to achieve a p-value less than 0.05

Let's try and understand what constitutes p-hacking with an example. Say a researcher is conducting a study to investigate whether listening to classical music improves cognitive performance. The primary hypothesis is that participants who listen to classical music will perform better on a memory test compared to those who do not listen to any music

Original Analysis

The researcher conducts a memory test on 2 groups: one listens to classical music before the test, and the other group does not listen to any music. The results show no significant difference between the two groups (p > 0.05)

p-hacking techniques:

1. Post Hoc Subgroup Analysis

The researcher decides to divide participants into subgroups based on age, gender and education level. They find that among participants aged 20-30, those who listened to classical music performed significantly better on memory test (p < 0.05)
They report this subgroup result as if it were a primary finding without disclosing that this was discovered after multiple subgroup analysis

2. Selective Reporting

The researcher initially measured several cognitive outcomes: memory, attention, and problem-solving. Only the memory test was not significant. However, by chance, the attention test showed a significant improvement (p < 0.05) in the music group.
They choose to report only the attention test results, ignoring the primary outcome (memory) and other non-significant results.

3. Re-defining Variables

The researcher redefines the success criterion for the memory test. Initially, the number of correctly recalled items was the measure. They change it to the percentage improvement from a pre-test to post-test, finding that this redefined measure shows a significant improvement (p < 0.05)

4. Data Exclusion

After examining the data, the researcher notices that some participants had very low scores, which could be considered outliers. They decide to exclude these participants from the analysis, and after this exclusion, the results show a significant effect (p < 0.05)

5. Stopping Data Collection

The researcher collects data in phases and checks the results periodically. At one point, they observe a significant result (p < 0.05) and decide to stop collecting further data and report the findings, without mentioning the interim analyses

Outcome:

The final published study claims that listening to classical music significantly improves cognitive performance, specifically in attention and among younger adults. These findings are a result of p-hacking, not a true effect.

领英推荐

Cybersecurity Testing in 2024: Impact of AI

testRigor 4 个月前

Hacking Your Algorithm: Why Critical Thinking Is…

Nicky Verd 2 个月前

AI Security : There is no spoon? You cannot solve a…

John Egan 4 个月前

How/Why p-hacking works?

The probability of getting a significant result increases with multiple testing due to the principles of probability. When you conduct multiple independent tests, the chance of encountering at least one significant result by random chance increases, even if none of the tests individually indicate a true effect.

Mathematical Explanation:

Suppose you are testing a hypothesis at the 0.05 significance level. This means there is a 5% chance (0.05 probability) of obtaining a significant result purely by chance for any single test, assuming the null hypothesis is true.

For a single test, the probability of not finding a significant result is 1 - 0.05 = 0.95.

If you conduct n independent tests, the probability that none of them will be significant is:

(1?0.05)^n=0.95^n

The probability of finding at least one significant result among these n tests is:

1?0.95^n

As n increases, 0.95^n decreases, and thus 1?0.95^n increases. This demonstrates that the probability of obtaining at least one significant result by chance increases with the number of tests.

Example Calculation:

Let's calculate the probability of finding at least one significant result for different numbers of tests:

For n=1: 1?0.95^1 = 0.05
For n=5: 1?0.95^5≈0.226 (22.6%)
For n=10: 1?0.95^10≈0.401 (40.1%)
For n=20: 1 - 0.95^20 ≈0.642 (64.2%)

As seen from the calculations, the probability of obtaining at least one significant result by chance increases substantially as the number of tests increases.

Why Should you do it?

There's a "publish or perish" culture in academia, which incentivizes producing significant, novel results. You might be pressured to find and report significant findings to secure funding, tenure or simply career advancement
There is often a bias in favor of publishing significant results over non-significant ones.

要查看或添加评论，请登录

Satwik Behera的更多文章

Changing the results By Observation

2024年6月29日

Changing the results By Observation

In 1923, Thomas Edison (yes, THAT Thomas Edison) was chairing the 'Committee on the Relation of Quality and Quantity of…
Misleading Through Percentages

2024年6月23日

Misleading Through Percentages

Recently, I came across this article from Amazon : Amazon announces its largest reduction in plastic packaging in North…
AZ900 Cert Prep :: Lesson 11 : Monitoring Tools

2024年6月3日

AZ900 Cert Prep :: Lesson 11 : Monitoring Tools

Azure Advisor Is a services that offers tools to help you ensure high availability of you resources and also efficiency…
AZ900 Cert Prep :: Lesson 10 : Features and Tools for Managing and Deploying Resources

2024年6月3日

AZ900 Cert Prep :: Lesson 10 : Features and Tools for Managing and Deploying Resources

Azure Portal This is the most common way for creating and managing Azure resources The Azure Portal is a web-based…
AZ900 Cert Prep :: Lesson 9 : Features and Tools for Governance and Compliance

2024年6月3日

AZ900 Cert Prep :: Lesson 9 : Features and Tools for Governance and Compliance

Azure Blueprints Till now, we've talked about agility, primarily in the context of scaling resources. But another…

1 条评论
AZ900 Cert Prep :: Lesson 8 : Cost Management in Azure

2024年6月3日

AZ900 Cert Prep :: Lesson 8 : Cost Management in Azure

Factors that can affect costs 1. Meters The first factor that can affect cost is meters.
AZ900 Cert Prep :: Lesson 7 : Azure Identity, Access and Security

2024年5月26日

AZ900 Cert Prep :: Lesson 7 : Azure Identity, Access and Security

There are 3 main types of modern authentication methods that are present in Azure SSO (Single Sign-On) MFA…
AZ900 Cert Prep :: Lesson 6 : Azure Storage Services

2024年3月28日

AZ900 Cert Prep :: Lesson 6 : Azure Storage Services

Storage in the cloud refers to anything that you need to store whether that's for use by an application or archival…
AZ900 Cert Prep :: Lesson 5 : Azure Compute and Networking Services

2024年3月25日

AZ900 Cert Prep :: Lesson 5 : Azure Compute and Networking Services

Let's get a few definitions out of the way before we get into all the different types of compute services available in…
AZ900 Cert Prep :: Lesson 4 : Core Architectural Components

2024年3月10日

AZ900 Cert Prep :: Lesson 4 : Core Architectural Components

Regions, Region Pairs, and Sovereign Regions Microsoft has several data centers in different geographies. Sometimes…

See all articles

p-hacking

Satwik Behera

Machine Learning Engineer | Data Scientist

Original Analysis

The researcher conducts a memory test on 2 groups: one listens to classical music before the test, and the other group does not listen to any music. The results show no significant difference between the two groups (p > 0.05)

p-hacking techniques:

1. Post Hoc Subgroup Analysis

2. Selective Reporting

3. Re-defining Variables

4. Data Exclusion

5. Stopping Data Collection

Outcome:

领英推荐

How/Why p-hacking works?

Why Should you do it?

Satwik Behera的更多文章

社区洞察

其他会员也浏览了

'How to survive a robot uprising: Using AI safely'

Reinventing Yourself in the Age of AI: Lessons from My Own Transformation

A CISO's Perspective on How to Make AI an Accelerator, Not a Blocker

Let's philosophize #1: The nuance paradox, why less precision leads to more complexity.

Peeling Back the Layers: A Comical Guide to Model Inversion Attacks

The Role of AI in Revolutionizing IT Services for Businesses Across Industries

How LinkedIn Addresses Content Related Threats and Abuse Using Machine Learning (AutoML)

Embracing Artificial Intelligence: A Security Manager’s Perspective

The Best of Technology Right Here!

January 29, 2024

Original Analysis

The researcher conducts a memory test on 2 groups: one listens to classical music before the test, and the other group does not listen to any music. The results show no significant difference between the two groups (p > 0.05)

p-hacking techniques:

1. Post Hoc Subgroup Analysis

2. Selective Reporting

3. Re-defining Variables

4. Data Exclusion

5. Stopping Data Collection

Outcome:

领英推荐

How/Why p-hacking works?

Why Should you do it?

Satwik Behera的更多文章

Changing the results By Observation

Misleading Through Percentages

AZ900 Cert Prep :: Lesson 11 : Monitoring Tools

AZ900 Cert Prep :: Lesson 10 : Features and Tools for Managing and Deploying Resources

AZ900 Cert Prep :: Lesson 9 : Features and Tools for Governance and Compliance

AZ900 Cert Prep :: Lesson 8 : Cost Management in Azure

AZ900 Cert Prep :: Lesson 7 : Azure Identity, Access and Security

AZ900 Cert Prep :: Lesson 6 : Azure Storage Services

AZ900 Cert Prep :: Lesson 5 : Azure Compute and Networking Services

AZ900 Cert Prep :: Lesson 4 : Core Architectural Components

社区洞察

其他会员也浏览了

'How to survive a robot uprising: Using AI safely'

Reinventing Yourself in the Age of AI: Lessons from My Own Transformation

A CISO's Perspective on How to Make AI an Accelerator, Not a Blocker

Let's philosophize #1: The nuance paradox, why less precision leads to more complexity.

Peeling Back the Layers: A Comical Guide to Model Inversion Attacks

The Role of AI in Revolutionizing IT Services for Businesses Across Industries

How LinkedIn Addresses Content Related Threats and Abuse Using Machine Learning (AutoML)

Embracing Artificial Intelligence: A Security Manager’s Perspective

The Best of Technology Right Here!

January 29, 2024