登录查看更多内容

Meta-Analysis in R (RStudio) part I

Darko Medin

Data Scientist and a Biostatistician. Developer of ML/AI models. Researcher in the fields of Biology and Clinical Research. Helping companies with Digital products, Artificial intelligence, Machine Learning.

发布日期: 2022年2月28日

In this tutorial i will be showing how to practically perform a Meta-analysis in R [1] using RStudio[2] and the 'meta' package, one of the most validated packages with 238 pages of documentation [3] and optimal for use in Biomedical and Pharmaceutical industry.

Library 'meta' allows for very quick performance. In fact, Meta-analysis can be performed in just couple of lines of code.

Let me start with the Effect size/Data explanation...

AS it can be seen the effect size of the Studies (presented as numbers from 1-19 as this is a hypothetical dataset) is presented as standardized mean difference (smd on the image) and is accompanied by its standard error. I will use a Generic Inverse Variance model which weights the studies effects according their inverse variance. Measure of this weight is actually calculated using the standard error (ste). Now to the coding part:

After installing and loading the 'meta' library, the metagen() function is used to create the generic inverse variance model. It can be noticed how i used the object oriented programming to be as efficient as possible and stored objects like mg (metagen model) and the fpmg (forest plot of the metagen object) using names as short as possible but still informative enough. Defining TE (treatment effect) and seTE(standard error of the treament effect), sm = "SMD" (setting standardized mean difference as the main effect) is main framework for developing the model. I also set the method.tau="DL", meaning i will use DerSimonian&Laird method for estimating between study heterogeneity [4].

Ok, once the model is defined as mg, i will create the forest plot (fmpg) in a very simple way, fmpg=forest(mg) and voila, the first forest plot.

Before interpreting the the results i can order the studies according to their effect sizes.

This will be done by adding sortvar argument, so the code will be

>fpmg=forest(mg, sorvar=smd)

领英推荐

Hypergraphs and RDF

Kurt Cagle 1 年前

MAKING STOCHASTIC PROGRAMING MODELS - FILLING THE…

Jesus Velasquez-Bermudez 5 年前

Vector and Covector Fields

Patrick Nicolas 11 个月前

Now the plot is visually much more interpretable in terms of both individual studies and pooled effect. One thing i would point out is that there are 2 pooled effect by default. Common model is actually a Fixed effects model and the Random effects model is bellow.

Heterogeneity is also presented on the plot and the associated tau squared and the p value (<0.001) are related to heterogeneity test which is positive in this case. Its important to differentiate Heterogeneity p value from the pooled effects p values which could also be accessed and added simply by running summary(mg) or by setting other arguments within the function which will be discussed further in next tutorials.

Next tutorial will be about making subgroup analysis.

By Darko Medin

Clinical Biostatistics, Data Science and AI expert

References :

1.R Core Team (2014). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/.

2. RStudio Team (2015). RStudio: Integrated Development for R. RStudio, Inc., Boston, MA URL https://www.rstudio.com/.

3.Balduzzi S, Rücker G, Schwarzer G (2019). “How to perform a meta-analysis with R: a practical tutorial.” Evidence-Based Mental Health, 153–160.

4.DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin Trials. 1986 Sep;7(3):177-88. doi: 10.1016/0197-2456(86)90046-2. PMID: 3802833.

GANAPATHY PALANIMUHTU

Chartered Accountant & Tech Enthusiast | Specialized in Statistics, Meta Analysis & Visualization in R | YouTube Educator on ShinyApp & Machine Learning | Value-Driven Finance Prof | Seeking Business & Data Analytic role

1 年

Excellent, very well presented

Darko Medin

Data Scientist and a Biostatistician. Developer of ML/AI models. Researcher in the fields of Biology and Clinical Research. Helping companies with Digital products, Artificial intelligence, Machine Learning.

3 年

Some of the topics covered is also understanding the forest plot effect size, weighting based on Generic inverse variance and Fixed and Random effects models parts.

查看更多评论

要查看或添加评论，请登录

Darko Medin的更多文章

OncoNeo400 - A new Precision Oncology Research AI tool on BioAIWorks

2025年3月16日

OncoNeo400 - A new Precision Oncology Research AI tool on BioAIWorks

In this edition the OncoNeo400, novel Precision Oncology Research AI tool on BioAIWorks platform (bioaiworks.com).

7 条评论
LARVOL CLIN - New modules

2025年3月3日

LARVOL CLIN - New modules

This featuring article is about the new modules Larvol Pseudo-IPD and Larvol NMA on https://clin.larvol.

1 条评论
AI Developer tech skillsets.

2025年2月24日

AI Developer tech skillsets.

While these skills may vary according to the role, i will discuss the most significant ones that almost every AI…

2 条评论
Featuring article - the book : How To Be an Effective Statistician by Dr. Alexander Schacht

2025年2月16日

Featuring article - the book : How To Be an Effective Statistician by Dr. Alexander Schacht

The book How To Be an Effective Statistician: A Guide for Statisticians, Data Scientists, and Other Quantitative…

2 条评论
Causal Inference II Live - The ORIENTATION

2025年2月11日

Causal Inference II Live - The ORIENTATION

Causal Inference II is a Live Linkedin Event by Justin Bélair and Darko Medin . Here is the orientation on how and when…

9 条评论
Simulated and Synthetic Data Generation - Edition 1

2024年10月31日

Simulated and Synthetic Data Generation - Edition 1

The first in the series for Simulated and Synthetic Data Generation - by Darko Medin. Where to read :…
Simulated and Synthetic Data Series by Darko Medin - An ORIENTATION

2024年10月20日

Simulated and Synthetic Data Series by Darko Medin - An ORIENTATION

This is the orientation for my upcoming Series on Simulated and Synthetic Data. If you have any additional suggestions…

5 条评论
Simulated and Synthetic Data Generation - The Effective Statistician Workshop ORIENTATION - Lead by Darko Medin

2024年10月13日

Simulated and Synthetic Data Generation - The Effective Statistician Workshop ORIENTATION - Lead by Darko Medin

In today's data-driven world ability to generate Simulated and Synthetic data is one of the most important Data Science…
INTRODUCTION TO DEEP LEARNING

2024年10月3日

INTRODUCTION TO DEEP LEARNING

The INTRODUCTION TO DEEP LEARNING tutorial. Where to find? adatascience.
BioAIworks - The novel AI platform

2024年9月25日

BioAIworks - The novel AI platform

Bio AI works is a novel AI platform, with main focus on AI Data Generation, Augmenting Biology and Biomedical Research…

8 条评论

See all articles

Meta-Analysis in R (RStudio) part I

Darko Medin

Data Scientist and a Biostatistician. Developer of ML/AI models. Researcher in the fields of Biology and Clinical Research. Helping companies with Digital products, Artificial intelligence, Machine Learning.

领英推荐

Darko Medin的更多文章

社区洞察

其他会员也浏览了

Is it Time for Everything Apps?

?? Linear Algebra & Matrix Computations: The Power Behind AI ??

Algorithms — Big O Notation

I Ran Billions of Simulations to Simplify Multi-Armed Bandit Algorithms for You

Fundamentals of Quantization - Quantization of LLMs, Part-3

From Basics to Business Impact: Unlocking the Power of Graph Theory and Graph Rag

Mathematics for Data Science and Machine Learning - Part 1

Visualization of Mathematical Engineering of Transformers - Part 2

Day 06 — Support Vector Machine

Ever Wondered How Google Maps Finds the Shortest Route? The Secret Lies in Graph Theory!

领英推荐

Darko Medin的更多文章

OncoNeo400 - A new Precision Oncology Research AI tool on BioAIWorks

LARVOL CLIN - New modules

AI Developer tech skillsets.

Featuring article - the book : How To Be an Effective Statistician by Dr. Alexander Schacht

Causal Inference II Live - The ORIENTATION

Simulated and Synthetic Data Generation - Edition 1

Simulated and Synthetic Data Series by Darko Medin - An ORIENTATION

Simulated and Synthetic Data Generation - The Effective Statistician Workshop ORIENTATION - Lead by Darko Medin

INTRODUCTION TO DEEP LEARNING

BioAIworks - The novel AI platform

社区洞察

其他会员也浏览了

Is it Time for Everything Apps?

?? Linear Algebra & Matrix Computations: The Power Behind AI ??

Algorithms — Big O Notation

I Ran Billions of Simulations to Simplify Multi-Armed Bandit Algorithms for You

Fundamentals of Quantization - Quantization of LLMs, Part-3

From Basics to Business Impact: Unlocking the Power of Graph Theory and Graph Rag

Mathematics for Data Science and Machine Learning - Part 1

Visualization of Mathematical Engineering of Transformers - Part 2

Day 06 — Support Vector Machine

Ever Wondered How Google Maps Finds the Shortest Route? The Secret Lies in Graph Theory!