Lesson 4 From My Students: Size Matters – Getting Sample Size Right
Merle Werbeloff, PhD (Wits)
Guiding stressed postgraduate students to accelerate and confidently finish their dissertations through academic coaching, data analysis support, and essential skills.
The Scenario
A student’s dissertation I recently reviewed aimed to predict a rare medical disease using data from patient records. Unfortunately, the student had not done an upfront power analysis to determine the sample size needed. Given the infrequent occurrence of the event, a large sample was necessary to detect meaningful effects, but collecting such a large sample wasn’t feasible.
The analysis, predictably, had low statistical power, and no significant effects were found. This left the study inconclusive. Was there truly no relationship, or was the sample simply too small to detect one?
The Issue
The student’s scenario highlights the risks of low power: the inability to distinguish between a lack of evidence and a lack of effect. This principle is captured by the phrase: “Absence of evidence is not evidence of absence.”
Sample size directly impacts the reliability of your results. In quantitative research, inadequate sample size results in low statistical power, increasing the likelihood of failing to detect meaningful effects. Conversely, an overly large sample size can result in statistically significant findings for trivial differences, leading to overinterpretation. From an ethical perspective, oversampling can waste participants’ time and resources, which is equally problematic.
In qualitative research, the concern shifts to achieving data saturation, which requires thoughtful analysis rather than arbitrary numbers.
The Solution
Addressing these challenges requires both rigorous planning and a commitment to transparency:
The Key Takeaway
Appropriate sample size is essential for credible research. In quantitative studies, perform a power analysis upfront to avoid underpowered or overpowered results. In qualitative research, focus on achieving data saturation through iterative analysis.
Most importantly, when results are inconclusive due to limitations, embrace transparency. As in this case, descriptive statistics and a discussion grounded in “Absence of evidence is not evidence of absence” can add value and maintain the credibility of your work.