Why thumb rule of using just 5 test users might not be a good idea?
Ravi Surana
Design @Paystand and @Teampay | Design Leader & Mentor & Freelancer | AI Enthusiast | IIT Kharagpur (MIT of India)
Let’s first understand what the 5-test-users rule is.
In 1993, Jakob Nielsen and Thomas K. Landauer published a study titled "A Mathematical Model of the Finding of Usability Problems." According to their research, the optimal results are achieved by testing no more than 5 users.
You may wonder, how did they arrive at this number? Why not 10, 20, or 50?
After conducting thousands of studies, these researchers pondered whether they could develop a mathematical model to estimate the total number of usability problems that could be identified during the evaluation process. They formulated the following model:
Found(i) = N(1 – (1-A)^i)
Where:
Found(i) represents the number of usability problems found by i evaluators.
i is the number of testers.
N is the total number of usability problems.
A is the probability of finding a usability problem with one additional person, given that it has not been found yet. In simpler terms, it represents the percentage of problems that a single user would find.
Using the equation provided, they created the following graph:
If we assume:
A = 0.31 (31% is what Nielsen found after conducting testing on 11 interfaces using 359 evaluators)
N (Total usability problems) = 100
i (Number of Testers) = 5
Usability Issues Found (by 5) = 100(1-(1-0.31)^5) = 84.35%
领英推荐
This indicates that by employing just 5 evaluators, you would be able to uncover 84.35% of usability issues. Now, let’s examine what happens with the addition of another evaluator.
Usability Issues Found (by 6) = 100(1-(1-0.31)^6) = 89.20%
By increasing the number of evaluators by one, the number of problems found increases by just 5%, which isn't substantial. Hence, the "magic number" 5 emerged.
In his renowned book "Sprint," Jack Knapp states: “They have seen the same phenomenon in their tests. By the time they observe the fifth customer, they are just confirming patterns that showed up in the first four interviews. We tried testing with more customers, but it was not worth it.”
However, my curiosity drove me to delve deeper into their research paper. Here are some important points to consider before adhering to the 5 testers rule:
When considering a study conducted 30 years ago with numerous constraints, do you believe it's wise to apply it blindly in 2024? I'm not in a position to assert that this study is incorrect. My intention is simply to shed light on certain facts that should be considered before applying it. Now, you might ask, if I don't choose option 5, how should I decide on this number? Providing an answer would be challenging as it depends on various factors such as:
Here are some tips that you can use when you are conducting your next research:
To be honest, Research is often considered a luxury, with limited resources available in many companies for conducting comprehensive studies. However, incorporating research into your process can still be achievable by implementing the following strategies:
Even more crucially, cultivate a testing mindset. Often, the obstacle to conducting research isn't solely the lack of budget and time but inertia. If research hasn't been a part of your organization's routine for some time, it can be daunting to begin. Overcome this inertia by initiating small tests gradually.
The next time you hesitate to conduct research, remember this:
While bypassing research may yield quick results and a sense of accomplishment, you're losing a significant opportunity to enhance your interface and, consequently, refine your own skill set.
You might find this paper interesting. It came out in 1994, the year after the publication of Nielsen and Landauer, but was written before their 1993 paper was published. https://www.researchgate.net/publication/336714538_Sample_Sizes_for_Usability_Studies_Additional_Considerations