ISO/IEC 17043:2023 - Outlier -Technique?to determine Outlier in Calibration.

ISO/IEC 17043:2023 - Outlier -Technique?to determine Outlier in Calibration.

Grubbs' Test

Grubbs' test, also known as the Grubbs' outlier test or Grubbs' extreme studentized deviate test, is a statistical test used to identify outliers in a univariate data set. The test is particularly useful when you suspect that there might be one or more extreme values (outliers) that are significantly different from the rest of the data.

The basic idea behind Grubbs' test is to compare the value of the potential outlier to the rest of the data points and determine if it is significantly different. The null hypothesis for the test is that there are no outliers in the data set. The alternative hypothesis is that there is at least one outlier.

The test statistic, G, is calculated as:

G=max(∣Xi?Xˉ∣)sG=smax(∣Xi?Xˉ∣)

Where:

  • XiXi is the potential outlier value,
  • Xˉ is the sample mean,
  • ss is the sample standard deviation.

Once the test statistic is calculated, it is compared to the critical value from the Grubbs' table or calculated using a significance level and the degrees of freedom. If the calculated test statistic is greater than the critical value, the null hypothesis is rejected, and the data point is considered an outlier.

Let's consider an example where Grubbs' test is used in the calibration of a multimeter.

In this scenario, you have a set of measurements taken by the multimeter, and you want to check if there are any outliers that might affect the calibration process.

Suppose you have the following resistance measurements (in ohms) from your multimeter:

R={2.3,2.4,2.5,2.4,2.6,100.0} R={2.3,2.4,2.5,2.4,2.6,100.0}

In this dataset, most values are around 2.4 ohms, but there's a potential outlier at 100.0 ohms. We'll use Grubbs' test to determine if this value is a significant outlier.

  1. Calculate the sample mean and standard deviation: Xˉ=2.3+2.4+2.5+2.4+2.6+100.06Xˉ=62.3+2.4+2.5+2.4+2.6+100.0 Xˉ≈18.87Xˉ≈18.87

s=∑i=16(Xi?Xˉ)2n?1s=n?1∑i=16(Xi?Xˉ)2 s≈35.76s≈35.76

  1. Calculate the test statistic, G: G=max(∣100.0?18.87∣)35.76G=35.76max(∣100.0?18.87∣) G≈2.66G≈2.66
  2. Determine the critical value: You would compare the calculated test statistic to the critical value from the Grubbs' table for a given significance level (e.g., 0.05) and degrees of freedom (n-1). For simplicity, let's assume a significance level of 0.05 and 5 degrees of freedom.

If G>GcriticalG>Gcritical, you would reject the null hypothesis (no outliers) in favor of the alternative hypothesis (at least one outlier).

  1. Compare G to the critical value: You can find the critical value from a Grubbs' table or use statistical software. If G>GcriticalG>Gcritical, you would reject the null hypothesis.

If Grubbs' test indicates that 100.0 ohms is an outlier, you might want to investigate the measurement further or consider excluding it from the calibration process if it is determined to be erroneous. Keep in mind that the example values used here are for illustrative purposes, and you should adapt them based on your actual data and the specifics of your calibration process.

Dixon's Q Test

Dixon's Q Test is a statistical test used for identifying outliers in a univariate dataset. It is particularly useful when you suspect that there might be one or two values that are significantly different from the rest of the data. The test is simple but effective and can be applied in various fields such as chemistry, environmental science, and quality control.

Here are the key steps and concepts associated with Dixon's Q Test:

  1. Data Arrangement:Sort the dataset in ascending order.
  2. Test Statistic (Q):Calculate the test statistic, QQ, using the following formula: Q=GapRange Q=RangeGap where Gap is the absolute difference between the value to be tested and its nearest neighbour, and Range is the difference between the maximum and minimum values in the dataset.
  3. Critical Value:Compare the calculated QQ value to the critical QQ value from statistical tables or charts. The critical value depends on the desired significance level (commonly 0.05 or 0.10) and the sample size.
  4. Decision Rule:If the calculated QQ value is greater than the critical QQ value, it suggests that the data point being tested is an outlier.
  5. Handling Outliers:If Dixon's Q Test identifies an outlier, further investigation is warranted. The cause of the outlier could be due to measurement error, instrument malfunction, or other factors. Depending on the context, you might choose to investigate and correct the measurement, repeat the measurement, or, in some cases, exclude the outlier from the analysis.

It's important to note that Dixon's Q Test has limitations, and its effectiveness decreases as the sample size increases. Moreover, the test assumes that the data follows a normal distribution. If the dataset significantly deviates from normality, other outlier detection methods or transformations may be more appropriate.

As with any statistical test, the interpretation of results should be done cautiously, and it's advisable to use Dixon's Q Test in conjunction with other outlier detection methods for a comprehensive analysis of your data.

Let's consider an example of applying Dixon's Q Test in the calibration of a multimeter.

Suppose you have taken a set of resistance measurements with your multimeter, and you want to check if there are any potential outliers in the data.

Here is a sample dataset:

R={2.1,2.3,2.4,2.5,2.4,2.6,2.8}R={2.1,2.3,2.4,2.5,2.4,2.6,2.8}

Let's say you suspect that the measurement 2.8 might be an outlier. We will apply Dixon's Q Test to test this suspicion.

  1. Data Arrangement: Sort the dataset in ascending order: R={2.1,2.3,2.4,2.4,2.5,2.6,2.8}R={2.1,2.3,2.4,2.4,2.5,2.6,2.8}
  2. Test Statistic (Q): Calculate the test statistic, QQ: Q=GapRangeQ=RangeGap where Gap is the absolute difference between the suspected outlier and its nearest neighbor, and Range is the difference between the maximum and minimum values in the dataset.For this example, Gap = |2.8 - 2.6| = 0.2 and Range = 2.8 - 2.1 = 0.7. Q=0.20.7≈0.2857Q=0.70.2≈0.2857
  3. Critical Value: Determine the critical QQ value based on the desired significance level and sample size. For a significance level of 0.05 and a sample size of 7, you can refer to statistical tables or online calculators to find the critical QQ value. Let's assume QcriticalQcritical is approximately 0.47.
  4. Decision Rule: Compare the calculated QQ value to the critical QQ value. If Q>QcriticalQ>Qcritical, then the data point (2.8 in this case) is considered an outlier.In this example, 0.2857<0.470.2857<0.47, so we do not reject the null hypothesis, and there is no evidence to suggest that 2.8 is an outlier based on Dixon's Q Test.

要查看或添加评论,请登录

社区洞察

其他会员也浏览了