Back to chapter

1.16:

Detection of Gross Error: The Q Test

JoVE 핵심
Analytical Chemistry
JoVE 비디오를 활용하시려면 도서관을 통한 기관 구독이 필요합니다.  전체 비디오를 보시려면 로그인하거나 무료 트라이얼을 시작하세요.
JoVE 핵심 Analytical Chemistry
Detection of Gross Error: The Q Test

Languages

소셜에 공유하기

Outliers are those data points extremely different from the rest of the data set.

Dixon's Q-test is a significance test that helps determine whether to retain or reject these inconsistent data points in the data set.

Mathematically, the Q-test statistic is the ratio of the absolute difference between the outlier to its nearest data point and the range of the population.

The Q-test statistic or rejection quotient is then compared with the tabulated critical Q value for a particular significance level and degrees of freedom.

When the rejection quotient is equal to or larger than the tabulated value, the null hypothesis is rejected, and the suspected outlier can be rejected.

On the contrary, the null hypothesis is accepted when the experimental value is smaller than the tabulated Q value. Then, the suspected outlier needs to be retained in the data set.

1.16:

Detection of Gross Error: The Q Test

When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods to help determine whether to retain the outlier. A statistical method that can help us retain or reject outliers is called the Q-test. To perform the Q-test, we first arrange the values in a data set in order of increasing value. Then, we calculate the Q value by taking the ratio of the absolute difference between a data point and its adjacent data point. This Q value is then compared with the tabulated critical Q value at a chosen significance level and appropriate degrees of freedom. If the Q value equals or exceeds the reference Q value in the table, the data point is considered an outlier and therefore rejected from the data set. Here, it is reasonable to disregard the data point as an outlier because the magnitude of the deviation cannot be logically accounted for by random (indeterminate) errors. On the other hand, if the Q value is smaller than the reference value in the table, the data point should be retained, and the interpretation is that the difference between this data point and the rest of the data is within reasonable (statistical) expectation.