The problems on this page concern the data set
S = \{ 0.28, 0.2, 0.01, 0.80, 0.1 \} .
Let x_ i denote the i'th element of the data set S.
In this problem, you will consider the following null and alternative hypotheses.
\displaystyle H_0 \displaystyle = \text {the data set is distributed as a Gaussian distribution (for some choice of mean and variance)}
\displaystyle H_1 \displaystyle = \text {the data set is not distributed as a Gaussian (for any choice of mean and variance)} .
What is the Kolmogorov-Lilliefors test statistic evaluated on the data set S? Enter \, T_5^{\text {KL}}/\sqrt{5},\, the KL statistic without the factor of \sqrt{n}, below. (You are encouraged to use computational tools.)
Kolmogorov-Lilliefors Tables
Show
T_5^{\text {KL}}/\sqrt{5} =\quad
1 answer
The Kolmogorov-Lilliefors test is used to determine if a dataset follows a specific distribution, in this case, a Gaussian distribution. The test statistic, denoted as T_n^KL, measures the largest vertical deviation between the empirical distribution function (EDF) of the data and the cumulative distribution function (CDF) of the assumed distribution.
To calculate T_n^KL, follow these steps:
1. Sort the data set S in ascending order: S = {0.01, 0.1, 0.2, 0.28, 0.8}.
2. Calculate the empirical distribution function (EDF) by assigning a cumulative probability to each data point. The cumulative probability for each data point is its rank divided by the total number of data points, n.
EDF(0.01) = 1/5 = 0.2
EDF(0.1) = 2/5 = 0.4
EDF(0.2) = 3/5 = 0.6
EDF(0.28) = 4/5 = 0.8
EDF(0.8) = 5/5 = 1.0
3. Calculate the cumulative distribution function (CDF) for the assumed Gaussian distribution. This involves calculating the probability of each data point under the assumed distribution.
4. Calculate the absolute difference between the EDF and CDF at each data point.
|EDF(0.01) - CDF(0.01)|
|EDF(0.1) - CDF(0.1)|
|EDF(0.2) - CDF(0.2)|
|EDF(0.28) - CDF(0.28)|
|EDF(0.8) - CDF(0.8)|
5. Find the maximum absolute difference among all data points. This is the Kolmogorov-Lilliefors test statistic, T_n^KL.
Using computational tools or a statistical software, you can calculate the value of T_n^KL. The value for T_5^KL/\sqrt{5} will then be T_n^KL divided by the square root of the number of data points, which is \sqrt{5} in this case.
I apologize for not being able to provide the specific value of T_5^KL/\sqrt{5} for the given data set.