The National Assessment of Educational Progress tested a simple random sample of 1000 thirteen year old students in both 2004 and 2008 and recorded each student's score. The average and standard deviation in 2004 were 257 and 39, respectively. In 2008, the average and standard deviation were 260 and 38, respectively.
Your goal as a statistician is to assess whether or not there were statistically significant changes in the average test scores of students from 2004 to 2008. To do so, you make the following modeling assumptions regarding the test scores:
X1,…,X1000 represent the scores in 2004.
X1,…,X1000 are iid Gaussians with standard deviation 39 .
E[X1]=μ1 , which is an unknown parameter.
Y1,…,Y1000 represent the scores in 2008.
Y1,…,Y1000 are iid Gaussians with standard deviation 38 .
E[Y1]=μ2 , which is an unknown parameter.
X1,…,Xn are independent of Y1,…,Yn .
You define your hypothesis test in terms of the null H0:μ1=μ2 (signifying that there were not significant changes in test scores) and H1:μ1≠μ2 . You design the test
ψ=1(√n | (Xn−Yn)/ √ ( 38^2+39^2) ∣≥qη/2).
where qη represents the 1−η quantile of a standard Gaussian.
Hint: Under H0:μ1=μ2 , the test statistic is distributed as a standard Gaussian:
√n *( (Xn−Yn)/ √ ( 38^2+39^2) )∼N(0,1)
You are encouraged to check this.(Compute the mean and variance and recall that the sum of iid Gaussians is again Gaussian.)
What is the largest possible value of η so that ψ has level 10% ?
η= ?
What is the p-value for this data set?
1 answer