Let 𝑋1,…,𝑋𝑛 be i.i.d. Bernoulli random variables with unknown parameter π‘βˆˆ(0,1) . Suppose we want to test

𝐻0:π‘βˆˆ[0.48,0.51]vs𝐻1:π‘βˆ‰[0.48,0.51]

We want to construct an asymptotic test πœ“ for these hypotheses using π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›. For this problem, we specifically consider the family of tests πœ“π‘1,𝑐2 where we reject the null hypothesis if either π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›<𝑐1≀0.48 or π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›>𝑐2β‰₯0.51 for some 𝑐1 and 𝑐2 that may depend on 𝑛 , i.e.

πœ“π‘1,𝑐2=1((π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›<𝑐1)βˆͺ(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›>𝑐2))where 𝑐1<0.48<0.51<𝑐2.

Throughout this problem, we will discuss possible choices for constants 𝑐1 and 𝑐2 , and their impact to both the asymptotic and non-asymptotic level of the test.

b) Use the central limit theorem and the approximation 𝑝(1βˆ’π‘)β€Ύβ€Ύβ€Ύβ€Ύβ€Ύβ€Ύβ€Ύβ€Ύβˆšβ‰ˆ12 for π‘βˆˆ[0.48,0.51] to approximate 𝐏𝑝(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›<𝑐1) and 𝐏𝑝(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›>𝑐2) for large 𝑛. Express your answers as a formula in terms of 𝑐1, 𝑐2, 𝑛 and 𝑝.

(Write Phi for the cdf of a Normal distribution, c_1 for 𝑐1, and c_2 for 𝑐2.)

𝐏𝑝(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›<𝑐1)β‰ˆ ?
For what value of π‘βˆˆ[0.48,0.51] is the expression above for 𝐏𝑝(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›<𝑐1) maximized?

𝐏𝑝(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›<𝑐1)is max at 𝑝= ?

𝐏𝑝(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›>𝑐2)β‰ˆ ?

For what value of π‘βˆˆ[0.48,0.51] is the expression above for 𝐏𝑝(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›>𝑐2) maximized?

𝐏𝑝(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›>𝑐2)is max at 𝑝= ?

d) Suppose that we wish to have a level 𝛼=0.05. What 𝑐1 and 𝑐2 will achieve 𝛼=0.05? Choose 𝑐1 and 𝑐2 by setting the expressions you obtained above for maxπ‘βˆˆ[0.48,0.51]𝐏𝑝(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›<𝑐1) and maxπ‘βˆˆ[0.48,0.51]𝐏𝑝(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›>𝑐2) to both be 0.025.

(If applicable, enter q(alpha) for π‘žπ›Ό, the 1βˆ’π›Ό-quantile of a standard normal distribution, e.g. enter q(0.01) for π‘ž0.01. )

𝑐1= ?

𝑐2=?

e) We will now show that the values we just derived for 𝑐1 and 𝑐2 are in fact too conservative.

Recall the expression from part (b) for 𝐏𝑝(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›<𝑐1) for large 𝑛. For 𝑝>0.48 (note the strict inequality), find limπ‘›β†’βˆžππ‘(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›<𝑐1).

limπ‘›β†’βˆžππ‘>0.48(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›<𝑐1)= ?

Similarly, for 𝑝<0.51 (note the strict inequality), find limπ‘›β†’βˆžππ‘(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›>𝑐2). Use the expression you found in part (b) for 𝐏𝑝(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›>𝑐2).

limπ‘›β†’βˆžππ‘<0.51(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›>𝑐2)= ?

f) Next, we analyze the asymptotic test given different possible values of 𝑝, in order to choose suitable and sufficiently-tight 𝑐1 and 𝑐2. Looking more closely at part (d), we may note that the asymptotic behavior of the expressions for the errors are different depending on whether 𝑝=0.48, 0.48<𝑝<0.51, or 𝑝=0.51.

Based on your answers and work from the previous part, evaluate the asymptotic Type 1 error

𝐏(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›<𝑐1)+𝐏(π‘‹βŽ―βŽ―βŽ―βŽ―βŽ―π‘›>𝑐2).

on each of the three cases for the value of 𝑝 in terms of 𝑐1, 𝑐2, and 𝑛, and determine in each case which component(s) of the Type 1 error will converge to zero.

This would allow you to come up with a new set of conditions for 𝑐1 and 𝑐2 in terms of 𝑛, given the desired level of 5%. Enter these values (in terms of 𝑛) below.

(If applicable, enter q(alpha) for π‘žπ›Ό, the 1βˆ’π›Ό-quantile of a standard normal distribution, e.g. enter q(0.01) for π‘ž0.01. Do not worry about the parser not rendering q(alpha) properly; the grader will work nonetheless. You could also enclose q(alpha) by brackets for the rendering to show properly.)

𝑐1= ?

𝑐2= ?

1 answer

8.(a). α=maxp∈[0.48,0.51](Pp(Xn<c1)+Pp(Xn>c2))