Exercise: CLT applicability

Consider the class average in an exam in a few different settings. In all cases, assume that we have a large class consisting of equally well prepared students. Think about the assumptions behind the central limit theorem, and choose the most appropriate response under the given description of the different settings.

1. Consider the class average in an exam of fixed difficulty.

(a) The class average is approximately normal

(b) The class average is not approximately normal because the student scores are strongly dependent

(c) The class average is not approximately normal because the student scores are not identically distributed

2. Consider the class average in an exam that is equally likely to be very easy or very hard.

(a) The class average is approximately normal

(b) The class average is not approximately normal because the student scores are strongly dependent

(c) The class average is not approximately normal because the student scores are not identically distributed

3. Consider the class average if the class if split into two equal size sections. One section gets an easy exam and the other section gets a hard exam.

(a) The class average is approximately normal

(b) The class average is not approximately normal because the student scores are strongly dependent

(c) The class average is not approximately normal because the student scores are not identically distributed

4. Consider the class average if every student is (randomly and independently) given either an easy or a hard exam.

(a) The class average is approximately normal

(b) The class average is not approximately normal because the student scores are strongly dependent

(c) The class average is not approximately normal because the student scores are not identically distributed

2 answers

1. (a)
2. (b)
3. (a)
4. (a)
1. Since students are equally well-prepared and the difficulty level is fixed, the only randomness in a student's score comes from luck or accidental mistakes of that student. It is then plausible to assume that each student's score will be an independent random variable drawn from the same distribution, and the CLT applies.

2. Here, the score of each student depends strongly on the difficulty level of the exam, which is random but common for all students. This creates a strong dependence between the student scores, and the CLT does not apply.

3. This is more subtle. The scores of the different students are not identically distributed. However, let 𝑌𝑖 be the score of the 𝑖 th student from the first section and let 𝑍𝑖 be the score of the 𝑖 th student in the second section. The class average is the average of the random variables (𝑌𝑖+𝑍𝑖)/2 . Under our assumptions, these latter random variables can be modeled as i.i.d., and the CLT applies.

4. Unlike part (2), here the student scores are i.i.d., and the CLT applies.