To test the hypothesis that viewing a word cloud results in a faster conclusion than reviewing a text summary, we can perform a paired t-test. We'll define our null hypothesis (\(H_0\)) and alternative hypothesis (\(H_a\)) as follows:
- \(H_0\): There is no difference in the time to determine if a document is worth reading between the text summary and the word cloud (mean difference \( \mu_d = 0 \)).
- \(H_a\): The word cloud is faster than the text summary (mean difference \( \mu_d < 0 \)).
First, we need to calculate the difference in time for each individual. The difference is calculated as:
\[ \text{Difference} = \text{Time to do Text Scan} - \text{Time to view Word Cloud} \]
Using the provided data, we can compile the differences:
| Document | Time to do Text Scan (sec) | Time to view Word Cloud (sec) | Difference (Text Scan - Word Cloud) | |----------|----------------------------|-------------------------------|-------------------------------------| | 1 | 3.51 | 2.93 | 0.58 | | 2 | 2.90 | 3.05 | -0.15 | | 3 | 3.73 | 2.69 | 1.04 | | 4 | 2.59 | 1.95 | 0.64 | | 5 | 2.42 | 2.19 | 0.23 | | 6 | 5.41 | 3.60 | 1.81 | | 7 | 1.93 | 1.89 | 0.04 | | 8 | 2.37 | 2.01 | 0.36 | | 9 | 2.81 | 2.39 | 0.42 | | 10 | 2.67 | 2.75 | -0.08 |
Calculate the Differences
Now, let's calculate the average difference, standard deviation of the differences, and the sample size:
1. Differences: \[ d = [0.58, -0.15, 1.04, 0.64, 0.23, 1.81, 0.04, 0.36, 0.42, -0.08] \]
2. Calculate \(\bar{d}\) (Mean of differences) and \(s_d\) (Standard deviation of differences):
- Mean of differences:
\[ \bar{d} = \frac{\sum{d}}{n} = \frac{0.58 - 0.15 + 1.04 + 0.64 + 0.23 + 1.81 + 0.04 + 0.36 + 0.42 - 0.08}{10} = \frac{4.89}{10} = 0.489 \]
- Standard deviation of differences:
\[ s_d = \sqrt{\frac{\sum{(d_i - \bar{d})^2}}{n - 1}} \]
Calculating \(s_d\):
\[ s_d = \sqrt{\frac{(0.58 - 0.489)^2 + (-0.15 - 0.489)^2 + (1.04 - 0.489)^2 + (0.64 - 0.489)^2 + (0.23 - 0.489)^2 + (1.81 - 0.489)^2 + (0.04 - 0.489)^2 + (0.36 - 0.489)^2 + (0.42 - 0.489)^2 + (-0.08 - 0.489)^2}{10 - 1}} \]
Calculating \(s_d\):
\[ s_d \approx \sqrt{\frac{(0.091^2) + (-0.639^2) + (0.551^2) + (0.151^2) + (-0.259^2) + (1.321^2) + (-0.449^2) + (-0.129^2) + (-0.069^2) + (-0.569^2)}{9}} \]
Let's compute this in detail:
Calculate squared differences:
- \( (0.091)^2 = 0.008281 \)
- \( (-0.639)^2 = 0.408321 \)
- \( (0.551)^2 = 0.303601 \)
- \( (0.151)^2 = 0.022801 \)
- \( (-0.259)^2 = 0.067081 \)
- \( (1.321)^2 = 1.748761 \)
- \( (-0.449)^2 = 0.201601 \)
- \( (-0.129)^2 = 0.016641 \)
- \( (-0.069)^2 = 0.004761 \)
- \( (-0.569)^2 = 0.323761 \)
Sum: \[ \sum (d_i - \bar{d})^2 = 0.008281 + 0.408321 + 0.303601 + 0.022801 + 0.067081 + 1.748761 + 0.201601 + 0.016641 + 0.004761 + 0.323761 \approx 2.616728 \] \[ s_d \approx \sqrt{\frac{2.616728}{9}} \approx \sqrt{0.290748} \approx 0.539 \]
3. Calculate the t-statistic: Using the formula for the t-statistic for paired differences:
\[ t = \frac{\bar{d}}{s_d / \sqrt{n}} = \frac{0.489}{0.539 / \sqrt{10}} \approx \frac{0.489}{0.170} \approx 2.876 \]
4. Degrees of freedom: The degrees of freedom \(df = n - 1 = 10 - 1 = 9\).
5. Critical t-value: For a one-tailed test at \(\alpha = 0.05\) and \(df = 9\):
Using a t-table or calculator: ( t_{critical} = -1.833 )
6. Conclusion: Since our calculated t-value \(2.876\) is greater than the critical value from the t-distribution (1.833 for a one-tailed test), we reject the null hypothesis.
Thus, we conclude that viewing the word cloud results in a faster determination of whether a document is worth reading compared to the text summary at the \(\alpha = 0.05\) significance level.