Confidence Interval Estimate of the Mean
From central limit theorem we know that
\[Z = \frac{{\overline X – \mu }}{{\frac{\sigma }{{\sqrt n }}}}\]
is a standard normal variate. From the discussion of an introduction to interval estimation we know that P (–1.96 < Z< 1.96) = 0.95 has the least possible range. With this inequality we can construct a 95% confidence interval estimate of the population mean$$\mu $$ if we replace Z by \[\overline X – \mu /\frac{\sigma }{{\sqrt n }}\]
In general if the margin of error in our interval estimate is $$\alpha $$ (this is also referred to as the level of significance), then the level of confidence would be $$\left( {1 – \alpha } \right)$$ or $$\left( {1 – \alpha } \right) \times 100\% $$. For example, if $$\alpha = 0.05$$ or 5% then $$\left( {1 – \alpha } \right) = 0.95$$ or 95% would be the level of confidence. To have the shortest interval the margin of error $$\alpha $$ is distributed equally on the two sides of the curve, as shown in the figure below:
From the above discussion we can write the following probability statement: $$P\left[ { – {Z_{\alpha /2}} < Z < {Z_{\alpha /2}}} \right] = \left( {1 – \alpha } \right)$$. The $$\left( {1 – \alpha } \right)$$ 100% confidence interval estimate of the mean can be obtained by replacing Z with $$\overline X – \mu /\frac{\sigma }{{\sqrt n }}$$, thus
\[\begin{gathered} P\left[ { – {Z_{\alpha /2}} < \frac{{\overline X – \mu }}{{\sigma /\sqrt n }} < {Z_{\alpha /2}}} \right] = 1 – \alpha \\ P\left[ { – {Z_{\alpha /2}}\frac{\sigma }{{\sqrt n }} < \overline X – \mu < {Z_{\alpha /2}}\frac{\sigma }{{\sqrt n }}} \right] = 1 – \alpha \\ P\left[ { – \overline X – {Z_{\alpha /2}}\frac{\sigma }{{\sqrt n }} < – \mu < – \overline X + {Z_{\alpha /2}}\frac{\sigma }{{\sqrt n }}} \right] = 1 – \alpha \\ P\left[ {\overline X + {Z_{\alpha /2}}\frac{\sigma }{{\sqrt n }} > \mu > \overline X – {Z_{\alpha /2}}\frac{\sigma }{{\sqrt n }}} \right] = 1 – \alpha \\ P\left[ {\overline X – {Z_{\alpha /2}}\frac{\sigma }{{\sqrt n }} < \mu < \overline X + {Z_{\alpha /2}}\frac{\sigma }{{\sqrt n }}} \right] = 1 – \alpha \\ \end{gathered} \]
The $$\left( {1 – \alpha } \right)$$ 100% lower and upper confidence limits are, therefore, $$\overline X \mp {Z_{\alpha /2}}\frac{\sigma }{{\sqrt n }}$$.
As a specific example, let $$\alpha = 0.10$$, so that $$\left( {1 – \alpha } \right) = 0.90$$ or 90%. The probability statement may thus be written as
$$P\left[ { – {Z_{0.05}} < Z < {Z_{0.05}}} \right] = 0.90$$ [Since $$\alpha = 0.1$$, $$\therefore \frac{\alpha }{2} = 0.05$$]
Consulting the normal table, $${Z_{0.05}} = 1.64$$. Hence we can write P [–1.64 <Z< 1.64] = 0.90.
Replacing Z with $$\overline X – \mu /\frac{\sigma }{{\sqrt n }}$$, we have $$P\left[ { – 1.64 < \frac{{\overline X – \mu }}{{\sigma /\sqrt n }} < 1.64} \right] = 0.90$$
Upon simplification, this reduces to $$P\left[ {\overline X – 1.64\frac{\sigma }{{\sqrt n }} < \mu < \overline X + 1.64\frac{\sigma }{{\sqrt n }}} \right] = 0.90$$.
Therefore, the 90% confidence limits are: $$\overline X \mp 1.64\frac{\sigma }{{\sqrt n }}$$.
$$\sigma $$ is usually not known; therefore, for large samples it can be replaced by S (the sample standard deviation) which may be calculated from the sample by using the following formula: $${S^2} = \frac{{\sum {{\left( {{X_i} – \overline X } \right)}^2}}}{{n – 1}}$$.
Proceeding along the same lines, the 95 and 99 percent confidence limits may be stated as
$$\overline X \mp 1.96\frac{\sigma }{{\sqrt n }}$$ for 95%
$$\overline X \mp 2.57\frac{\sigma }{{\sqrt n }}$$ for 99%
Example: A random sample of 64 MS students achieved an average score of 60 with a standard deviation score of 15. Construct a 99% confidence interval estimation for the mean score of the entire class.
Solution: We have the following data: $$\overline X = 60,\,\,\,\,n = 64,\,\,\,\sigma = 15$$
The formula for 99% confidence interval may be written as
\[P\left[ {\overline X – 2.57\frac{\sigma }{{\sqrt n }} < \mu < \overline X + 2.57\frac{\sigma }{{\sqrt n }}} \right]\]
The lower confidence limit is
\[\begin{gathered} \overline X – 2.57\frac{\sigma }{{\sqrt n }} = 60 – 2.57\left( {\frac{{15}}{8}} \right) \\ \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\, = 60 – 4.82 = 55.18 \approx 55 \\ \end{gathered}\]
The upper confidence limit is
\[\begin{gathered} \overline X + 2.57\frac{\sigma }{{\sqrt n }} = 60 + 2.57\left( {\frac{{15}}{8}} \right) \\ \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\, = 60 + 4.82 = 64.82 \approx 65 \\ \end{gathered}\]
Hence, the 99% confidence interval estimate for the mean score will be $$55 < \mu < 65$$, i.e., the mean score is between 55 and 65.