# Confidence Interval Estimate for the Difference Between Means

**(1) Large Samples**

The difference between two means is of considerable importance in testing the homogeneity of populations. In this tutorial we are concerned with the confidence interval estimate for the difference between two population means.

Arguing non-rigorously from the central limit theorem, we can state that “If we have two populations with means $\mu_1$ and $\mu_2$, and variances $\sigma_1^2$ and $\sigma_2^2$ respectively, then the sampling distribution of the difference of their sample means $\bar{X}_1 - \bar{X}_2$ is approximately normal with mean $\mu_1 - \mu_2$ and standard deviation $\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}$, where $n_1$ and $n_2$ are the two sample sizes, both larger than 30, drawn from the two populations.”

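This formula for the combined standard deviation follows from the theorem that the variance of the sum or the difference of two independent random variables is the sum of their variances. Thus,

$$\operatorname{Var}(\bar{X}_1 - \bar{X}_2) = \operatorname{Var}(\bar{X}_1) + \operatorname{Var}(\bar{X}_2)$$

Hence

$$\sigma^2_{\bar{X}_1 - \bar{X}_2} = \frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}$$

Therefore, the standard deviation of $\bar{X}_1 - \bar{X}_2$ is

$$\sigma_{\bar{X}_1 - \bar{X}_2} = \sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}$$

We can also standardize $\bar{X}_1 - \bar{X}_2$ as follows

$$Z = \frac{(\bar{X}_1 - \bar{X}_2) - (\mu_1 - \mu_2)}{\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}}$$

Here $Z$ is a standard normal variate. From this value of $Z$ we can directly state the $100(1-\alpha)\%$ confidence limits for the difference between two population means as

$$(\bar{X}_1 - \bar{X}_2) \pm z_{\alpha/2}\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}$$

And the confidence interval may be stated as

$$(\bar{X}_1 - \bar{X}_2) - z_{\alpha/2}\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}} < \mu_1 - \mu_2 < (\bar{X}_1 - \bar{X}_2) + z_{\alpha/2}\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}$$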

It must be remembered that the above results hold only for large samples, or for small samples from normal populations provided the population variances are known. If $\sigma_1^2$ and $\sigma_2^2$ are not known, for large samples they can be replaced by the sample variances $s_1^2$ and $s_2^2$, which are computed by the formula $s^2 = \frac{\sum (X - \bar{X})^2}{n - 1}$. The larger of the two sample means should be taken as $\bar{X}_1$.
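The large-sample interval can be sketched in a few lines of Python. This is only an illustration: the function name `large_sample_ci` and the input numbers are hypothetical, and it assumes the interval has the usual form, the difference of sample means plus or minus $z_{\alpha/2}$ times the square root of $\sigma_1^2/n_1 + \sigma_2^2/n_2$, with the critical value taken from the standard library's `statistics.NormalDist`.

```python
from math import sqrt
from statistics import NormalDist


def large_sample_ci(mean1, mean2, var1, var2, n1, n2, confidence=0.95):
    """Return (lower, upper) limits for mu1 - mu2 using the normal distribution.

    var1 and var2 are the population variances, or the sample variances
    when both samples are large. The function name is illustrative only.
    """
    alpha = 1 - confidence
    z = NormalDist().inv_cdf(1 - alpha / 2)   # z_{alpha/2}
    std_error = sqrt(var1 / n1 + var2 / n2)   # SD of the difference of means
    diff = mean1 - mean2
    return diff - z * std_error, diff + z * std_error


# Hypothetical inputs, chosen only to exercise the function
lower, upper = large_sample_ci(52.0, 50.0, 36.0, 64.0, n1=36, n2=64)
print(f"95% CI for mu1 - mu2: ({lower:.3f}, {upper:.3f})")
```

With these made-up inputs the interval contains zero, so the data would not rule out equal population means at the 95% level.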

**(2) Small Samples**

When at least one of the two sample sizes is small, $t$ takes the place of $Z$. Two different kinds of interval estimates are obtained depending on whether the two populations are assumed to have the same variances

$$\sigma_1^2 = \sigma_2^2$$

or unequal variances

$$\sigma_1^2 \neq \sigma_2^2.$$

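If the two populations are assumed to have unknown and unequal population variances

$$\sigma_1^2 \neq \sigma_2^2$$

then the $100(1-\alpha)\%$ confidence limits may be stated as

$$(\bar{X}_1 - \bar{X}_2) \pm t_{\alpha/2}\sqrt{\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}}$$

Here $s_1^2$ and $s_2^2$ are calculated by using the sample-variance formula from the large-sample case.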

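If the two populations are assumed to have equal but unknown population variances

$$\sigma_1^2 = \sigma_2^2$$

then the $100(1-\alpha)\%$ confidence limits may be stated as

$$(\bar{X}_1 - \bar{X}_2) \pm t_{\alpha/2}\, s_p\sqrt{\frac{1}{n_1} + \frac{1}{n_2}}$$

Here

$$s_p^2 = \frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}$$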

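It may be noted that the statistic $t_{\alpha/2}$ from the table should be obtained against $n_1 + n_2 - 2$ degrees of freedom in both of the above cases. (For the unequal-variance case this is an approximation; the Welch–Satterthwaite degrees of freedom are commonly used instead.)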
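The two small-sample intervals can be sketched as follows. This is a minimal illustration under stated assumptions: the function names and the sample figures are hypothetical, the equal-variance case is assumed to use the standard pooled variance $s_p^2 = \frac{(n_1-1)s_1^2 + (n_2-1)s_2^2}{n_1+n_2-2}$, the unequal-variance case is assumed to use the standard error $\sqrt{s_1^2/n_1 + s_2^2/n_2}$, and the critical value of $t$ is passed in as read from a table rather than computed.

```python
from math import sqrt


def pooled_t_ci(mean1, mean2, var1, var2, n1, n2, t_crit):
    """Limits for mu1 - mu2 assuming equal (unknown) population variances.

    t_crit is the tabled t value for n1 + n2 - 2 degrees of freedom.
    """
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * var1 + (n2 - 1) * var2) / df
    std_error = sqrt(pooled_var * (1 / n1 + 1 / n2))
    diff = mean1 - mean2
    return diff - t_crit * std_error, diff + t_crit * std_error


def unpooled_t_ci(mean1, mean2, var1, var2, n1, n2, t_crit):
    """Limits for mu1 - mu2 assuming unequal (unknown) population variances."""
    std_error = sqrt(var1 / n1 + var2 / n2)
    diff = mean1 - mean2
    return diff - t_crit * std_error, diff + t_crit * std_error


# Hypothetical samples: n1 = 10, n2 = 12, so df = 20 and the tabled
# two-sided 95% value is t = 2.086.
args = dict(mean1=20.0, mean2=17.0, var1=4.0, var2=6.0, n1=10, n2=12, t_crit=2.086)
pooled = pooled_t_ci(**args)
unpooled = unpooled_t_ci(**args)
print("pooled:   (%.3f, %.3f)" % pooled)
print("unpooled: (%.3f, %.3f)" % unpooled)
```

Passing `t_crit` explicitly keeps the sketch free of third-party dependencies and mirrors the table lookup described in the text.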

__Example__:

A random sample of 100 students from an MBA class had an average score of 60 with a standard deviation score of 15 in statistics. A random sample of 64 students from a BS class had an average score of 66 with a standard deviation of 16 in the same course. Construct a 95% confidence interval for the difference between the mean score of the two classes.

__Solution__:

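Since both sample sizes are large we will use the $Z$ statistic to construct the interval. Taking the BS class, which has the larger sample mean, as the first sample, we have the following information:

$$n_1 = 64,\quad \bar{X}_1 = 66,\quad s_1 = 16; \qquad n_2 = 100,\quad \bar{X}_2 = 60,\quad s_2 = 15; \qquad z_{0.025} = 1.96$$

Using the large-sample formula with the sample variances in place of the unknown population variances, the 95% lower confidence limit for the difference between two population means would be

$$(66 - 60) - 1.96\sqrt{\frac{16^2}{64} + \frac{15^2}{100}} = 6 - 1.96\sqrt{6.25} = 6 - 1.96(2.5) = 1.1$$

Also, the upper limit would be

$$(66 - 60) + 1.96\sqrt{\frac{16^2}{64} + \frac{15^2}{100}} = 6 + 1.96(2.5) = 10.9$$

Hence, the 95% confidence interval for the difference between the two population means is

$$1.1 < \mu_1 - \mu_2 < 10.9$$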
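The arithmetic of this example can be checked with a short script. It is a sketch rather than part of the original solution: the variable names are illustrative, the class with the larger sample mean (BS) is taken as the first sample, and the tabled value $z = 1.96$ is used for the 95% level.

```python
from math import sqrt

# Data from the example: BS class as sample 1, MBA class as sample 2
n1, mean1, s1 = 64, 66.0, 16.0
n2, mean2, s2 = 100, 60.0, 15.0
z = 1.96  # tabled two-sided 95% critical value of the standard normal

# Standard error of the difference, with sample variances replacing
# the unknown population variances (both samples are large)
std_error = sqrt(s1**2 / n1 + s2**2 / n2)
diff = mean1 - mean2

lower = diff - z * std_error
upper = diff + z * std_error
print(f"standard error = {std_error:.2f}")
print(f"95% CI for mu1 - mu2: ({lower:.1f}, {upper:.1f})")
```

Because the whole interval lies above zero, the sample suggests that the BS class mean is higher than the MBA class mean at the 95% confidence level.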