"

8.1 A Single Population Mean (Known σ)

Parameter values, such as population mean or population proportion, are generally unknown. For example, if we wanted to know the percentage of college students who skip breakfast in the morning, then to know this actual percentage we would have to survey all college students (also need to define who qualifies as a college student). This is not quite easily attainable. The best we can do is to take a representative sample from the population of college students and use the percentage of students in the sample who skip breakfast as an estimate for the entire college student population. However, since a sample doesn’t include every member of the population, these sample statistics are often going to be slightly off from the actual population value of the parameter. The difference between the sample statistic and the population parameter is called the sampling error. We expect the sample results to naturally vary from one sample to another, but knowing the sampling distribution of the sample statistics from the Central Limit Theorem (CLT) allows us to calculate the maximum value of the sample error for different levels of desired confidence. This maximum error of estimate is called the Margin of Error (ME). The margin of error depends on the standard error of the sampling distribution of the sample statistic estimating the parameter and the desired level of confidence:

Margin of Error = (Critical Value) • (Standard Error)

Finding Critical Values

Use Desmos or StatKey. (Note: If you need to use more than 3 decimal places for the critical value, you may want to skip StatKey)

Critical z value using Desmos | Critical z-value using StatKey (no audio)

 

Standard Errors (SE)

Recall that the CLT says the sampling distribution of sample statistics will be normal or approximately normal if CLT requirements are met.

For means, use the Standard Error of the mean = the standard deviation of the sampling distribution of sample means ([latex]\bar x[/latex])

Standard Error of the mean [latex]= \sigma_{\bar x} = \dfrac{\sigma}{\sqrt n}[/latex].

If the population standard deviation,[latex]\sigma[/latex], is unknown, we can use the sample standard deviation, [latex]s[/latex], as a substitute for [latex]\sigma[/latex]. This substitution works well for larger sample sizes. We’ll discuss the approach for smaller sample sizes in the next section.

 

Estimating an Unknown Parameter

In general, we can estimate parameters in a population which are generally unknown with their corresponding sample statistics. A sample statistic used to estimate a population parameter is called a point estimate. We can represent a point estimate as a point on a number line.

Population Parameter Sample Statistic
(Point Estimate)
Mean [latex]\mu[/latex] [latex]\bar x[/latex]
Proportion [latex]p[/latex] [latex]\hat p[/latex]
Standard Deviation [latex]\sigma[/latex] [latex]s[/latex]

When estimating an unknown parameter using a sample statistic, there’s always going to be uncertainty involved due to variability in sampling. What are the chances that our sample statistic exactly equals what we’re trying to estimate in the population? Very little. So, instead we build an interval around the point estimate (sample statistic) by going a certain distance (called Margin of Error) to its left and the same distance to the right. We use that interval to estimate the unknown parameter. This interval is called a confidence interval.

Confidence Interval shown on a line with point estimate and margin of error..
Computing Confidence Intervals
We compute confidence intervals as follows:

  • Start with your point estimate: [latex]\bar x[/latex] (x-bar) or [latex]\hat p[/latex] (p-hat).
  • Subtract the Margin of Error (ME) from the point estimate to obtain the Left endpoint of the confidence interval
  • Add the Margin of Error (ME) to the point estimate to obtain the right endpoint of the confidence interval
    Note that the Margin of Error (ME) is also known as EBM (for means) and as EBP (for proportions)

Confidence Intervals Explained

Three Ways to Write a Confidence Interval
Once you compute the interval, you can represent the interval in multiple ways.
Three Ways to Write a Confidence Interval  © mathbootcamps.com (Link Reported Not working). Try this archive link instead.

  1. Plus minus notation
  2. Interval Notation
  3. Inequality/tri-linear Notation

If you need algebra refresher on how to write interval notations: Interval Notation @Coolmath

EXAMPLE

Suppose the left and right end points of a confidence interval are 30 and 50. Since the point estimate lies right in the middle of the interval, we can find it using the midpoint formula.

The point estimate here is [latex]\dfrac{30+50}{2}=40[/latex]. The left and right end points of the interval are exactly one Margin or Error (ME) away from the point estimate at the center.  The left end point of the interval (at 30) is 10 units to the left from the point estimate and the right end point (at 50) is also 10 away. So the margin of error (ME) = 10.

  1. Point estimate + ME = Right endpoint
  2. Point estimate – ME = Left endpoint

Combine (1) and (2) above together to write the confidence interval (CI):

Confidence Interval = Point estimate [latex]\pm[/latex] Margin of Error

The final answer is:

Confidence Interval [latex]= 40 \pm 10[/latex]

PRACTICE

PRACTICE

Changing the Confidence Level or Sample Size

Effect of Changing the Confidence Level (Example 8.4)

  • Increasing the confidence level increases the error bound, making the confidence interval wider.
  • Decreasing the confidence level decreases the error bound, making the confidence interval narrower.

Effect of Changing the Sample Size (Example 8.5)

  • Increasing the sample size causes the error bound to decrease, making the confidence interval narrower.
  • Decreasing the sample size causes the error bound to increase, making the confidence interval wider.

WORKED OUT EXAMPLE

Suppose that an accounting firm does a study to determine the time needed to complete one person’s tax forms. It randomly surveys 100 people. The sample mean is 23.6 hours. There is a known standard deviation of 7.0 hours. The population distribution is assumed to be normal.

Round your answers to 3 decimals.

  1. Find the point estimate.
  2. In words, define [latex]X[/latex] and [latex]\bar X[/latex].
  3. Which distribution should you use for this problem? Explain your choice.
  4. Construct a 90% confidence interval for the population mean time to complete the tax forms.
  5. If the firm wished to increase its level of confidence and keep the error bound the same by taking another survey, what changes should it make?
  6. If the firm did another survey, kept the error bound the same, and only surveyed 49 people, what would happen to the level of confidence? Why?
  7. Suppose that the firm decided that it needed to be at least 96% confident of the population mean length of time to within one hour. How would the number of people the firm surveys change? Why?
SHOW SOLUTION
  1. Point Estimate, [latex]\bar x = 23.6[/latex] hours
  2. Random variable [latex]X[/latex] is the time needed to complete an individual tax form. [latex]\bar X[/latex] is the mean time to complete tax forms from a sample of 100 customers.
  3. Use z-distribution. WHY? The sampling distribution of sample means will be approximately normal with a mean [latex]\mu_{\bar x} =[/latex]of 23.6 and a standard error [latex]\sigma_{\bar x} = {\dfrac{7}{\sqrt{100}}}[/latex]
  4. Confidence Interval
    Margin of Error  = ( Critical Value ) • ( Standard Error )

    Critical Value On Desmos

    [latex]\begin{cases} \text{$X$ = normaldist()}\\ \text{$X$.inversecdf(0.95)$=1.64485362695$}\\ \end{cases}[/latex]

    [latex]\begin{aligned} \text{Margin of Error} &= \text{( Critical Value ) • ( Standard Error )}\\&= (1.64485362695)\cdot\left(\dfrac{7}{\sqrt{100}}\right)=1.15139753886\end{aligned}[/latex]
    [latex]\begin{aligned}\text{Confidence Interval} &= \text{( Point Estimate ) $\pm$ ( Margin of Error )}\\&=23.6 \pm 1.15139753886\end{aligned}[/latex]

    You can also use an online calculator to obtain the above results:

    SUBEDI Calculator

    Go to Confidence Interval for a Mean calculator @ rsubedi.com

    Confidence Level (in decimal), [latex]c: \fbox{$\mathstrut \;0.90\;$}[/latex]

    Number of Samples

    ✅ ONE SAMPLE
    🔲 TWO SAMPLES

    Input Type:

    ✅ SAMPLE STATISTICS
    🔲 DATA

    Distribution Type to Use?

    ✅ NORMAL
    🔲 Student-t

    Enter Population Standard Deviation [latex]\sigma: \fbox{$\mathstrut \;7\;$}[/latex]

    Enter Sample Statistics.

    Sample size, [latex]n: \fbox{$\mathstrut \;100\;$}[/latex]

    Sample Mean, [latex]\bar x: \fbox{$\mathstrut \;23.6\;$}[/latex]

    CALCULATE
    Results show in a panel to the right. CI is displayed in [latex]\pm[/latex] notation.

    Interval form: [latex](22.449, 24.751)[/latex]

    LibreText Calculator

    Go to: Confidence Interval for a Mean With Statistics from the list of online calculators.

    Enter the following values and press Calculate.

    [latex]\sigma[/latex] unknown
    [latex]\sigma[/latex] known 🔘
    [latex]n: \fbox{$\mathstrut \;100\;$}[/latex]
    [latex]\bar x: \fbox{$\mathstrut \;23.6\;$}[/latex]
    [latex]\sigma: \fbox{$\mathstrut \;7\;$}[/latex]
    [latex]\text{CL}: \fbox{$\mathstrut \;0.90\;$}[/latex]

    CALCULATE

    Results displayed are:

    The results are shown on the right: Lowerbound, Upperbound, and [latex]\hat p[/latex] are given.
    Your confidence interval is (Lowerbound, Upperbound)
  5. It will need to change the sample size. The firm needs to determine what the confidence level should be, then apply the error bound formula to determine the necessary sample size.
  6. The confidence level would increase as a result of a larger interval. Smaller sample sizes result in more variability. To capture the true population mean, we need to have a larger interval.
  7. According to the error bound formula, the firm needs to survey 206 people. Since we increase the confidence level, we need to increase either our error bound or the sample size.

 

 

Sample Size Calculations

According to Central Limit Theorem, sample means follow a normal distribution with a mean equal to the population mean and a standard error of \[\sigma_{\bar x} = \frac{\sigma}{\sqrt n}\] Recall that margin of error is given by \[ME = z_c\cdot (\text{Standard Error})\] that is \[ME = z_c\cdot\frac{\sigma}{\sqrt n} = \frac{z_c\cdot\sigma}{\sqrt n}\]

We solve this for [latex]n[/latex]. Start by multiplying both sides by [latex]\sqrt n[/latex], and then divide the resulting equation by ME on both sides. \[\sqrt n = \frac{z_c\cdot\sigma}{ME}\] Squaring both sides results in \[n = \left(\frac{z_c\cdot\sigma}{ME}\right)^2\]
Calculating the required sample size using this formula will almost always result in a decimal value for [latex]n[/latex]. Since the sample size has to be a positive integer, we will need to do some rounding. We will always round up the final answer for sample size calculations[1] and indicate this in the formula by using the ceiling function (round up to the nearest integer):
\[n = \left\lceil\left(\frac{z_c\cdot\sigma}{ME}\right)^2\right\rceil\]

DEMO EXAMPLE WITH PRACTICE

ONLINE CALCULATOR Approach

SUBEDI Calculator

Go to Sample Size Calculator @ rsubedi.com

Estimate for:

✅ MEAN
🔲 PROPORTION

Population Standard Deviation, [latex]\sigma: \fbox{$\mathstrut \phantom{EMPTY} $}[/latex]

Margin of Error, [latex]ME: \fbox{$\mathstrut \phantom{EMPTY} $}[/latex]

Confidence Level (in decimal), [latex]c: \fbox{$\mathstrut \phantom{EMPTY} $}[/latex]

CALCULATE
Results show in a panel to the right. Be sure to round your answer as necessary.

LibreText Calculator

Go to: Sample Size for a Mean from the list of online calculators.

Enter the following values and press Calculate.

[latex]\sigma: \fbox{$\mathstrut \phantom{EMPTY} $}[/latex]

[latex]\text{E}: \fbox{$\mathstrut \phantom{EMPTY} $}[/latex]

[latex]\text{CL}: \fbox{$\mathstrut \phantom{EMPTY} $}[/latex]

CALCULATE
Results displayed are:

[latex]n[/latex]: The required Sample size.

 

Additional Practice


  1. If the value of [latex]n[/latex] comes out to, say, 46.234, then we need at least that many to ensure all the requirements are met. Although the conventional rounding of 46.234 will give us 46, a sample size of 46 is not enough guarantee that the required margin of error and the confidence level are satisfied. We should, therefore, choose an integer that's just above our calculated decimal value of [latex]n[/latex], which in this case is 47.

License

Icon for the Creative Commons Attribution-ShareAlike 4.0 International License

Statistics Study Guide Copyright © by Ram Subedi is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License, except where otherwise noted.