6.1 The Standard Normal Distribution

Ram Subedi

6.1 The Standard Normal Distribution

The normal distribution is an extremely important distribution in statistics. When graphing naturally occurring data from many real life scenarios, the shape of the data distribution takes a bell-shaped symmetric form.

What is a distribution?

Introduction to the Normal Distribution

The curve enveloping the histogram is called a normal curve. It is bell-shaped, symmetric about the center. As a symmetric graph, note that the mean, median, and mode are all at the center as well.

Notation
[latex]X \sim N(\mu, \sigma)[/latex] means that the random variable [latex]X[/latex] is normally distributed with mean of [latex]\mu[/latex] and standard deviation [latex]\sigma[/latex].

DESMOS DEMO

Use the Desmos Demo to explore the effects of changing the mean and standard deviation on the shape and center of normal distributions.

Properties of the Normal Curve

The curve has inflection points, the points where the graph changes curvature, at exactly one standard deviation from the mean.
The total area under the curve is equal to 100% (or 1 in decimal).
The area under the curve to the left of the mean is equal to the area under the curve to the right of mean, which equals 0.5.
The graph tails off towards the horizontal axis on the left and right. The x-axis is the horizontal asymptote.

What makes a distribution NORMAL?

Recall from UNIT 2 ...

In Unit 2, we explored z-scores as well as the Empirical Rule for bell-shaped distributions. We'll look at these bell-shaped distributions, more specifically, normal distributions in this unit.

Normal distribution problem: z-scores

Normal distributions and the empirical rule

Probability and Normal Distributions

Since the total area under the normal curve equals 1, and its horizontal axis represents all possible outcomes of the random variable whose distribution is represented by the normal curve, we can think of the areas under the curve as probabilities.

This is easier to comprehend if we look at a more familiar shape than the normal curve. Let's say we have a sandbox in the shape of a square of side 3 feet in a playground. A child randomly tosses a ball into the sandbox from a distance. Let's assume that the child is equally likely to hit any place inside the sandbox and will never miss the sandbox.

A 2 by 2 grid with numbers 1, 2, 3, and 4. The top right cell containing the number 2 is shaded blue. — Sandbox

If we divide the sandbox into 4 quarters and label them 1, 2, 3, and 4 as follows, what are the chances (or what is the probability) that the child will hit the quarter 2? Since the area shaded is 25% of the total, we're going to say the chances are 25% or 0.25.

If we observe the child play this game all day, we'd expect 25% of the tosses to land in the quarter labeled 2. That is the proportion of the observations landing in quarter 2 will be 0.25. Similarly, under the normal curve as well, we can say that since the mean is at the center (under the peak), if we randomly select a value of the random variable whose distribution is normal, we'll see that there's a 50% chance that it will be below the mean and the same chance for it being above the mean.

In general, the area under the normal curve over some interval represents the probability (or the chances) of observing those values of the random variable in that interval. It is also the proportion of the outcomes of the random variable that are expected to be in that interval. When we want to calculate probabilities for a given interval, say (3, 6), the area will be different depending on the mean and the standard deviation of the normal distribution. Instead of having to deal with infinitely many normal distributions, statisticians, especially before technology became ubiquitous, simply transformed the given normal distribution to a standard normal distribution and computed the appropriate area from the standard normal distribution using standard normal or z-tables.

The Standard Normal Distribution

The standard normal distribution is a normal distribution of standardized values called z-scores. Note that z-scores answers the question: How many standard deviations away from the mean is this given data? z-scores are measured in units of the standard deviation. For any normal distribution, the z-score tells you how many standard deviations a value x is above (to the right of) or below (to the left of) the mean, [latex]\mu[/latex]. Values of x that are larger than the mean have positive z-scores, and values of x that are smaller than the mean have negative z-scores. The z-score of the mean [latex]\mu[/latex] is 0. No matter what the mean and the standard deviations are for a normal distribution, we can always talk about that normal distribution in terms of how many standard deviations away from their mean is any given x value. This leads us to the standard normal distribution, which has a mean of 0 and standard deviation of 1. We can transform a normal distribution with any mean, [latex]\mu[/latex], and any positive standard deviation, [latex]\sigma[/latex], into a standard normal distribution with the z-score formula: \[z=\frac{x-\mu}{\sigma}.\]Standard Normal Distribution is also known as the [latex]z-[/latex]distribution: [latex]Z \sim N(0,1)[/latex].

Since we have powerful technology at our fingertips, we'll NOT be using z-Tables in this course. Instead, we'll calculate normal distribution probabilities using either an online or a physical calculator. It is also not necessary to transform every normal distribution to a z-distribution for calculations as our calculators can handle that. We still need to understand z-scores because they play a prominent role in various statistical methods we're about to meet in this course.

Standard Normal Distribution Probability Calculations

There are going to be two types of calculations/operations we will need to be familiar with when working with probabilities and normal distributions.

Find AREA/PROBABILITY given VALUE
Given [latex]z-[/latex]score , find the associated probability (area)
Find VALUE given AREA/PROBABILITY
Given a probability (an area), find the [latex]z-[/latex]score

EXAMPLE

Find AREA/PROBABILITY given VALUE

For the standard normal variable [latex]Z[/latex], find the following probabilities:

[latex]P(z\le 1.74)[/latex].
[latex]P(z\gt -2.48)[/latex].
[latex]P(-2.37 \le z \le 1.02)[/latex].

SHOW SOLUTION

DESMOS CALCULATOR
Calculator Usage Guide

In the first input box on Desmos calculator, enter: \[\text{normaldist}()\] and click on the magnifying glass icon to see the normal curve.

1. [latex]P(z\le 1.74)[/latex]
Click on the ▶ Cumulative Probability to open the inputs panel.

As we are looking for the probability/area associated with z-score of LESS than [latex]1.74[/latex], choose Left for the REGION and Area for COMPUTE.

Enter 1.74 in the entry box inside the probability expression displayed:

\[P(x \le \colorbox{#ccffcc} {$\;1.74\;$} ) = \colorbox{#cccccc}{$\;0.959\;\vcenter{\tiny{▾}}\;$}\]

Result: Left tail area is shaded on the graph. Area/Probability [latex]=0.959070491021[/latex]

Your answer will be in the grayed-out box with a down arrow at the end. Click on the down arrow next to the answer to see more decimals. You can paste the answer into the cell (expression list) by clicking on the icon next to the answer.

2. [latex]P(z\gt -2.48)[/latex]
Click on the ▶ Cumulative Probability to open the inputs panel.

As we are looking for the probability/area associated with z-score of GREATER than [latex]-2.48[/latex], choose Right for the REGION and Area for COMPUTE.

Enter -2.48 in the entry box inside the probability expression displayed:

\[P(x \gt \colorbox{#ccffcc} {$\;-2.48\;$} ) = \colorbox{#cccccc}{$\;0.993\;\vcenter{\tiny{▾}}\;$}\]

Result: Right tail area is shaded on the graph. Area/Probability [latex]=0.993430880864[/latex]

3. [latex]P(-2.37 \le z \le 1.02)[/latex]
Click on the ▶ Cumulative Probability to open the inputs panel.

As we are looking for the probability/area associated with z-score of In-Between two values, choose Inner for the REGION and Area for COMPUTE.

Enter -2.37 and 1.02 in the probability expression displayed:

\[P(\colorbox{#ccffcc} {$\;-2.37\;$} \le x \le \colorbox{#ccffcc} {$\;1.02\;$} ) = \colorbox{#cccccc}{$\;0.837\;\vcenter{\tiny{▾}}\;$}\]

Result: Inner (middle) area is shaded on the graph. Area/Probability [latex]=0.837241726997[/latex]

View on Desmos

EXAMPLE

Find VALUE given AREA/PROBABILITY

For the standard normal variable [latex]Z[/latex], find the following:

Find the z-score that separates the top 10% of the area.
Find the z-score corresponding to the 90th percentile.
Find the z-scores that separate the middle 95% of the area.

SHOW SOLUTION

DESMOS CALCULATOR
Calculator Usage Guide

In the first input box on Desmos calculator, enter: \[\text{normaldist}()\] and click on the magnifying glass icon to see the normal curve.

1. Find the z-score that separates the top 10% of the area.

We want the z-score that cuts off the top 10% of the normal curve. This means 10% of the area is on the right and 90% is on the left of that z-score.

Click on the ▶ Cumulative Probability to open the inputs panel. Select Right for the REGION and Bounds for COMPUTE, then enter 0.10 after the [latex]=[/latex] sign in the probability expression:

\[P(x \gt \colorbox{#cccccc}{$\;1.282\;\vcenter{\tiny{▾}}\;$}) =\colorbox{#ccffcc} {$\;0.10\;$} \]

Result: Corresponding right-tail area is shaded on the graph.
z-score [latex]=1.28155156554[/latex]

Your answer will be in the gray box with a down arrow at the end. Click on the down arrow next to the answer to view your answer with more decimals. You can paste the answer into the cell below (in the expression list) by clicking on the snapshot icon next to the answer.

Question: How would the inputs change if we selected Left for the REGION in this question?

2. Find the z-score corresponding to the 90th percentile.

The 90th percentile is the point on the standard normal curve with 90% of the area to its left (and 10% to its right).

Click on the ▶ Cumulative Probability to open the inputs panel. Select Left for the REGION and Bounds for COMPUTE, then enter 0.90 after the [latex]=[/latex] sign in the probability expression:
\[P(x \le \colorbox{#cccccc}{$\;1.282\;\vcenter{\tiny{▾}}\;$}) =\colorbox{#ccffcc} {$\;0.90\;$} \]

Result: Corresponding left-tail area is shaded on the graph.
z-score [latex]=1.28155156554[/latex]
Same answer as above. Why?

Question: If we know a z-score with 90% of the area to its left, how can we determine the z-score with 90% of the area to its right without doing any calculations?

3. Find the z-scores that separate the middle 95% of the area.
The standard normal distribution is symmetric around its mean of zero, so it has the same shape on both sides of the mean. Due to this symmetry, the two z-scores that enclose the middle 95% are the same distance from zero but in opposite directions—one negative z-score on the left and one positive on the right.

Click on the ▶ Cumulative Probability to open the inputs panel.

Select Inner for the REGION and Bounds for COMPUTE, then enter 0.90 after the [latex]=[/latex] sign in the probability expression:

\[P(\colorbox{#cccccc}{$\;-1.96\;\vcenter{\tiny{▾}}\;$} \le x \le \colorbox{#cccccc}{$\;1.96\;\vcenter{\tiny{▾}}\;$}) =\colorbox{#ccffcc} {$\;0.95\;$} \]

Result: Corresponding inner (middle) area is shaded on the graph.
z-scores [latex]=\pm 1.95996398454[/latex]

View on Desmos

Practice

License

Icon for the Creative Commons Attribution-ShareAlike 4.0 International License

Properties of the Normal Curve

Probability and Normal Distributions

The Standard Normal Distribution

Standard Normal Distribution Probability Calculations

License

Share This Book