n=1000 From normal population μ=100 σ=10
n=10000 From normal population μ=100 σ=10
class widths narrower, more of them.
Histogram here scaled to look same size as above. It has ten times the data, so is actually ten times as large, unscaled.
In the limit, as n gets very large, e.g. N, (actually, to ∞), class width narrows to zero
and the histogram becomes smoother and smoother, we can step into a math function and its curve. And do
math stuff.
| ƒ(x) = |
| 😂 |
|---|
| μ=100 σ=10 | ƒ(x) = |
The orange is the area under the curve on the interval [100,110], which is [μ,1σ],
which is an Empircal Rule "slice" of 34.1%.
μ=0 σ=1 is the standard normal function/curve/distribution.
Its function simplifies 😉 to
| ƒ(x) = |
|
| ƒ(z) = |
|
For a normal population, a Z score is the number of standard deviations (σ=1) from the mean (μ=0) of this standard normal curve.
We convert a datum to its z-score because we then find out its CDF (cumulative density function) value
(the area under the curve from -∞ to this z score, which equals the
percent of the population that is less than this datum, which equals the
probability that a random selection from the population will be less than this datum)
by either looking up the CDF in a
Z table or having a calculator or
software calculate it for us.