{{Probability distribution|
name =Normal|
type =density|
pdf_image =

Probability density function for the normal distribution
The green line is the standard normal distribution|
cdf_image =

Cumulative distribution function for the normal distribution
Colors match the image above|
parameters =
location (
real)
squared
scale (real)|
support =
|
pdf =
|
cdf =
|
mean =
|
median =
|
mode =
|
variance =
|
skewness =0|
kurtosis =0|
entropy =
|
mgf =
|
char =
|
}}
The 'normal distribution', also called the 'Gaussian distribution', is an important family of
continuous probability distributions, applicable in many fields.
Each member of the family may be defined by two parameters, ''location'' and ''scale'': the
mean ("average") and
standard deviation ("variability"), respectively.
The 'standard normal distribution' is the normal distribution with a
mean of zero and a
variance of one (the green curves in the plots to the right).
Carl Friedrich Gauss became associated with this set of distributions when he analyzed astronomical data using them
[1], and defined the equation of its probability density function.
It is often called the 'bell curve' because the graph of its
probability density resembles a
bell.
The importance of the normal distribution as a model of quantitative phenomena in the
natural and
behavioral sciences is due to the
central limit theorem. Many
psychological measurements and
physical phenomena (like
noise) can be approximated well by the normal distribution. While the mechanisms underlying these phenomena are often unknown, the use of the normal model can be theoretically justified by assuming that many small, independent effects are additively contributing to each observation.
The normal distribution also arises in many areas of
statistics. For example, the
sampling distribution of the
sample mean is approximately normal, even if the distribution of the population from which the sample is taken is not normal. In addition, the normal distribution maximizes
information entropy among all distributions with known mean and variance, which makes it the natural choice of underlying distribution for data summarized in terms of sample mean and variance. The normal distribution is the most widely used family of distributions in statistics and many statistical tests are based on the assumption of normality. In
probability theory, normal distributions arise as the
limiting distributions of several continuous and
discrete families of distributions.
History
The normal distribution was first introduced by
Abraham de Moivre in an article in
1734, which was reprinted in the second edition of his ''
The Doctrine of Chances'',
1738 in the context of approximating certain
binomial distributions for large ''n''. His result was extended by
Laplace in his book ''
Analytical Theory of Probabilities'' (
1812), and is now called the
theorem of de Moivre-Laplace.
Laplace used the normal distribution in the
analysis of errors of experiments. The important
method of least squares was introduced by
Legendre in
1805.
Gauss, who claimed to have used the method since
1794, justified it rigorously in
1809 by assuming a normal distribution of the errors.
The name "bell curve" goes back to
Jouffret who first used the term "bell surface" in
1872 for a
bivariate normal with independent components. The name "normal distribution" was coined independently by
Charles S. Peirce,
Francis Galton and
Wilhelm Lexis around
1875. This terminology unfortunately encourages the fallacy that many or all other probability distributions are not "normal". (See the discussion of "occurrence" below.)
Characterization
There are various ways to
characterize a
probability distribution. The most visual is the
probability density function (PDF); the PDF of the normal distribution is plotted at the beginning of this article. Equivalent ways are the
cumulative distribution function, the
moments, the
cumulants, the
characteristic function, the
moment-generating function, the cumulant-
generating function, and
Maxwell's theorem. See
probability distribution for a discussion.
To indicate that a real-valued
random variable ''X'' is normally distributed with mean ''μ'' and variance ''σ''
2 ≥ 0, we write
:
While it is certainly useful for certain limit theorems (e.g.
asymptotic normality of estimators) and for the theory of
Gaussian processes to consider the probability distribution concentrated at ''μ'' (see
Dirac measure) as a normal distribution with mean ''μ'' and variance ''σ''
2 = 0, this degenerated case is often excluded from the considerations because no density with respect to the
Lebesgue measure exists.
Probability density function
The continuous
probability density function of the 'normal distribution' is the
Gaussian function
:
where ''σ'' > 0 is the
standard deviation, the real parameter ''μ'' is the
expected value, and
:
is the density function of the "standard" normal distribution, i.e., the normal distribution with ''μ'' = 0 and ''σ'' = 1. To verify that the
integral of
over the
real line is indeed equal to one, see
Gaussian integral.
As a Gaussian function with the denominator of the exponent equal to 2, the standard normal density function
is an
eigenfunction of the
Fourier transform.
Some notable qualities of the probability density function:
★ The density function is symmetric about its mean value ''μ''.
★ The mean ''μ'' is also its
mode and
median.
★ The
inflection points of the curve occur at one standard deviation away from the mean, i.e. at ''μ'' − ''σ'' and ''μ'' + ''σ''.
Cumulative distribution function
The
cumulative distribution function (cdf) of a
probability distribution, evaluated at a number (lower-case) ''x'', is the probability of the event that a
random variable (capital) ''X'' with that distribution is less than or equal to ''x''. The cumulative distribution function of the normal distribution is expressed in terms of the density function as follows:
:
where the standard normal cdf Φ is just the general cdf evaluated with ''μ'' = 0 and ''σ'' = 1:
:
The standard normal cdf can be expressed in terms of a
special function called the
error function, as
:
and the cdf itself can hence be expressed as
:
The inverse standard normal cumulative distribution function, or
quantile function, can be expressed in terms of the inverse error function:
:
and the inverse cumulative distribution function can hence be expressed as
:
This quantile function is sometimes called the
probit function. There is no elementary
primitive for the probit function. This is not to say merely that none is known, but rather that the non-existence of such a function has been proved. Several accurate methods exist for approximating the quantile function for the normal distribution - see
quantile function for a discussion and references.
The values Φ(''x'') may be approximated very accurately by a variety of methods, such as
numerical integration,
Taylor series,
asymptotic series and
continued fractions.
Strict lower and upper bounds for the cdf
For large ''x'' the standard normal cdf
is close to 1 and
is close to 0. The elementary bounds
:
in terms of the density
are useful.
Using the
substitution ''v'' = ''u''
2/2, the upper bound is derived as follows:
:
Similarly, using
and the
quotient rule,
:
Solving for
provides the lower bound.
Generating functions
Moment generating function
The
moment generating function is defined as the
expected value of exp(''tX''). For a normal distribution, the moment generating function is
:
as can be seen by
completing the square in the exponent.
Cumulant generating function
The
cumulant generating function is the logarithm of the moment generating function: ''g''(''t'') = μ''t'' + σ
2''t''
2/2. Since this is a quadratic polynomial in ''t'', only the first two cumulants are nonzero.
Characteristic function
The
characteristic function is defined as the
expected value of
, where
is the
imaginary unit. So the characteristic function is obtained by replacing ''t'' with ''i t'' in the moment-generating function.
For a normal distribution, the characteristic function is
:
Properties
Some of the properties of the normal distribution:
#If
and
and
are
real numbers, then
(see
expected value and
variance).
#If
and
are
independent normal
random variables, then:
#
★ Their sum is normally distributed with
(
proof).
#
★ Their difference is normally distributed with
.
#
★ If the variances of ''X'' and ''Y'' are equal, then ''U'' and ''V'' are independent of each other.
#
★ The
Kullback-Leibler divergence,
#If
and
are independent normal random variables, then:
#
★ Their product
follows a distribution with density
given by
#
★ :
where
is a
modified Bessel function of the second kind.
#
★ Their ratio follows a
Cauchy distribution with
. Thus the Cauchy distribution is a special kind of
ratio distribution.
#If
are independent standard normal variables, then
has a
chi-square distribution with ''n'' degrees of freedom.
Standardizing normal random variables
As a consequence of Property 1, it is possible to relate all normal random variables to the standard normal.
If
~
, then
:
is a standard normal random variable:
~
.
An important consequence is that the cdf of a general normal distribution is therefore
:
Conversely, if
is a standard normal distribution,
~
, then
:
is a normal random variable with mean
and variance
.
The standard normal distribution has been tabulated (usually in the form of value of the cumulative distribution function Φ), and the other normal distributions are the simple transformations, as described above, of the standard one. Therefore, one can use tabulated values of the cdf of the standard normal distribution to find values of the cdf of a general normal distribution.
Moments
Some of the first few
moments of the normal distribution are:
| Number | Raw moment | Central moment | Cumulant |
|---|
| 0 | 1 | 1 | |
| 1 | | 0 | |
| 2 | | | |
| 3 | | 0 | 0 |
| 4 | | | 0 |
All
cumulants of the normal distribution beyond the second are zero.
Generating values for normal random variables
For computer simulations, it is often useful to generate values that have a normal distribution.
There are several methods and the most basic is to invert the standard normal cdf.
More efficient methods are also known, one such method being the
Box-Muller transform. An even faster algorithm is the
ziggurat algorithm.
The Box-Muller algorithm says that, if you have two numbers ''a'' and ''b''
uniformly distributed on (0, 1], (e.g. the output from a
random number generator), then two standard normally distributed random variables are ''c'' and ''d'', where:
:
:
This is because the chi-square distribution with two degrees of freedom (see property 4 above) is an easily-generated exponential random variable.
The central limit theorem
Main articles: central limit theorem

Plot of the pdf of a normal distribution with μ = 12 and σ = 3, approximating the pdf of a binomial distribution with ''n'' = 48 and ''p'' = 1/4
The normal distribution has the very important property that under certain conditions, the distribution of a sum of a large number of identically distributed
independent variables is approximately normal.
This is the central limit theorem.
The practical importance of the central limit theorem is that the normal distribution can be used as an approximation to some other distributions.
★ A
binomial distribution with parameters
and
is approximately normal for large
and
not too close to 1 or 0 (some books recommend using this approximation only if
and
are both at least 5; in this case, a
continuity correction should be applied).
The approximating normal distribution has mean
and variance
.
★ A
Poisson distribution with parameter
is approximately normal for large
.
The approximating normal distribution has mean
and variance
.
Whether these approximations are sufficiently accurate depends on the purpose for which they are needed, and the rate of convergence to the normal distribution.
It is typically the case that such approximations are less accurate in the tails of the distribution.
Infinite divisibility
The normal distributions are
infinitely divisible probability distributions.
Stability
The normal distributions are strictly
stable probability distributions.
Standard deviation

Dark blue is less than one
standard deviation from the
mean. For the normal distribution, this accounts for about 68% of the set (dark blue) while two standard deviations from the mean (medium and dark blue) account for about 95% and three standard deviations (light, medium, and dark blue) account for about 99.7%.
About 68% of values drawn from a standard normal distribution are within one standard deviation away from the mean; about 95% of the values are within two standard deviations and about 99.7% lie within three standard deviations. This is known as the "
68-95-99.7 rule" or the "
empirical rule."
To be more precise, the area under the curve between
and
is
:
where erf(''x'') is the
error function. To 12 decimal places the first 6 values of the 1,2 and 3 sigma points are given in the following table:
{| class="wikitable" style="text-align:center"
|- bgcolor="#CCCCCC"
!
!!
|-
|1 || 0.682689492137
|-
|2 || 0.954499736104
|-
|3 || 0.997300203937
|-
|4 || 0.999936657516
|-
|5 || 0.999999426697
|-
|6 || 0.999999998027
|}
Exponential family
The Normal distribution is a two-parameter
exponential family with natural parameters
and
, and natural statistics
and
.
Complex Gaussian process
Consider complex Gaussian random variable,
:
where
and
are real and independent Gaussian variables with equal variances
. The pdf of the joint variables is then
:
Because
, the resulting PDF for the complex Gaussian variable
is
:
Related distributions
★
is a
Rayleigh distribution if
where
and
are two independent normal distributions.
★
is a
chi-square distribution with
degrees of freedom if
where
for
and are independent.
★
is a
Cauchy distribution if
for
and
are two
independent normal distributions.
★
is a
log-normal distribution if
and
.
★ Relation to
Lévy skew alpha-stable distribution: if
then
.
★
Truncated normal distribution. If,
then, truncating below at
and above at
will lead to a random variable with mean
, where
and
and
, where
is the
probability density function of a standard normal random variable.
★ If
is a random variable with a normal distribution, and
, then
has a
folded normal distribution.
Descriptive and inferential statistics
Scores
Many scores are derived from the normal distribution, including
percentile ranks ("percentiles"),
normal curve equivalents,
stanines,
z-scores, and T-scores. Additionally, a number of behavioral
statistical procedures are based on the assumption that scores are normally distributed; for example,
t-tests and
ANOVAs (see below).
Bell curve grading assigns relative grades based on a normal distribution of scores.
Normality tests
Main articles: normality test
Normality tests check a given set of data for similarity to the normal distribution. The
null hypothesis is that the data set is similar to the normal distribution, therefore a sufficiently small
P-value indicates non-normal data.
★
Kolmogorov-Smirnov test
★
Lilliefors test
★
Anderson-Darling test
★
Ryan-Joiner test
★
Shapiro-Wilk test
★
Normal probability plot (
rankit plot)
★
Jarque-Bera test
Estimation of parameters
Maximum likelihood estimation of parameters
Suppose
:
are
independent and each is normally distributed with expectation ''μ'' and variance ''σ''
2 > 0. In the language of statisticians, the observed values of these ''n'' random variables make up a "sample of size ''n'' from a normally distributed population." It is desired to estimate the "population mean" ''μ'' and the "population standard deviation" ''σ'', based on the observed values of this sample. The continuous joint probability density function of these ''n'' independent random variables is
:
As a function of ''μ'' and ''σ'', the
likelihood function based on the observations ''X''
1, ..., ''X''
''n'' is
:
with some constant ''C'' > 0 (which in general would be even allowed to depend on ''X''
1, ..., ''X''
''n'', but will vanish anyway when partial derivatives of the log-likelihood function with respect to the parameters are computed, see below).
In the method of
maximum likelihood, the values of ''μ'' and ''σ'' that maximize the likelihood function are taken as estimates of the population parameters ''μ'' and ''σ''.
Usually in maximizing a function of two variables, one might consider
partial derivatives. But here we will exploit the fact that the value of ''μ'' that maximizes the likelihood function with ''σ'' fixed does not depend on ''σ''. Therefore, we can find that value of ''μ'', then substitute it for ''μ'' in the likelihood function, and finally find the value of ''σ'' that maximizes the resulting expression.
It is evident that the likelihood function is a decreasing function of the sum
:
So we want the value of ''μ'' that ''minimizes'' this sum. Let
:
be the "sample mean" based on the ''n'' observations. Observe that
:
Only the last term depends on ''μ'' and it is minimized by
:
That is the maximum-likelihood estimate of ''μ'' based on the ''n'' observations ''X''
1, ..., ''X''
''n''. When we substitute that estimate for ''μ'' into the likelihood function, we get
:
It is conventional to denote the "log-likelihood function", i.e., the logarithm of the likelihood function, by a lower-case
, and we have
:
and then
:
This derivative is positive, zero, or negative according as ''σ''
2 is between 0 and
:
or equal to that quantity, or greater than that quantity. (If there is just one observation, meaning that ''n'' = 1, or if ''X''
1 = ... = ''X''
''n'', which only happens with probability zero, then
by this formula, reflecting the fact that in these cases the likelihood function is unbounded as ''σ'' decreases to zero.)
Consequently this average of squares of
residuals is the maximum-likelihood estimate of ''σ''
2, and its square root is the maximum-likelihood estimate of ''σ'' based on the ''n'' observations. This estimator
is
biased, but has a smaller
mean squared error than the usual unbiased estimator, which is ''n''/(''n'' − 1) times this estimator.
Surprising generalization
The derivation of the maximum-likelihood estimator of the
covariance matrix of a
multivariate normal distribution is subtle. It involves the
spectral theorem and the reason it can be better to view a
scalar as the
trace of a 1×1
matrix than as a mere scalar. See
estimation of covariance matrices.
Unbiased estimation of parameters
The maximum likelihood estimator of the population mean
from a sample is an
unbiased estimator of the mean, as is the variance when the mean of the population is known ''a priori''. However, if we are faced with a sample and have no knowledge of the mean or the variance of the population from which it is drawn, the unbiased estimator of the variance
is:
:
This "sample variance" follows a
Gamma distribution if all ''X'' are independent identically distributed (iid):
:
Occurrence
''Approximately'' normal distributions occur in many situations, as a result of the
central limit theorem.
When there is reason to suspect the presence of a large number of small effects ''acting additively and independently'', it is reasonable to assume that observations will be normal.
There are statistical methods to empirically test that assumption, for example the
Kolmogorov-Smirnov test.
Effects can also act as 'multiplicative' (rather than additive) modifications. In that case, the assumption of normality is not justified, and it is the
logarithm of the variable of interest that is normally distributed. The distribution of the directly observed variable is then called
log-normal.
Finally, if there is a single external influence which has a large effect on the variable under consideration, the assumption of normality is not justified either. This is true even if, when the external variable is held constant, the resulting marginal distributions are indeed normal. The full distribution will be a superposition of normal variables, which is not in general normal. This is related to the theory of errors (see below).
To summarize, here is a list of situations where approximate normality
is sometimes assumed. For a fuller discussion, see below.
★ In counting problems (so the
central limit theorem includes a discrete-to-continuum approximation) where
reproductive random variables are involved, such as
★
★
Binomial random variables, associated to yes/no questions;
★
★
Poisson random variables, associated to
rare events;
★ In physiological measurements of biological specimens:
★
★ The ''logarithm'' of measures of size of living tissue (length, height, skin area, weight);
★
★ The ''length'' of ''inert'' appendages (hair, claws, nails, teeth) of biological specimens, ''in the direction of growth''; presumably the thickness of tree bark also falls under this category;
★
★ Other physiological measures may be normally distributed, but there is no reason to expect that ''a priori'';
★ Measurement errors are often ''assumed'' to be normally distributed, and any deviation from normality is considered something which should be explained;
★ Financial variables
★
★ Changes in the ''logarithm'' of exchange rates, price indices, and stock market indices; these variables behave like compound interest, not like simple interest, and so are multiplicative;
★
★ Other financial variables may be normally distributed, but there is no reason to expect that ''a priori'';
★ Light intensity
★
★ The intensity of laser light is normally distributed;
★
★ Thermal light has a
Bose-Einstein distribution on very short time scales, and a normal distribution on longer timescales due to the central limit theorem.
Of relevance to biology and economics is the fact that complex systems tend to display
power laws rather than normality.
Photon counting
Light intensity from a single source varies with time, as thermal fluctuations can be observed if the light is analyzed at sufficiently high time resolution. The intensity is usually assumed to be normally distributed.
Quantum mechanics interprets measurements of light intensity as
photon counting. The natural assumption in this setting is the
Poisson distribution. When light intensity is integrated over times longer than the coherence time and is large, the Poisson-to-normal limit is appropriate.
Measurement errors
Normality is the ''central 'assumption''' of the mathematical
theory of errors. Similarly, in statistical model-fitting, an indicator of goodness of fit is that the
residuals (as the errors are called in that setting) be independent and normally distributed. The assumption is that any deviation from normality needs to be explained. In that sense, both in model-fitting and in the theory of errors, normality is the only observation that need not be explained, being expected. However, if the original data are not normally distributed (for instance if they follow a
Cauchy distribution), then the residuals will also not be normally distributed. This fact is usually ignored in practice.
Repeated measurements of the same quantity are expected to yield results which are clustered around a particular value. If all major sources of errors have been taken into account, it is ''assumed'' that the remaining error must be the result of a large number of very small ''additive'' effects, and hence normal. Deviations from normality are interpreted as indications of systematic errors which have not been taken into account. Whether this assumption is valid is debatable.
Physical characteristics of biological specimens
The sizes of full-grown animals is approximately lognormal.
The evidence and an explanation based on models of growth was first published in the 1932 book ''
Problems of Relative Growth'' by
Julian Huxley.
Differences in size due to sexual dimorphism, or other polymorphisms like the worker/soldier/queen division in social insects, further make the distribution of sizes deviate from lognormality.
The assumption that linear size of biological specimens is normal (rather than lognormal) leads to a non-normal distribution of weight (since weight or volume is roughly proportional to the 2nd or 3rd power of length, and Gaussian distributions are only preserved by linear transformations), and conversely assuming that weight is normal leads to non-normal lengths. This is a problem, because there is no ''a priori'' reason why one of length, or body mass, and not the other, should be normally distributed. Lognormal distributions, on the other hand, are preserved by powers so the "problem" goes away if lognormality is assumed.
On the other hand, there are some biological measures where normality is assumed, such as
blood pressure of adult humans. This is supposed to be normally distributed, but only after separating males and females into different populations (each of which is normally distributed).
Financial variables
Because of the exponential nature of
inflation, financial indicators such as
stock values, or
commodity prices make good examples of ''multiplicative'' behavior. As such, periodic changes in them (for example, yearly changes) should not be expected to be normal, but perhaps lognormal. This was the theory proposed in
1900 by
Louis Bachelier.
However,
Benoît Mandelbrot, the popularizer of
fractals, showed that even the assumption of lognormality is flawed--the changes in logarithm over short periods (such as a day) are approximated well by distributions that do not have a finite variance, and therefore the central limit theorem does not apply. Rather, the sum of many such changes gives
log-Levy distributions.
Distribution in testing and intelligence
Sometimes, the difficulty and number of questions on an
IQ test is selected in order to yield normal distributed results. Or else, the raw test scores are converted to IQ values by fitting them to the normal distribution. In either case, it is the deliberate result of test construction or score interpretation that leads to IQ scores being normally distributed for the majority of the population. However, the question whether ''
intelligence'' itself is normally distributed is more involved, because
intelligence is a
latent variable, therefore its distribution cannot be observed directly.
Numerical approximations of the normal distribution and its CDF
The normal distribution is widely used in scientific and statistical computing. Therefore, it has been implemented in various ways.
The
GNU Scientific Library calculates values of the standard normal CDF using
piecewise approximations by
rational functions. Another approximation method uses third-degree polynomials on intervals
[1].
Generation of deviates from the unit normal is normally done using the
Box-Muller method of choosing an angle uniformly and a radius exponential and then transforming to (normally distributed) ''x'' and ''y'' coordinates. If log, cos or sin are expensive then a simple alternative is to simply sum 12 uniform [−1/2, 1/2] deviates. This is equivalent to a twelfth-order polynomial approximation to the normal distribution and is quite usable in many applications.
A method that is much faster than the Box-Muller transform but which is still exact is the so called
Ziggurat algorithm developed by George Marsaglia. In about 97% of all cases it uses only two random numbers, one random integer and one random uniform, one multiplication and an if-test. Only in 3% of the cases where the combination of those two falls outside the "core of the ziggurat" a kind of rejection sampling using logarithms, exponentials and more uniform random numbers has to be employed.
There is also some investigation into the connection between the fast
Hadamard transform and the normal distribution since the transform employs just addition and subtraction and by the central limit theorem random numbers from almost any distribution will be transformed into the normal distribution. In this regard a series of Hadamard transforms can be combined with random permutations to turn arbitrary data sets into a normally distributed data.
In
Microsoft Excel the function NORMSINV() calculates the cdf of the standard normal distribution.
Trivia
★ The last series of the 10
Deutsche Mark banknotes featured
Carl Friedrich Gauss and a graph and formula of the normal probability density function.
See also
★
Behrens-Fisher problem
★
Data transformation (statistics) - simple techniques to transform data into normal distribution
★
Erdős-Kac theorem, on the occurrence of the normal distribution in
number theory
★
Gaussian blur,
convolution using the normal distribution as a kernel
★
Gaussian function
★
Iannis Xenakis, Gaussian distribution in
music.
★
Lognormal distribution
★
Multivariate normal distribution
★
Matrix normal distribution
★
Normal-gamma distribution
★
Normally distributed and uncorrelated does not imply independent (an example of two normally distributed uncorrelated random variables that are not independent; this cannot happen in the presence of
joint normality)
★
Probit function
★
Sample size
★
Student's t-distribution
★
Gaussian process
★
★
Wiener process
★
★
Brownian bridge
★
★
Ornstein-Uhlenbeck process
★
Inverse Gaussian distribution
References
★ John Aldrich.
Earliest Uses of Symbols in Probability and Statistics. Electronic document, retrieved
March 20,
2005. (''See "Symbols associated with the Normal Distribution".'')
★
Abraham de Moivre (
1738). ''
The Doctrine of Chances''.
★
Stephen Jay Gould (
1981). ''
The Mismeasure of Man''. First edition. W. W. Norton. ISBN 0-393-01489-4 .
★ Havil, 2003. ''Gamma, Exploring Euler's Constant'', Princeton, NJ: Princeton University Press, p. 157.
★
R. J. Herrnstein and
Charles Murray (
1994). ''
The Bell Curve: Intelligence and Class Structure in American Life''.
Free Press. ISBN 0-02-914673-9 .
★
Pierre-Simon Laplace (
1812). ''
Analytical Theory of Probabilities''.
★ Jeff Miller, John Aldrich, et al.
Earliest Known Uses of Some of the Words of Mathematics. In particular, the entries for
"bell-shaped and bell curve",
"normal" (distribution),
"Gaussian", and
"Error, law of error, theory of errors, etc.". Electronic documents, retrieved
December 13,
2005.
★ S. M. Stigler (
1999). ''Statistics on the Table'', chapter 22. Harvard University Press. (''History of the term "normal distribution".'')
★
Eric W. Weisstein et al.
Normal Distribution at
MathWorld. Electronic document, retrieved
March 20,
2005.
★ Marvin Zelen and Norman C. Severo (
1964). Probability Functions. Chapter 26 of ''
Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables'', ed, by
Milton Abramowitz and
Irene A. Stegun.
National Bureau of Standards.
External links
★
Calculating the Cumulative Normal distribution, C++, VBA, sitmo.com
★
Java Applet on Normal Distributions
★
Interactive Distribution Modeler (incl. Normal Distribution).
★
Free Area Under the Normal Curve Calculator from Daniel Soper's ''Free Statistics Calculators'' website. Computes the cumulative area under the normal curve (i.e., the cumulative probability), given a z-score.
★
PlanetMath: normal random variable
★
GNU Scientific Library – Reference Manual – The Gaussian Distribution
★
Distribution Calculator – Calculates probabilities and critical values for normal, ''
t'',
chi-square and
''F''-distribution.
★
Public Domain Normal Distribution Table
★
Is normal distribution due to Karl Gauss? Euler, his family of gamma functions, and place in history of statistics
★
Maxwell demons: Simulating probability distributions with functions of propositional calculus
★
Normal distribution table
★
An algorithm for computing the inverse normal cumulative distribution function by Peter J. Acklam – has examples for several
programming languages
★
[2]Intuitive derivation.