21.11 Sample statistics 623
is the probability density for the errorε
i1 = 1 y
i1 − 1 f(x
i). If the errors are independent, the
total probability density for the Nvaluesε
1, ε
2, =, ε
kis the product
(21.61)
When the true functionf(x)is replaced by an approximate function, the expression
(21.61) is a measure of the likelihood of the fit, and the likelihood is maximized when
χ
2is as small as possible.
Regression
The fitting of data points to functions is often called regressionby statisticians;
the curves about which points are clustered are called ‘regression curves’, and their
equations are ‘regression equations’. For a straight line, the regression is called linear.
The name has its origin in Galton’s studies on heredity in which he found that, on
average, fathers whose heights deviate from the mean height of all fathers have sons
whose heights deviate from the mean height of all sons by lesser amounts.
6He called
this ‘regression to mediocrity’, and the word is widely used, particularly in the social
sciences. The type of data of interest in the physical sciences, and the purpose for
its statistical analysis, is of a quite different nature. The word regression is entirely
inappropriate as a description of the use of statistics in chemistry.
21.11 Sample statistics
In any practical experiment the number of measurements, the sample size, is
necessarily finite so that only estimatesof the parent population or distribution are
obtained from the statistics of the sample. Thus, the sample mean defined in equation
(21.2) gives an estimate of the population mean,
(21.62)
and the variance and standard deviation defined in equations (21.5) and (21.6) are
estimates of the variance and standard deviation of the population,
(21.63)
σ22121
≈= −
=∑
s
N
xx
iNi()
μ≈=
=∑
x
N
x
iNi1
1=−
−=∏
1122
1
2
π
iNiσχexp
ρε ρε()() ( )ρε σ
12112
NiNi=
−=∏
π exp −−
=∑
1
2
12iNiiε
σ
6Francis Galton (1822–1911), born in Birmingham. He discovered the importance of anti-cyclones in weather
systems and was influential in the establishment of the Meteorological Office and the National Physical
Laboratory. Best known for his work on heredity, he coined the word ‘eugenics’ and advocated the application of
scientific breeding to human populations.