doing so, however, a complication arises in that theoretical probabilities pi
defined by Equation (10.2) are, being functions of the distribution parameters,
functions of the sample. The statistic D now takes the form
where is an estimator for pi and is thus a statistic. We see that D is now
a much more co mplicated function of X 1 ,X 2 ,...,Xn. The important question
to be answered is: what is the new distribution of D?
The problem of determining the limiting distribution of D in this situation
was first considered by Fisher (1922, 1924), who showed that, as n , the
distribution of D needs to be modified, and the modification obviously depends
on the method of parameter estimation used. Fortunately, for a class of
important methods of estimation, such as the maximum likelihood method,
the modification required is a simple one, namely, statistic D still approaches a
ch i-squared distribution as n but now with (k r 1 ) degrees of fr ee-
dom, where r is the number of parameters in the hypothesized distribution to be
estimated. In other words, it is only necessary to reduce the number of degrees
of freedom in the limiting distribution defined by Equation (10.5) by one for
each parameter estimated from the sample.
We can now state a step-by-step procedure for the case in which r parameters
in the distribution are to be estimated from the data.
. (^) Step 1: divide range space X into k mutually exclusive and numerically con-
venient intervals Ai,i 1,...,k.Let ni be the number of sample values fall-
ing into Ai. As a rule, if the number of sample values in any Ai is less than 5,
combine interval Ai with either Ai 1 or Ai 1.
. (^) Step 2: estimate the r parameters by the method of maximum likelihood from
the data.
. (^) Step 3: co mpute theoretical probabilities P(Ai)pi,i 1,...,k,bymeansof
the hypothesized distribution with estimated parameter values.
. (^) Step 4: construct d as given by Equation (10.7).
. (^) Step 5: choose a value of and determine from Table A.5 for the^2
distribution of (k r 1) degrees of fr eedom the value of^2 kr1,.Itis
assumed, of co ur se, that k r 1 > 0.
. (^) Step 6: reject hypothesis H if d^2 kr1,. Otherwise, accept H.
Example 10.3.Problem: vehicle arrivals at a toll gate on the New York State
Thruway were recorded. The vehicle counts at one-minute intervals were taken
for 106 minutes and are given in Table 10.4. On the basis of these observations,
determine whether a Poisson distribution is appropriate for X, the number of
arrivals per minute, at the 5% significance level.
M odel Verification 323
D
Xk
i 1
n
P^i
Ni
n
P^i
2
Xk
i 1
Ni^2
nP^i
n;
10 : 10
!1
!1
>
P^i