Palgrave Handbook of Econometrics: Applied Econometrics

(Grace) #1

518 Discrete Choice Modeling


The authors argued that, although the various calculus courses can be ordered
discretely by the material covered, the differences between the levels cannot be
measured directly. Thus, this is an application of the ordered probit model. The
independent variables in this first-step model included SAT (scholastic aptitude
test) scores, foreign language proficiency, indicators of intended major, and several
other variables related to areas of study.
The second step of the estimator involves regression analysis of the grade in the
intermediate microeconomics or macroeconomics course. Grades in these courses
were translated to a granular continuous scale (A=4.0, A−=3.7, etc.). A linear
regression is then specified:


Gradei=z′iδ+ui, whereui|zi∼N[0,σu^2 ].

Independent variables in this regression include, among others, (1) dummy vari-
ables for which the outcome in the ordered probit model applies to the student
(with the zero reference case omitted), (2) grade in the last calculus course, (3) sev-
eral other variables related to prior courses, (4) class size, (5) freshman grade point
average, etc. The unobservables in theGradeequation and the math attainment
are clearly correlated, a feature captured by the additional assumption that (εi,


ui|xi,zi)∼N 2 [(0,0),(1,σu^2 ),ρσu]. A non-zeroρcaptures this “selection” effect. With
this in place, the dummy variables now become endogenous. The solution is a
“selection” correction where the modified equation becomes:


Gradei|mi=z′iδ+E[ui|mi]+vi
=z′iδ+(ρσu)[λ(x′iβ,μ 1 ,...,μ 5 )]+vi.

They thus adopt a “control function” approach to accommodate the endogeneity
of the math attainment dummy variables. The termλ(x′iβ,μ 1 ,...,μ 5 )is a gener-
alized residual that is constructed using the estimates from the first-stage ordered
probit model (a precise statement of the form of this variable is given in Tobias and
Li, 2006). Linear regression of the course grade onziand this constructed regressor
is computed at the second step. The standard errors at the second step must be
corrected for the use of the estimated regressor using what amounts to a Murphy
and Topel (1985) correction.
Tobias and Li (2006), in a replication of and comment on Butler, Finegan and
Siegfried (1998), after roughly replicating the classical estimation results with a
Bayesian estimator, observe that theGradeequation above could also be treated as
an ordered probit model. The resulting bivariate ordered probit model would be:


m∗i= x′iβ+εi, and gi∗ =z′iδ+ui,
mi = 0if−∞<m∗i≤0, gi =0if−∞<gi∗≤0,
1if0<m∗i≤μ 1 , 1 if 0<g∗i≤α 1 ,
... ...
6ifμ 5 <m∗i<+∞ 11 ifμ 9 <g∗i<+∞,

where (εi,ui|xi,zi)∼N 2 [(0,0),(1,σu^2 ),ρσu].

Free download pdf