0% found this document useful (0 votes)
16 views

Roprobit

Ordered probit regression is used to analyze relationships between an ordinal dependent variable (with ordered but categorical values) and independent variables. It estimates an underlying linear score as a function of predictors and cutpoints defining value ranges for each outcome. The model was fit to a automotive repair quality dataset to explore the effects of origin, size, and fuel efficiency on the ordinal repair rating. The results found that foreign, larger, and more fuel efficient cars tended to have better repair records.

Uploaded by

Amoussou
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views

Roprobit

Ordered probit regression is used to analyze relationships between an ordinal dependent variable (with ordered but categorical values) and independent variables. It estimates an underlying linear score as a function of predictors and cutpoints defining value ranges for each outcome. The model was fit to a automotive repair quality dataset to explore the effects of origin, size, and fuel efficiency on the ordinal repair rating. The results found that foreign, larger, and more fuel efficient cars tended to have better repair records.

Uploaded by

Amoussou
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Title stata.

com
oprobit — Ordered probit regression

Description Quick start Menu Syntax


Options Remarks and examples Stored results Methods and formulas
References Also see

Description
oprobit fits ordered probit models of ordinal variable depvar on the independent variables
indepvars. The actual values taken on by the dependent variable are irrelevant, except that larger
values are assumed to correspond to “higher” outcomes.

Quick start
Ordinal probit model of y on x1 and categorical variables a and b
oprobit y x1 i.a i.b
Model of y on x1 and a one-period lagged value of x1 using tsset data
oprobit y x1 L.x1
Same as above, but calculate results for each level of catvar and save statistics to myfile.dta
statsby, by(catvar) saving(myfile): oprobit y x1 L.x1

Menu
Statistics > Ordinal outcomes > Ordered probit regression

1
2 oprobit — Ordered probit regression

Syntax
         
oprobit depvar indepvars if in weight , options

options Description
Model
offset(varname) include varname in model with coefficient constrained to 1
constraints(constraints) apply specified linear constraints
SE/Robust
vce(vcetype) vcetype may be oim, opg, robust, cluster clustvar, bootstrap,
or jackknife
Reporting
level(#) set confidence level; default is level(95)
nocnsreport do not display constraints
display options control columns and column formats, row spacing, line width,
display of omitted variables and base and empty cells, and
factor-variable labeling
Maximization
maximize options control the maximization process; seldom used
collinear keep collinear variables
coeflegend display legend instead of statistics
indepvars may contain factor variables; see [U] 11.4.3 Factor variables.
depvar and indepvars may contain time-series operators; see [U] 11.4.4 Time-series varlists.
bayes, bootstrap, by, collect, fmm, fp, jackknife, mfp, mi estimate, nestreg, rolling, statsby, stepwise,
and svy are allowed; see [U] 11.1.10 Prefix commands. For more details, see [BAYES] bayes: oprobit and
[FMM] fmm: oprobit.
vce(bootstrap) and vce(jackknife) are not allowed with the mi estimate prefix; see [MI] mi estimate.
Weights are not allowed with the bootstrap prefix; see [R] bootstrap.
vce() and weights are not allowed with the svy prefix; see [SVY] svy.
fweights, iweights, and pweights are allowed; see [U] 11.1.6 weight.
collinear and coeflegend do not appear in the dialog box.
See [U] 20 Estimation and postestimation commands for more capabilities of estimation commands.

Options

 Model
offset(varname), constraints(constraints); see [R] Estimation options.

 SE/Robust
vce(vcetype) specifies the type of standard error reported, which includes types that are derived from
asymptotic theory (oim, opg), that are robust to some kinds of misspecification (robust), that
allow for intragroup correlation (cluster clustvar), and that use bootstrap or jackknife methods
(bootstrap, jackknife); see [R] vce option.
oprobit — Ordered probit regression 3


 Reporting
level(#); see [R] Estimation options.
nocnsreport; see [R] Estimation options.
display options: noci, nopvalues, noomitted, vsquish, noemptycells, baselevels,
allbaselevels, nofvlabel, fvwrap(#), fvwrapon(style), cformat(% fmt), pformat(% fmt),
sformat(% fmt), and nolstretch; see [R] Estimation options.

 Maximization
 
maximize options: difficult, technique(algorithm spec), iterate(#), no log, trace,
gradient, showstep, hessian, showtolerance, tolerance(#), ltolerance(#),
nrtolerance(#), nonrtolerance, and from(init specs); see [R] Maximize. These options are
seldom used.

The following options are available with oprobit but is not shown in the dialog box:
collinear, coeflegend; see [R] Estimation options.

Remarks and examples stata.com


An ordered probit model is used to estimate relationships between an ordinal dependent variable
and a set of independent variables. An ordinal variable is a variable that is categorical and ordered,
for instance, “poor”, “good”, and “excellent”, which might indicate a person’s current health status or
the repair record of a car. If there are only two outcomes, see [R] logistic, [R] logit, and [R] probit.
This entry is concerned only with more than two outcomes. If the outcomes cannot be ordered (for
example, residency in the north, east, south, or west), see [R] mlogit. This entry is concerned only
with models in which the outcomes can be ordered. See [R] logistic for a list of related estimation
commands.
In ordered probit, an underlying score is estimated as a linear function of the independent variables
and a set of cutpoints. The probability of observing outcome i corresponds to the probability that the
estimated linear function, plus random error, is within the range of the cutpoints estimated for the
outcome:

Pr(outcomej = i) = Pr(κi−1 < β1 x1j + β2 x2j + · · · + βk xkj + uj ≤ κi )

uj is assumed to be normally distributed. In either case, we estimate the coefficients β1 , β2 , . . . ,


βk together with the cutpoints κ1 , κ2 , . . . , κI−1 , where I is the number of possible outcomes.
κ0 is taken as −∞, and κI is taken as +∞. All of this is a direct generalization of the ordinary
two-outcome probit model.

Example 1
In example 2 of [R] ologit, we use a variation of the automobile dataset (see [U] 1.2.2 Example
datasets) to analyze the 1977 repair records of 66 foreign and domestic cars. We use ordered logit
to explore the relationship of rep77 in terms of foreign (origin of manufacture), length (a proxy
for size), and mpg. Here we fit the same model using ordered probit rather than ordered logit:
4 oprobit — Ordered probit regression

. use https://www.stata-press.com/data/r18/fullauto
(Automobile models)
. oprobit rep77 foreign length mpg
Iteration 0: Log likelihood = -89.895098
Iteration 1: Log likelihood = -78.106316
Iteration 2: Log likelihood = -78.020086
Iteration 3: Log likelihood = -78.020025
Iteration 4: Log likelihood = -78.020025
Ordered probit regression Number of obs = 66
LR chi2(3) = 23.75
Prob > chi2 = 0.0000
Log likelihood = -78.020025 Pseudo R2 = 0.1321

rep77 Coefficient Std. err. z P>|z| [95% conf. interval]

foreign 1.704861 .4246796 4.01 0.000 .8725037 2.537217


length .0468675 .012648 3.71 0.000 .022078 .0716571
mpg .1304559 .0378628 3.45 0.001 .0562463 .2046656

/cut1 10.1589 3.076754 4.128577 16.18923


/cut2 11.21003 3.107527 5.119389 17.30067
/cut3 12.54561 3.155233 6.361467 18.72975
/cut4 13.98059 3.218793 7.671874 20.28931

We find that foreign cars have better repair records, as do larger cars and cars with better mileage
ratings.

Stored results
oprobit stores the following in e():
Scalars
e(N) number of observations
e(N cd) number of completely determined observations
e(k cat) number of categories
e(k) number of parameters
e(k aux) number of auxiliary parameters
e(k eq) number of equations in e(b)
e(k eq model) number of equations in overall model test
e(k dv) number of dependent variables
e(df m) model degrees of freedom
e(r2 p) pseudo-R2
e(ll) log likelihood
e(ll 0) log likelihood, constant-only model
e(N clust) number of clusters
e(chi2) χ2
e(p) p-value for model test
e(rank) rank of e(V)
e(ic) number of iterations
e(rc) return code
e(converged) 1 if converged, 0 otherwise
Macros
e(cmd) oprobit
e(cmdline) command as typed
e(depvar) name of dependent variable
e(wtype) weight type
oprobit — Ordered probit regression 5

e(wexp) weight expression


e(title) title in estimation output
e(clustvar) name of cluster variable
e(offset) linear offset variable
e(chi2type) Wald or LR; type of model χ2 test
e(vce) vcetype specified in vce()
e(vcetype) title used to label Std. err.
e(opt) type of optimization
e(which) max or min; whether optimizer is to perform maximization or minimization
e(ml method) type of ml method
e(user) name of likelihood-evaluator program
e(technique) maximization technique
e(properties) b V
e(predict) program used to implement predict
e(marginsdefault) default predict() specification for margins
e(asbalanced) factor variables fvset as asbalanced
e(asobserved) factor variables fvset as asobserved
Matrices
e(b) coefficient vector
e(Cns) constraints matrix
e(ilog) iteration log (up to 20 iterations)
e(gradient) gradient vector
e(cat) category values
e(V) variance–covariance matrix of the estimators
e(V modelbased) model-based variance
Functions
e(sample) marks estimation sample

In addition to the above, the following is stored in r():


Matrices
r(table) matrix containing the coefficients with their standard errors, test statistics, p-values,
and confidence intervals

Note that results stored in r() are updated when the command is replayed and will be replaced when
any r-class command is run after the estimation command.

Methods and formulas


See Methods and formulas of [R] ologit.

References
Aitchison, J., and S. D. Silvey. 1957. The generalization of probit analysis to the case of multiple responses. Biometrika
44: 131–140. https://doi.org/10.2307/2333245.
Bauldry, S., J. Xu, and A. S. Fullerton. 2018. gencrm: A new command for generalized continuation-ratio models.
Stata Journal 18: 924–936.
Cameron, A. C., and P. K. Trivedi. 2005. Microeconometrics: Methods and Applications. New York: Cambridge
University Press.
Canette, I. 2013. Fitting ordered probit models with endogenous covariates with Stata’s gsem command. The Stata Blog:
Not Elsewhere Classified. http://blog.stata.com/2013/11/07/fitting-ordered-probit-models-with-endogenous-covariates-
with-statas-gsem-command/.
Chiburis, R., and M. Lokshin. 2007. Maximum likelihood and two-step estimation of an ordered-probit selection
model. Stata Journal 7: 167–182.
De Luca, G., and V. Perotti. 2011. Estimation of ordered response models with sample selection. Stata Journal 11:
213–239.
6 oprobit — Ordered probit regression

Drukker, D. M. 2016. An ordered-probit inverse probability weighted (IPW) estimator. The Stata Blog: Not Elsewhere
Classified. http://blog.stata.com/2016/09/13/an-ordered-probit-inverse-probability-weighted-ipw-estimator/.
Huismans, J., J. W. Nijenhuis, and A. Sirchenko. 2022. A mixture of ordered probit models with endogenous switching
between two latent classes. Stata Journal 22: 557–596.
Long, J. S. 1997. Regression Models for Categorical and Limited Dependent Variables. Thousand Oaks, CA: Sage.
Long, J. S., and J. Freese. 2014. Regression Models for Categorical Dependent Variables Using Stata. 3rd ed. College
Station, TX: Stata Press.
Miranda, A., and S. Rabe-Hesketh. 2006. Maximum likelihood estimation of endogenous switching and sample
selection models for binary, ordinal, and count variables. Stata Journal 6: 285–308.
Smith, E. K., M. G. Lacy, and A. Mayer. 2019. Performance simulations for categorical mediation: Analyzing khb
estimates of mediation in ordinal regression models. Stata Journal 19: 913–930.
Stewart, M. B. 2004. Semi-nonparametric estimation of extended ordered probit models. Stata Journal 4: 27–39.
Williams, R. 2010. Fitting heterogeneous choice models with oglm. Stata Journal 10: 540–567.
Xu, J., and J. S. Long. 2005. Confidence intervals for predicted outcomes in regression models for categorical
outcomes. Stata Journal 5: 537–559.

Also see
[R] oprobit postestimation — Postestimation tools for oprobit
[R] heckoprobit — Ordered probit model with sample selection
[R] hetoprobit — Heteroskedastic ordered probit regression
[R] logistic — Logistic regression, reporting odds ratios
[R] mlogit — Multinomial (polytomous) logistic regression
[R] mprobit — Multinomial probit regression
[R] ologit — Ordered logistic regression
[R] probit — Probit regression
[R] zioprobit — Zero-inflated ordered probit regression
[BAYES] bayes: oprobit — Bayesian ordered probit regression
[CM] cmroprobit — Rank-ordered probit choice model
[ERM] eoprobit — Extended ordered probit regression
[FMM] fmm: oprobit — Finite mixtures of ordered probit regression models
[ME] meoprobit — Multilevel mixed-effects ordered probit regression
[MI] Estimation — Estimation commands for use with mi estimate
[SVY] svy estimation — Estimation commands for survey data
[XT] xtoprobit — Random-effects ordered probit models
[U] 20 Estimation and postestimation commands

Stata, Stata Press, and Mata are registered trademarks of StataCorp LLC. Stata and
®
Stata Press are registered trademarks with the World Intellectual Property Organization
of the United Nations. Other brand and product names are registered trademarks or
trademarks of their respective companies. Copyright c 1985–2023 StataCorp LLC,
College Station, TX, USA. All rights reserved.

You might also like