Semc 3 Q
Semc 3 Q
SEM Basics
General steps in Structural Equation Model-
ing are:
(1) Specication
(2) Identication
(3) Estimation
(4) Testing
(5) Modication
1
3.1 Model Specication
Involves using all available relevant theory,
research, and information to construct the
theoretical model.
In structural equation modeling this results
to specifying the relationships between the
relevant variables describing the phenomenon
in interest.
2
Example 3.1: For a study in an industrial sales force
job sat
perform
?
?
Remark 3.1: In diagrams the (latent) theoretical vari-
ables are presented as circles or ovals and the observed
variables in rectangles.
job sat
perform
`
? ?
achv mot
self est
verb int
'
&
'
&
'
&
4
Remark 3.2: In SEM graphs, like above, the one-
headed arrows indicate causal relations and the two-
headed arched arrows indicate mutual dependencies
(correlation).
Remark 3.3: Inclusion of unimportant factors or ex-
clusion of important factors will produce implied mod-
els that are misspecied.
Remark 3.4: Exclusion of important factors may re-
sult in biased parameter estimates.
Remark 3.5: Inclusion of unimportant factors may re-
sult in loss of estimation accuracy.
Remark 3.6: A misspecied model cannot reproduce
adequately the observed covariances, and hence will
not t the data.
5
Measurement Models
If some of the variables in the model are
latent (not directly observable), a measure-
ment model must be specied.
This results to constructing observable vari-
ables that are supposed to measure the latent
variables and dening relationships between
these variables and latent variables.
6
Example 3.2: In the above example e.g. the achieve-
ment motivation was measured with two indices, each
of which were constructed as a sum of several other
variables.
The measurement model is constructed for the ex-
planatory (exogenous) latent variables and the ex-
plained (endogenous) latent variables separately.
Thus there are two measurement models, one for the
exogenous variables and one fore the endogenous vari-
ables.
7
Measurement model for the exogenous factors:
achv mot
self est
verb int
achm1
achm2
sestm1
sestm2
intlm1
intlm2
$
%
$
%
$
%
error
age
sev
anx
>
>
>
>
>
>
>
>
0
= sar
1
age
2
sev
3
anx,
where hat () denotes the estimate of the correspond-
ing parameter and the bar indicates sample mean of
the corresponding variable.
In the SEM computer output the just-identied is in-
dicated by the notion that the model is saturated and
2
= 0, df = 0, and p-val= 0.
19
Estimation:
Model estimation involves estimating the parameters
in the regression model (1).
This can be done with usual statistical packages (SAS,
SPSS, etc), and even in Excel.
Here we demonstrate PRELIS and LISREL.
In PRELIS data is the following:
Menu Statistics Regressions opens
20
Selecting
and clicking Run gives output
21
Model Testing:
In regression the the global t is most often mea-
sured with the R-square (coecient of determina-
tion), which indicates the fraction the model can ex-
plain from the total variation in the dependent vari-
able.
R
2
=
n
i=1
( y
i
y)
2
n
i=1
(y
i
y)
2
, (2)
where y is the predicted (tted) value of y with the
estimated regression model and y is the sample mean.
In the example R
2
= 0.685, which indicates variables
age, sev, and anx esplain 68.5 percent of the variation
in sat, the rest 31.5% is from unknown sources.
The Error Variance in the out put is the variance
estimate of the error term u in the regression (1).
22
A global test whether any of the explanatory variables
have eect on the dependent variables is the F-test.
The null hypothesis of the test is whether all the slope
parameters are simultaneously zero, i.e.,
H
0
:
1
=
2
=
3
= 0. (3)
Unlike standard regression programs, PRELIS does
not print this global.
However, if the null hypothesis is true, it implies that
the R-square, R
2
= 0 in the population regression.
Thus, hypothesis (3) is equivalent to the null hypoth-
esis that the population R
2
is zero (remember that
statistical testing is always about testing population
parameters).
Consequently the F-test can be formulated in terms
of the R
2
and
F =
R
2
1 R
2
(n p 1)
p
, (4)
where p is the number of explanatory variables, here
p = 3 and n is the number of observations.
23
Thus,
F =
0.685
1 0.685
(24 3 1)
3
14.5
The degrees of freedom of the nominator is p = 3 and
denominator n p 1 = 20.
The implied p-value is 0.00003.
Thus, the null hypothesis (3) is clearly rejected, and
the conclusion is that at least one of the explanatory
variables aect the level of satisfaction.
24
Individual parameters are tested with the t-test.
More general parameter restrictions are again tested
with a suitable F-test.
Inspection of the individual coecients reveals that
the coecient estimate of sev is not statistically sig-
nicant (p-val 0.626).
Thus, there is no empirical evidence that the severity
of illness aects the patient satisfaction.
25
Model Modication:
Variable sev proved not to be statistically signicant,
thus it can be dropped out from the model.
Doing this gives
The goodness-of-t in terms of R
2
= 0.682 shows that
the reduced model is virtually as good as the original
one.
If the explanatory power of about 68 percent is not
good enough, additional variables should be included
if more resent research indicates that another variables
was important in explaining patient satisfaction.
26
Example 3.5: LISREL demonstration of estimating
the model. LISREL is not really intended for simple
regression analysis applications, but the running the
example in LISREL serves demonstrating the use of
graphical modeling in forthcoming more complicated
cases.
27
In LISREL one can estimate the model by writing the
syntax or starting with the path window to generate
the path model (Figure 3.1)
The raw data is in le ex34.psf (PRELIS system le).
In LISREL select File New, which opens
Scroll down to get Path Diagram, select it and click OK
to open
Type in the name you want to save your path diagram
(here ex35) (the extension will be .pth) and click Save.
28
Next opens
From the menu select Setup Title and Comments. . .
to open
Type in the Title and click Next to open the Group
Names dialog.
29
We have only one group, thus click Next to open
Type in variables names
Click Next
select Raw Data in Statistics from: and in File name:
nd the le you have the raw data saved as a PRELIS
system le (here, ex34.psf).
After nding the le, click OK.
30
Next opens
Check sat as an Y-variable as indicated above.
31
Next formulate the path-diagram by dragging and drop-
ping the variables to get (you can use the Image menu
to tinker the diagram) and using the tool window
to dene the relationships to get.
Click the Run LISREL to generate SIMPLIS code.
Clicking again the Run LISREL button estimates the
model
32
The estimated are output also to the path diagram
In the equation form it is as before
33
Activating the .PTH window you can output dier-
ent path statistics to the diagram from the View
Estimations sub-menu
For example the standardized coecients are
The bi-variate relations between explanatory variables
(x-variables) are correlations.
34