I'm planning on writing more posts on one of my favorite statistical techniques (or set of techniques): structural equation modeling. For today, I'm going to write about some terminology frequently used in SEM - exogenous and endogenous variables.
(Note, these terms are used in other contexts as well. My goal is to discuss how they're used in SEM specifically, as a set-up for future SEM posts.)
Whenever you put together a structural equation model, you're hypothesizing paths between variables. A path means one variable influences/is caused by another. In a measurement model, where observed variables are being used to reflect an underlying (latent) construct, the path from the construct to each of the variables signifies that the construct influences/causes the values of the observed variables.
Created with the semPlot package using a lavaan dataset - look for a future blog post on this topic! |
Created with the semPlot package using a lavaan dataset - look for a future blog post on this topic! |
You may be wondering what we would call a variable that is caused by 1 or more variables, and in turn, causes 1 or more variables. In this terminology, we would still call them endogenous, but we might also use another term: mediator.
Stay tuned for more SEM posts, where we'll start digging into the figures above and showing how it all works! I'm also gearing up for Blogging A to Z; look for a theme reveal on March 19. Spoiler alert: It will be stats related.
Given their use in other contexts, these terms makes sense. Exogenous means to come from the outside. Endogenous means from within. Endogenous variables have their variance explained from within the model, i.e., explained by the exogenous variable. Exogenous variables would have to have their variances explained by other unmodeled sources of variances.
ReplyDelete