The mechanism of auxiliary variables in full information maximum likelihood–based structural equation models with missing data

doi:10.3724/SP.J.1041.2016.01489

Abstract

Abstract:

In social and behavioral studies, missing data cannot be avoided in the process of data collection, especially in longitudinal studies. Because sample with missing data lose the balance characteristics of their complete counterparts, which may distort parameter estimates and degrade the performance of confidence intervals, special methods have to be developed for these analysis. Two modern missing data analysis techniques, maximum likelihood estimation and multiple imputation, have been widely studied in the methodological literature during the last decade. Since the maximum likelihood estimation and multiple imputation require the MAR (missing at random) assumption, including auxiliary variables can help fine-tune the missing data handling procedure, either by reducing bias or by increasing power. A useful auxiliary variable is a potential cause or a correlate of the incomplete variables in the analysis model. Notably, Graham (2003) proposed a “saturated correlates model”, which allows us to include auxiliary variables in FIML-based structural equation models easily. However, some questions about the inclusion of auxiliary variables are needed to further study. The main research question was under what condition the auxiliary variables will be effective in the FIML-based structural equation modeling. The current study investigates the effect of including auxiliary variables during estimation of structural equation modeling parameters with FIML estimation through Monte Carlo simulation. It focused on the missing values of the auxiliary variables and variables of interests simultaneously. The simulation repeated 5,000 times for each of 576 combinations: common missing rates (5 percent, 10 percent, 15 percent, and 20 percent), missing mechanism combinations (MCAR-MCAR, MCAR-MAR, MCAR-MNAR, MAR-MCAR, MAR-MAR, and MAR-MNAR), correlations (low, moderate to high), number of auxiliary variables (1, 3, 5), and sample sizes (100, 200, 500, 1000). The evaluation criteria are bias and confidence intervals coverage of parameters. Data generates according to Enders (2008) model. All data generate and analyze by Mplus 7.0. Auxiliary variables without missing values outperformed auxiliary variables with missing values. Including auxiliary variables which had missing values in the analysis procedure was found to improve parameter estimation efficiently in most cases. Results showed that the bias was more serious when the missing mechanism of the auxiliary variables was MCAR than MNAR. In the FIML-based structural equation modeling, the inclusion of more than a single auxiliary variable for MAR-MCAR or MAR-MNAR combined mechanisms is beneficial, while for MAR-MAR combined mechanism, a single auxiliary variable would be better. In addition, it is beneficial to include auxiliary variables which had low correlation with variables of interests in this model. However, simulation results indicated that the common missing rates had little impact on bias. Overall, this study indicates that the inclusion of incomplete auxiliary variables is beneficial, even if the auxiliary variables and variables of interests have a relative proportion of missing data.

Key words: missing data, missing mechanism, SEM, full information maximum likelihood, auxiliary variable, Monte Carlo simulation

WANG Meng-Cheng; DENG Qiaowen. (2016). The mechanism of auxiliary variables in full information maximum likelihood–based structural equation models with missing data. Acta Psychologica Sinica, 48(11), 1489-1498.

[1]	ZHOU Yuxi, LIU Yuhao, ZHANG Qingfang. The influence of stimulus onset asynchrony on semantic effect in spoken word production: A picture-word interference paradigm study [J]. Acta Psychologica Sinica, 2022, 54(5): 453-465.
[2]	SONG Zhilin, GUO Lei, ZHENG Tianpeng. Comparison of missing data handling methods in cognitive diagnosis: Zero replacement, multiple imputation and maximum likelihood estimation [J]. Acta Psychologica Sinica, 2022, 54(4): 426-440.
[3]	WANG Lili, DONG Menglu. Does male beauty really work: The impact of male endorsements on female consumers’ evaluation of female-gender-imaged product [J]. Acta Psychologica Sinica, 2022, 54(2): 192-204.
[4]	CHEN Shi, LIANG Zheng, LI Xianglan, CHEN Yanran, ZHAO Qingbai, YU Quanlei, LI Songqing, ZHOU Zhijin, LIU Lizhong. The role of novel semantic association in the promoting effect of insight on memory [J]. Acta Psychologica Sinica, 2021, 53(8): 837-846.
[5]	ZHANG Huan, WANG Xin, LIU Yibei, CAO Xiancai, WU Jie. The influence of members’ relationship on collaborative remembering [J]. Acta Psychologica Sinica, 2021, 53(5): 481-493.
[6]	ZUO Bin, DAI Yuee, WEN Fangfang, GAO Jia, XIE Zhijie, HE Saifei. “You were what you eat”: Food-gender stereotypes and their impact on evaluation of impression [J]. Acta Psychologica Sinica, 2021, 53(3): 259-272.
[7]	Yaxuan RAN,Jiani LIU,Yishi ZHANG,Haiying WEI. The magic of one person: The effect of the number of endorsers on brand attitude [J]. Acta Psychologica Sinica, 2020, 52(3): 371-385.
[8]	ZHANG Kai,SHI Jinjing,LUO Wenhao. How can leader’s voice endorsement promote employee voice: An integrated mechanism based on the goal self-organization perspective [J]. Acta Psychologica Sinica, 2020, 52(2): 229-239.
[9]	HU Jingjing, XU Haokui, CAO Liren. Visual representation of items with semantic information in sensory memory [J]. Acta Psychologica Sinica, 2019, 51(9): 982-991.
[10]	WANG Juan,MA Xuemei,LI Bingbing,ZHANG Jijia. The neighborhood effect of semantic and phonetic radicals in phonogram recognition [J]. Acta Psychologica Sinica, 2019, 51(8): 857-868.
[11]	WANG Dan,WANG Ting,QIN Song,ZHANG Jijia. Location effect of Chinese wordable components in the component priming paradigm [J]. Acta Psychologica Sinica, 2019, 51(2): 163-176.
[12]	WU Baizhou, LI Jie, HE Hu, HOU You, JIA Yingqi, FENG Shenxing. Categorical perception of color can be instantly influenced by color vision fatigue and semantic satiation [J]. Acta Psychologica Sinica, 2019, 51(2): 196-206.
[13]	WANG Bin,LI Zhirui,WU Limei,ZHANG Jijia. Effects of embodied simulation on understanding Chinese body action verbs [J]. Acta Psychologica Sinica, 2019, 51(12): 1291-1305.
[14]	ZHANG Yuzhi, ZHANG Jijia. Effects of task type, family size, and grammatical consistency on the activation of grammatical information of semantic radicals [J]. Acta Psychologica Sinica, 2019, 51(10): 1091-1101.
[15]	HUANG Minxue, YAO Shunyu, LIU Maohong. Self-enhancing or self-deprecating: How can celebrity endorsement enhance the marketing effectiveness of advertisements in social media [J]. Acta Psychologica Sinica, 2018, 50(8): 907-919.

The mechanism of auxiliary variables in full information maximum likelihood–based structural equation models with missing data

Knowledge

Review File

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments