Estimating Variance Components of Missing Data for Generalizability Theor

doi:10.3724/SP.J.1041.2014.01897

Abstract

Abstract:

Missing observations are common in operational performance assessment settings or psychological surveys and experiments. Since these assessments are time-consuming to administer and score, examinees seldom respond to all test items and raters seldom evaluate all examinee responses. As a result, a frequent problem encountered by those using generalizability theory with large-scale performance assessments is working with missing data. Data from such examinations compose a missing data matrix. Researchers usually concern about how to make good use of the full data and often ignore missing data. As for these missing data, a common practice is to delete them or make an imputaion for missing records; however, it may cause problems in following aspects. Firstly, deleting or interpolating missing data may result in ineffective statistical analysis. Secondly, it is difficult for researchers to choose an unbiased method among diverse rules of interpolation. As a result of missing data, a series of problems may be caused when estimating variance components of unbalanced data in generalizability theory. A key issue with generalizability theory lies in how to effectively utilize the existing missing data to their maximum statistical analysis capacity. This article provides four methods to estimate variance components of missing data for unbalanced random p×i×r design of generalizability theory: formulas method, restricted maximum likelihood estimation (REML) method, subdividing method, and Markov Chain Monte Carlo (MCMC) method. Based on the estimating formulas of p×i design by Brennan (2001), formulas method is the deduction of estimating variance components formulas for p×i×r design with missing data. The aim of this article is to investigate which method is superior in estimating variance components of missing data rapidly and effectively. MATLAB 7.0 was used to simulate data, and generalizability theory was used to estimate variance components. Three conditions were simulated respectively: (1) persons sample with small size (200 students), medium size (1000 students) and large size (5000 students); (2) item sample with 2 items, 4 items and 6 items; (3) raters sample with 5 raters, 10 raters and 20 raters. The authors also developed some programs for MATLAB, WinBUGS, SAS and urGENOVA software in order to estimate variance components of p×i×r missing data with four methods. Criterions were made for the purpose of comparing the four methods. For example, bias was the criterion when estimating variance components. The reliability of the results increased as the absolute bias decreased. Results indicate that: (1) MCMC method has a strong advantage for estimating variance components of p×i×r missing data over the other three methods. MCMC method is superior to formulas method because of smaller deviation for variance components estimation. It is better than REML method because iteration of MCMC method converge, while REML method does not. Unlike subdividing method, MCMC method does not require variance components to be combined in order to obtain accurate estimations. (2) Item and rater are two important influencing factors for estimating variance components of missing data. If manpower and material resources are limited, priority should be given to increase the number of items in order to increase estimation accuracy. If researchers cannot increase the number of items, the next-best thing is to increase the number of raters. However, the number of raters should be cautiously controlled.

Key words: Generalizability Theory, missing data, estimating variance components, p×i×r design, Markov Chain Monte Carlo (MCMC)

ZHANG Minqiang; ZHANG Wenyi; LI Guangming; LIU Xiaoyu; Huang Feifei. (2014). Estimating Variance Components of Missing Data for Generalizability Theor. Acta Psychologica Sinica, 46(12), 1897-1909.

[1]	SONG Zhilin, GUO Lei, ZHENG Tianpeng. Comparison of missing data handling methods in cognitive diagnosis: Zero replacement, multiple imputation and maximum likelihood estimation [J]. Acta Psychologica Sinica, 2022, 54(4): 426-440.
[2]	WANG Meng-Cheng; DENG Qiaowen. The mechanism of auxiliary variables in full information maximum likelihood–based structural equation models with missing data [J]. Acta Psychologica Sinica, 2016, 48(11): 1489-1498.
[3]	LUO Zhaosheng;GUO Xiaojun. The Optimal Size of Material in Psychological Experiment: The Applications of Multivariate Generalizability Theory [J]. Acta Psychologica Sinica, 2014, 46(6): 876-884.
[4]	LI Guangming;ZHANG Minqiang. Using Adjusted Bootstrap to Improve the Estimation of Variance Components and Their Variability for Generalizability Theory [J]. Acta Psychologica Sinica, 2013, 45(1): 114-124.
[5]	FANG Jie;ZHANG Min-Qiang. Assessing Point and Interval Estimation for the Mediating Effect: Distribution of the Product, Nonparametric Bootstrap and Markov Chain Monte Carlo Methods [J]. Acta Psychologica Sinica, 2012, 44(10): 1408-1420.
[6]	LI Guang-Ming,ZHANG Min-Qiang. Estimating the Variability of Estimated Variance Components for Generalizability Theory [J]. , 2009, 41(09): 889-901.
[7]	YU Zong-Huo,TANG Xiao-Juan,WANG Deng-Feng. A Comparison of GT and IRT: An Analysis of Performance Rating of Men’s 10 Me-ters Platform Diving in Beijing Olympic Games [J]. , 2009, 41(08): 773-784.
[8]	ZENG Li,XIN Tao,ZHANG Shu-Mei. Comparison of Two MCMC Approaches to Missing Response Data in 2PL Model [J]. , 2009, 41(03): 276-282.
[9]	Yang Zhiming,Chang Lei,Ma Shiye. MULTIVARIATE GENERALIZABILITY ANALYSIS OF THE CHINESE COLLEGE ENTRANCE COMPREHENSIVE EXAMINATION [J]. , 2004, 36(02): 195-200.
[10]	Yan Fang, Li Weiming (Department of Psychology, East China Normal University, Shanghai 200062). USING STRUCTURAL EQUATION MODELS TO ESTIMATE RATER RELIABILITY IN GENERALIZABILITY THEORY [J]. , 2002, 34(05): 92-97.
[11]	Yang Zhiming,Chang Lei (Department of Educational Psychology, The Chinese University of Hong Kong). A STUDY ON PUTONGHUA TESTING BY MULTIVARIATE GENERALIZABILITY THEORY [J]. , 2002, 34(01): 51-56.
[12]	Li Weiming Yan Fang(Department of psychology, East-China Normal University, Shanghai 200062). MODEL SELECTIONS, VARIANCE COMPONENT EXPLANATIONS AND INDEXCOMPARISONS IN THE APPLICATION OF GENERALIZABILITYTHEORY:COMMENTS ON LIU AND ZHANG (1998,1999) [J]. , 2001, 33(05): 84-87.
[13]	Liu Yuanwo (Personnel Testing Authorities,Minisity of Personnel P.R.C.,Beijing 100054) Zhang Houcan (Beijing Normal University. Beijing 100875). APPLICATION OF GENERALIZABILITY THEORY IN COMPOSITION SCORING [J]. , 1998, 30(02): 211-218.

Estimating Variance Components of Missing Data for Generalizability Theor

Knowledge

Review File

Abstract

Cite this article

share this article

References

Related Articles 13

Recommended Articles

Metrics

Comments