Developing On-line Calibration Methods for Cognitive Diagnostic Computerized Adaptive Testing

Abstract

As in all computerized adaptive testing (CAT) applications, some items in the item bank may be flawed, obsolete, or overexposed and should be replaced by new items (Wainer & Mislevy, 1990), so item replenishment is essential for item bank maintenance and development in cognitive diagnostic CAT (CD-CAT). In regular CAT, on-line calibration is commonly used to estimate the parameters of new items. However, no published reference on on-line calibration for CD-CAT has been available until now. This study therefore investigated whether some methods currently used in CAT can be extended to CD-CAT. Three representative on-line calibration methods from regular CAT were examined: Method A (Stocking, 1988), the marginal maximum likelihood estimation with one EM cycle (OEM) method (Wainer & Mislevy, 1990), and the marginal maximum likelihood estimation with multiple EM cycles (MEM) method (Ban, Hanson, Wang, Yi, & Harris, 2001). With certain theoretical justifications based on the Deterministic Inputs, Noisy “and” Gate (DINA) model, these methods were generalized to CD-CAT and denoted CD-Method A, CD-OEM, and CD-MEM, respectively.
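To make the calibration idea concrete, the following minimal Python sketch shows the DINA item response function and two simple ways a new item's guessing and slipping parameters could be estimated. It is an illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the first estimator assumes (in the spirit of Method A) that the estimated knowledge states are treated as if they were true, and the second assumes that "one EM cycle" means one E-step using posteriors obtained from the operational items followed by a single closed-form M-step.

```python
import numpy as np

def dina_eta(alpha, q):
    """Ideal response under DINA: 1 if every attribute required by q is mastered."""
    alpha = np.asarray(alpha)
    q = np.asarray(q)
    # For binary entries, alpha >= q on every attribute equals prod_k alpha_k ** q_k.
    return np.all(alpha >= q, axis=-1).astype(int)

def dina_prob(alpha, q, guess, slip):
    """P(correct) = 1 - slip when eta = 1, and guess when eta = 0."""
    eta = dina_eta(alpha, q)
    return np.where(eta == 1, 1.0 - slip, guess)

def calibrate_known_states(responses, alpha_hat, q):
    """Assumed Method-A-style estimate: treat estimated states as known.

    With eta fixed, the maximum likelihood estimates are simple proportions.
    Returns NaN for a parameter if no examinee falls in the relevant group.
    """
    responses = np.asarray(responses, dtype=float)
    eta = dina_eta(alpha_hat, q)
    guess = responses[eta == 0].mean() if np.any(eta == 0) else np.nan
    slip = 1.0 - responses[eta == 1].mean() if np.any(eta == 1) else np.nan
    return guess, slip

def calibrate_one_em_cycle(responses, posteriors, q, patterns):
    """Assumed OEM-style estimate: one E-step, then one closed-form M-step.

    posteriors[i, c] is examinee i's posterior probability of state patterns[c],
    computed beforehand from the operational items only (an assumption here).
    """
    responses = np.asarray(responses, dtype=float)
    eta = dina_eta(patterns, q)              # ideal response for each state
    w1 = np.asarray(posteriors) @ eta        # P(eta = 1) for each examinee
    w0 = 1.0 - w1
    guess = np.sum(w0 * responses) / np.sum(w0)
    slip = np.sum(w1 * (1.0 - responses)) / np.sum(w1)
    return guess, slip
```

When every posterior puts all of its mass on a single state, the second estimator reduces to the first, which is one way to see the known-states approach as a limiting case of the marginal approach. A multiple-cycle variant would repeat the E-step with the updated item parameters; that loop is omitted here.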
Two simulation studies were conducted to compare the three CD-CAT on-line calibration methods in terms of item-parameter recovery. In the first study, the new items were randomly assigned to examinees and then calibrated. A total of 2000 examinees were generated, each with a 50% probability of mastering each attribute. A total of 360 operational items were simulated, with guessing and slipping parameters drawn randomly from U(0.05, 0.25). Twenty new items were simulated; their Q matrix was constructed by randomly selecting 20 rows from the Q matrix of the operational items, and their item parameters were also drawn randomly from U(0.05, 0.25). The Shannon entropy method was used to select the next item, and the maximum a posteriori method was used to update the examinees' knowledge state (KS) estimates. In the second study, the new items were first administered to a subgroup of examinees and pre-calibrated; for the remaining examinees, the new items were selected adaptively, based on their initial parameter estimates, to fit each examinee’s current KS estimate; finally, the parameters of the new items were re-calibrated sequentially. All simulation conditions in the second study were the same as in the first study, except that the new items were selected adaptively.
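The Study 1 setup described above can be sketched in Python as follows. This is a minimal illustration, not the authors' code. Several values are assumptions because the text does not state them: the number of attributes (`K = 5`), the way the operational Q matrix is generated (random non-zero rows), sampling the 20 rows without replacement, the test length (`test_length=20`), and the uniform prior over knowledge states. All function names are hypothetical.

```python
import numpy as np
from itertools import product

rng = np.random.default_rng(0)
K = 5                                     # number of attributes: assumed, not in the text
N_EXAMINEES, N_OPER, N_NEW = 2000, 360, 20

# Each examinee masters each attribute with probability 0.5.
alpha = rng.integers(0, 2, size=(N_EXAMINEES, K))

# All 2^K candidate knowledge states; non-zero rows serve as possible Q-matrix rows.
patterns = np.array(list(product([0, 1], repeat=K)))
nonzero = patterns[patterns.sum(axis=1) > 0]
q_oper = nonzero[rng.integers(0, len(nonzero), size=N_OPER)]   # assumed Q design
guess_oper = rng.uniform(0.05, 0.25, size=N_OPER)
slip_oper = rng.uniform(0.05, 0.25, size=N_OPER)

# New items: 20 rows drawn from the operational Q matrix, fresh parameters.
new_rows = rng.choice(N_OPER, size=N_NEW, replace=False)
q_new = q_oper[new_rows]
guess_new = rng.uniform(0.05, 0.25, size=N_NEW)
slip_new = rng.uniform(0.05, 0.25, size=N_NEW)

def p_correct(q, guess, slip):
    """DINA P(correct | state) for every item and state, shape (n_items, 2^K)."""
    eta = np.all(patterns[None, :, :] >= q[:, None, :], axis=2)
    return np.where(eta, 1.0 - slip[:, None], guess[:, None])

def entropy(p):
    p = p[p > 0]
    return -np.sum(p * np.log(p))

def update_posterior(posterior, p_item, x):
    like = p_item if x == 1 else 1.0 - p_item
    post = posterior * like
    return post / post.sum()

def select_item_shannon(posterior, p_items, available):
    """Pick the available item with the smallest expected posterior entropy."""
    best, best_val = None, np.inf
    for j in available:
        val = 0.0
        for x in (0, 1):
            like = p_items[j] if x == 1 else 1.0 - p_items[j]
            marginal = np.sum(posterior * like)       # P(response = x)
            if marginal > 0:
                val += marginal * entropy(posterior * like / marginal)
        if val < best_val:
            best, best_val = j, val
    return best

def run_cat(true_alpha, test_length=20):
    """Administer operational items adaptively and return the MAP state estimate."""
    p_items = p_correct(q_oper, guess_oper, slip_oper)
    posterior = np.full(len(patterns), 1.0 / len(patterns))   # uniform prior (assumed)
    available = set(range(N_OPER))
    for _ in range(test_length):
        j = select_item_shannon(posterior, p_items, available)
        available.remove(j)
        mastered = np.all(true_alpha >= q_oper[j])
        p = 1.0 - slip_oper[j] if mastered else guess_oper[j]
        x = int(rng.random() < p)                     # simulated response
        posterior = update_posterior(posterior, p_items[j], x)
    return patterns[np.argmax(posterior)]
```

Calling `run_cat(alpha[0])` returns one examinee's estimated knowledge state as a 0/1 vector of length `K`. In a random calibration design such as Study 1, responses to randomly assigned new items would be simulated alongside this loop and then passed to a calibration routine; that step is left out of the sketch.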
The results of Study 1 indicated that CD-Method A outperformed the other two methods, yielding the smallest estimation errors, and that the simulated CD-CAT provided relatively accurate KS estimates for the examinees. The results of Study 2 showed that, compared with the random calibration design, the adaptive calibration design could improve item-parameter recovery for CD-Method A and CD-OEM.
Although the results of the two studies are very encouraging, future studies should examine further conditions, such as different sample sizes, different cognitive diagnostic models, and different attribute hierarchical structures.

Key words: computerized adaptive testing, cognitive diagnosis, on-line calibration, operational item, new item

CHEN Ping, XIN Tao. (2011). Developing On-line Calibration Methods for Cognitive Diagnostic Computerized Adaptive Testing. , 43(06), 710-724.

