认知诊断计算机化自适应测验中在线标定方法的开发

心理学报 ›› 2011, Vol. 43 ›› Issue (06): 710-724.

• • 上一篇

认知诊断计算机化自适应测验中在线标定方法的开发

陈平;辛涛

北京师范大学发展心理研究所, 北京 100875

收稿日期:2010-11-16 修回日期:1900-01-01 发布日期:2011-06-30 出版日期:2011-06-30
通讯作者: 辛涛

Developing On-line Calibration Methods for Cognitive Diagnostic Computerized Adaptive Testing

CHEN Ping;XIN Tao

Institute of Developmental Psychology, Beijing Normal University, Beijing 100875, China

Received:2010-11-16 Revised:1900-01-01 Online:2011-06-30 Published:2011-06-30
Contact: XIN Tao

摘要/Abstract

摘要： 项目增补对认知诊断计算机化自适应测验(CD-CAT)中的题库维护至关重要。在传统CAT中, 在线标定方法经常用于估计新题的项目参数。然而直到现在, 在CD-CAT领域还没有任何关于在线标定的论文公开发表。为将传统CAT中3种有代表性的在线标定方法(Method A、OEM和 MEM)推广至CD-CAT (CD-Method A、CD-OEM和CD-MEM)建立分析基础, 并采用模拟方法对这3种方法进行比较。研究表明：CD-Method A方法在项目参数的返真性方面优于其它两种方法; 自适应标定设计较随机标定设计可以提高项目参数的返真质量。

关键词: 计算机化自适应测验, 认知诊断, 在线标定, 旧题, 新题

Abstract: Like all computerized adaptive testing (CAT) applications, some items in the item bank maybe flawed or obsolete or overexposed and they should be replaced by new items (Wainer & Mislevy, 1990), item replenishing is essential for item bank maintenance and development in cognitive diagnostic CAT (CD-CAT). In regular CAT, on-line calibration method is commonly used to calibrate the item parameters of new items. However, until now no reference is publicly available about on-line calibration for CD-CAT. Thus, this study investigated the possibility to extend some current methods used in CAT to CD-CAT situation. Three representative on-line calibration methods in regular CAT were under investigation: Method A (Stocking, 1988), marginal maximum likelihood estimate with one EM cycle (OEM) method (Wainer & Mislevy, 1990) and marginal maximum likelihood estimate with multiple EM cycles (MEM) method (Ban, Hanson, Wang, Yi, & Harris, 2001). Under certain theoretical justifications based on the Deterministic Inputs, Noisy “and” Gate (DINA) model, these methods were generalized to CD-CAT situation, denoted as CD-Method A, CD-OEM and CD-MEM, respectively.
Two simulation studies were conducted to compare the performance of the three CD-CAT on-line calibration methods in terms of item-parameter recovery. In the first study, the new items were randomly assigned to the examinees and then were calibrated accordingly. 2000 examinees were generated assuming that each examinee has 50% probability of mastering each attribute, 360 operational items were simulated and their guessing and slipping parameters were all randomly drawn from U (0.05, 0.25). 20 new items were simulated and the Q matrix corresponding to the new items was constructed by randomly selecting 20 rows from the Q matrix corresponding to the operational items, and the item parameters of new items were also randomly drawn from U (0.05, 0.25). The Shannon Entropy method was employed to select the next item and the Maximum A Posterior method was used to update the knowledge state (KS) estimates of examinees. As to the second study, the new items were first administered to a sub-group of the examinees and then were pre-calibrated; then for the remaining examinees, the new items were selected adaptively according to their initial parameter estimates to fit the examinee’s current KS estimates; finally, the item parameters of the new items were re-calibrated sequentially. Note that all the simulation conditions in the second study remained the same as those in the first study except the new items were adaptively selected.
The results of Study 1 indicated that CD-Method A outperformed the other two methods in that it yielded the smallest estimation errors, and the simulated CD-CAT test was able to provide relatively accurate KS estimates for the examinees. The results of Study 2 showed that the adaptive calibration design could improve the item-parameter recovery compared with the random calibration design for CD-Method A, CD-Method A and CD-OEM.
Though the results from the two studies are very encouraging, further studies are proposed for the future investigations such as different sample sizes, different cognitive diagnostic models and different attribute hierarchical structures.

Key words: computerized adaptive testing, cognitive diagnosis, on-line calibration, operational item, new item

陈平,辛涛. (2011). 认知诊断计算机化自适应测验中在线标定方法的开发. 心理学报, 43(06), 710-724.

CHEN Ping,XIN Tao. (2011). Developing On-line Calibration Methods for Cognitive Diagnostic Computerized Adaptive Testing. , 43(06), 710-724.

[1]	田亚淑, 詹沛达, 王立君. 联合作答精度和作答时间的概率态认知诊断模型[J]. 心理学报, 2023, 55(9): 1573-1586.
[2]	游晓锋, 杨建芹, 秦春影, 刘红云. 认知诊断测评中缺失数据的处理：随机森林阈值插补法[J]. 心理学报, 2023, 55(7): 1192-1206.
[3]	刘彦楼, 陈启山, 王一鸣, 姜晓彤. 模型参数点估计的可靠性：以CDM为例[J]. 心理学报, 2023, 55(10): 1712-1728.
[4]	刘彦楼, 吴琼琼. 认知诊断模型Q矩阵修正：完整信息矩阵的作用[J]. 心理学报, 2023, 55(1): 142-158.
[5]	孙小坚, 郭磊. 考虑题目选项信息的非参数认知诊断计算机自适应测验[J]. 心理学报, 2022, 54(9): 1137-1150.
[6]	李佳, 毛秀珍, 韦嘉. 一种简单有效的Q矩阵修正新方法[J]. 心理学报, 2022, 54(8): 996-1008.
[7]	刘彦楼. 认知诊断模型的标准误与置信区间估计：并行自助法[J]. 心理学报, 2022, 54(6): 703-724.
[8]	宋枝璘, 郭磊, 郑天鹏. 认知诊断缺失数据处理方法的比较：零替换、多重插补与极大似然估计法[J]. 心理学报, 2022, 54(4): 426-440.
[9]	詹沛达. 引入眼动注视点的联合-交叉负载多模态认知诊断建模[J]. 心理学报, 2022, 54(11): 1416-1423.
[10]	郭磊, 周文杰. 基于选项层面的认知诊断非参数方法[J]. 心理学报, 2021, 53(9): 1032-1043.
[11]	谭青蓉, 汪大勋, 罗芬, 蔡艳, 涂冬波. 一种高效的CD-CAT在线标定新方法：基于熵的信息增益与EM视角[J]. 心理学报, 2021, 53(11): 1286-1300.
[12]	罗芬, 王晓庆, 蔡艳, 涂冬波. 基于基尼指数的双目标CD-CAT选题策略[J]. 心理学报, 2020, 52(12): 1452-1465.
[13]	汪大勋, 高旭亮, 蔡艳, 涂冬波. 基于类别水平的多级计分认知诊断Q矩阵修正：相对拟合统计量视角[J]. 心理学报, 2020, 52(1): 93-106.
[14]	詹沛达, 于照辉, 李菲茗, 王立君. 一种基于多阶认知诊断模型测评科学素养的方法[J]. 心理学报, 2019, 51(6): 734-746.
[15]	高旭亮, 汪大勋, 王芳, 蔡艳, 涂冬波. 基于分部评分模型思路的多级评分认知诊断模型开发[J]. 心理学报, 2019, 51(12): 1386-1397.

认知诊断计算机化自适应测验中在线标定方法的开发

Developing On-line Calibration Methods for Cognitive Diagnostic Computerized Adaptive Testing

PDF (PC)

可视化

English Version

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价