多维题组效应认知诊断模型

doi:10.3724/SP.J.1041.2015.00689

心理学报 ›› 2015, Vol. 47 ›› Issue (5): 689-701.doi: 10.3724/SP.J.1041.2015.00689

多维题组效应认知诊断模型

詹沛达^1,2;李晓敏³;王文中³;边玉芳²;王立君¹

(¹浙江师范大学心理系, 金华 321004) (²北京师范大学认知神经科学与学习国家重点实验室, 北京 100875)(³香港教育学院评估研究中心, 香港)

收稿日期:2014-05-04 出版日期:2015-05-25 发布日期:2015-05-25
通讯作者: 边玉芳, E-mail: bianyufang66@126.com; 王立君, E-mail: frankwlj@163.com

The Multidimensional Testlet-Effect Cognitive Diagnostic Models

ZHAN Peida^1,2; LI Xiaomin³; WANG Wen-Chung³; BIAN Yufang²; WANG Lijun¹

(¹ Department of Psychology, Zhejiang Normal University, Jinhua 321004, China) (² National Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing 100875, China) (³ Assessment Research Center, The Hong Kong Institute of Education, Hong Kong, China)

Received:2014-05-04 Published:2015-05-25 Online:2015-05-25
Contact: BIAN Yufang, E-mail: bianyufang66@126.com; WANG Lijun, E-mail: frankwlj@163.com

摘要/Abstract

摘要：

当前认知诊断领域还缺少对包含题组的测验进行诊断分析的研究, 即已开发的认知诊断模型无法合理有效地处理含有题组效应的测验数据, 且已开发的题组反应模型也不具有对被试知识结构或认知过程进行诊断的功能。针对该问题, 本文尝试性地将多维题组效应向量参数引入线性Logistic模型中, 同时开发了属性间具有补偿作用的和属性间具有非补偿作用的多维题组效应认知诊断模型。模拟研究结果显示新模型合理有效, 与线性Logistic模型和DINA模型对比研究后表明：(1)作答数据含有题组效应时, 忽略题组效应会导致项目参数的偏差估计并降低对目标属性的判准率; (2)新模型更具普适性, 即便当作答数据不存在题组效应时, 采用新模型进行测验分析亦能得到很好的项目参数估计结果且不影响对目标属性的判准率。整体来看, 新模型既具有认知诊断功能又可有效处理题组效应。

关键词: 认知诊断, 题组, 项目反应理论, 多维题组效应, Logistic题组框架, DINA

Abstract:

Cognitive diagnosis, which is also referred as skill assessment or skill profiling, utilizes latent class models to provide fine-grained information about students’ strength and weakness in the learning process. The outcome of cognitive diagnostic models (CDMs) is a profile with binary element for each examinee to indicate the mastery/nonmastery status of every attribute/skill. Therefore, one major advantage of CDMs is the capacity to provide additional information about the instructional needs of students. In the past decades, extensive research has been conducted in the area of cognitive diagnosis and many statistical models based on a probabilistic approach have been proposed. Examples of CDMs include the deterministic inputs, noisy and gate (DINA) model (Junker & Sijtsma, 2001), the deterministic input, noisy or gate (DINO) model (Templin & Henson, 2006), and the linear Logistic model (LLM) (Maris, 1999). In educational measurement, one of the most commonly used formats is the testlet design, which is a cluster of items that share a common stimulus (e.g., a reading comprehension passage or a figure). Under the framework of item response theory (IRT), various testlet response models (TRM) have been proposed, such as the Rasch testlet model (Wang & Wilson, 2005) and the multidimensional testlet-effect Rasch model (MTERM) (Zhan, Wang, Wang, & Li, 2014). However, limited efforts have been contributed to the development of testlet models for CDMs. A question then naturally arises is the searching for a way to account for testlet effect under CDMs. To address this issue, this study proposed two testlet-CDMs. One followed the compensatory approach and the other followed the noncompensatory approach: (1) the compensatory multidimensional testlet-effect CDM (C-MTECDM) was based on the combination of LLM and MTERM, while (2) the noncompensatory multidimensional testlet-effect CDM (N-MTECDM) was based on the combination of (logit)DINA model and MTERM, respectively. Model parameters can be estimated by the Bayesian methods with Markov chain Monte Carlo (MCMC) algorithms, which have been implemented with the freeware WinBUGS. In study 1, a series of simulations were conducted to evaluate parameter recovery of two new models, and results showed that the model parameters could be recovered fairly well under all simulated conditions. In study 2, the two new models were compared with the LLM and the (logit)DINA model, respectively. Results showed that ignoring testlet effect would result in biased item parameter estimations and worse person classification rates. Additionally, fitting a more complicated model (i.e., MTECDM) to data with a simpler structure did litter harm on parameter recovery. In conclusion, the new models is feasible and flexible.

Key words: cognitive diagnosis, testlet, item response theory, multidimensional testlet-effect, Logistic testlet framework, DINA

詹沛达;李晓敏;王文中;边玉芳;王立君. (2015). 多维题组效应认知诊断模型. 心理学报, 47(5), 689-701.

ZHAN Peida; LI Xiaomin; WANG Wen-Chung; BIAN Yufang; WANG Lijun. (2015). The Multidimensional Testlet-Effect Cognitive Diagnostic Models. Acta Psychologica Sinica, 47(5), 689-701.

[1]	田亚淑, 詹沛达, 王立君. 联合作答精度和作答时间的概率态认知诊断模型[J]. 心理学报, 2023, 55(9): 1573-1586.
[2]	付颜斌, 陈琦鹏, 詹沛达. 问题解决任务中行动序列的二分类建模：单/两参数行动序列模型[J]. 心理学报, 2023, 55(8): 1383-1396.
[3]	游晓锋, 杨建芹, 秦春影, 刘红云. 认知诊断测评中缺失数据的处理：随机森林阈值插补法[J]. 心理学报, 2023, 55(7): 1192-1206.
[4]	刘彦楼, 陈启山, 王一鸣, 姜晓彤. 模型参数点估计的可靠性：以CDM为例[J]. 心理学报, 2023, 55(10): 1712-1728.
[5]	刘彦楼, 吴琼琼. 认知诊断模型Q矩阵修正：完整信息矩阵的作用[J]. 心理学报, 2023, 55(1): 142-158.
[6]	童昊, 喻晓锋, 秦春影, 彭亚风, 钟小缘. 多级计分测验中基于残差统计量的被试拟合研究[J]. 心理学报, 2022, 54(9): 1122-1136.
[7]	孙小坚, 郭磊. 考虑题目选项信息的非参数认知诊断计算机自适应测验[J]. 心理学报, 2022, 54(9): 1137-1150.
[8]	李佳, 毛秀珍, 韦嘉. 一种简单有效的Q矩阵修正新方法[J]. 心理学报, 2022, 54(8): 996-1008.
[9]	刘彦楼. 认知诊断模型的标准误与置信区间估计：并行自助法[J]. 心理学报, 2022, 54(6): 703-724.
[10]	宋枝璘, 郭磊, 郑天鹏. 认知诊断缺失数据处理方法的比较：零替换、多重插补与极大似然估计法[J]. 心理学报, 2022, 54(4): 426-440.
[11]	秦春影, 喻晓锋. 多级属性Q矩阵的验证与估计[J]. 心理学报, 2022, 54(11): 1403-1415.
[12]	詹沛达. 引入眼动注视点的联合-交叉负载多模态认知诊断建模[J]. 心理学报, 2022, 54(11): 1416-1423.
[13]	郭磊, 周文杰. 基于选项层面的认知诊断非参数方法[J]. 心理学报, 2021, 53(9): 1032-1043.
[14]	任赫, 陈平. 两种新的多维计算机化分类测验终止规则[J]. 心理学报, 2021, 53(9): 1044-1058.
[15]	谭青蓉, 汪大勋, 罗芬, 蔡艳, 涂冬波. 一种高效的CD-CAT在线标定新方法：基于熵的信息增益与EM视角[J]. 心理学报, 2021, 53(11): 1286-1300.

多维题组效应认知诊断模型

The Multidimensional Testlet-Effect Cognitive Diagnostic Models

PDF (PC)

评审附件

可视化

English Version

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价