改进的认知诊断模型项目功能差异检验方法 ——基于观察信息矩阵的Wald统计量

doi:10.3724/SP.J.1041.2016.00588

心理学报 ›› 2016, Vol. 48 ›› Issue (5): 588-598.doi: 10.3724/SP.J.1041.2016.00588

• 论文 • 上一篇

改进的认知诊断模型项目功能差异检验方法 ——基于观察信息矩阵的Wald统计量

刘彦楼¹;辛涛^1,2;李令青³;田伟²;刘笑笑¹

(¹北京师范大学发展心理研究所, 北京 100875) (²中国基础教育质量监测协同创新中心, 北京 100875) (³泰山学院教师教育学院, 山东泰安 271000)

收稿日期:2015-09-17 发布日期:2016-05-25 出版日期:2016-05-25
通讯作者: 辛涛, E-mail: xintao@bnu.edu.cn
基金资助:
国家自然科学基金面上项目(31371047); 中央高校基本科研业务费专项资金资助(SKZZX2013028)。

An improved method for differential item functioning detection in cognitive diagnosis models: An application of Wald statistic based on observed information matrix

LIU Yanlou¹; XIN Tao^1,2; LI Lingqing³; TIAN Wei²; LIU Xiaoxiao¹

(¹ Institute of Developmental Psychology, Beijing Normal University, Beijing 100875, China) (² National Innovation Center for Assessment of Basic Education Quality, Beijing 100875, China) (³ School of Teacher Education, Taishan University, Taian 271000, China)

Received:2015-09-17 Online:2016-05-25 Published:2016-05-25
Contact: XIN Tao, E-mail: xintao@bnu.edu.cn

摘要/Abstract

摘要：

Hou, de la Torre和Nandakumar (2014)提出可以使用Wald统计量检验DIF, 但其结果的一类错误率存在过度膨胀的问题。本研究中提出了一个使用观察信息矩阵进行计算的改进后的Wald统计量。结果表明：(1)使用观察信息矩阵计算的这一改进后的Wald统计量在DIF检验中具有良好的一类错误控制率, 尤其是在项目具有较高区分能力的时候, 解决了以往研究中一类错误率过度膨胀的问题。(2)随着样本量的增加以及DIF量的增大, 使用观察信息矩阵计算Wald统计量的统计检验力也在增加。

关键词: Wald统计量, 项目功能差异, 认知诊断模型, 观察信息矩阵, 经验交叉相乘信息矩阵

Abstract:

In cognitive diagnostic models (CDMs), differential item functioning (DIF) refers to the probabilities of success of an item being different for examinees with the same attribute mastery pattern in the groups. The detection of DIF is an important step to ensure the fairness and validity of results from CDMs for all groups. Hou et al. (2014) proposed that the Wald statistic can be used to detect DIF in CDMs. Unfortunately, their results revealed that the Wald statistic based on the information matrix estimation method developed by de la Torre (2009, 2011) yielded inflated Type I error rates. However, Li and Wang (2015) found that the Type I error rates of the Wald statistic in which MCMC algorithms were implemented were slightly inflated in their study under the same conditions. In this study, we proposed an improved Wald statistic based on the observed information matrix for DIF assessment. As a general demonstration, we took the log-linear cognitive diagnosis model (LCDM; Henson et al., 2009) as an example. In this simulation study, in order to compare the results with previous studies (e.g., Hou et al.,2014; Li & Wang, 2015), we followed the simulation design used by Hou et al. (2014), except that we implemented the observed or cross-product (XPD) information matrix in the Wald statistic computation. Parameters set in the studies were: the test length at 30, the number of attributes at 5, and the maximum number of required attributes for an item at 3. Binary item response data were generated from the DINA model. Three sets of true item parameter values were considered for the reference group. Two DIF sizes: .05 and .10, and two types of DIF: uniform and nonuniform, were manipulated. Two sample sizes were considered, 500 and 1,000. Each condition was replicated 1000 times, and the estimation code was written in R (R Core Team, 2015). The simulation results showed that: (1) for the relatively discriminating items, Wald statistic had accurate Type I error control when the observed information matrix was used in its computation. However, when the slip and guessing parameters were large , the Type I error control was slightly conservative. (2) When the XPD information matrix was used for the computation of the Wald statistic, the Type I error control was conservative; that is, the performance of the observed information matrix was better than the XPD information matrix. (3) The number of attributes required for success on the item did not have a notable impact on the Type I error control of Wald statistic, irrespective of whether the observed or the XPD information matrix was used for the statistic. (4) The power rates of Wald statistic for detecting DIF increased as the sample size increased. We conclude that our improved Wald statistic provided follows asymptotically a chi-square distribution with degrees of freedom equal to 2, for DINA model. The improved Wald statistic is a useful and powerful tool for DIF detection in CDMs.

Key words: Wald statistic, differential item functioning, cognitive diagnosis model, observed information matrix, cross-product information matrix

刘彦楼;辛涛;李令青;田伟;刘笑笑. (2016). 改进的认知诊断模型项目功能差异检验方法 ——基于观察信息矩阵的Wald统计量. 心理学报, 48(5), 588-598.

LIU Yanlou; XIN Tao; LI Lingqing; TIAN Wei; LIU Xiaoxiao. (2016). An improved method for differential item functioning detection in cognitive diagnosis models: An application of Wald statistic based on observed information matrix. Acta Psychologica Sinica, 48(5), 588-598.

[1]	刘彦楼, 陈启山, 王一鸣, 姜晓彤. 模型参数点估计的可靠性：以CDM为例[J]. 心理学报, 2023, 55(10): 1712-1728.
[2]	刘彦楼, 吴琼琼. 认知诊断模型Q矩阵修正：完整信息矩阵的作用[J]. 心理学报, 2023, 55(1): 142-158.
[3]	刘彦楼. 认知诊断模型的标准误与置信区间估计：并行自助法[J]. 心理学报, 2022, 54(6): 703-724.
[4]	高旭亮, 汪大勋, 王芳, 蔡艳, 涂冬波. 基于分部评分模型思路的多级评分认知诊断模型开发[J]. 心理学报, 2019, 51(12): 1386-1397.
[5]	詹沛达;陈平;边玉芳. 使用验证性补偿多维IRT模型进行认知诊断评估[J]. 心理学报, 2016, 48(10): 1347-1356.
[6]	蔡艳;涂冬波. 属性多级化的认知诊断模型拓展及其Q矩阵设计[J]. 心理学报, 2015, 47(10): 1300-1308.
[7]	王卓然; 郭磊; 边玉芳. 认知诊断测验中的项目功能差异检测方法比较[J]. 心理学报, 2014, 46(12): 1923-1932.
[8]	张勋;李凌艳;刘红云;孙研. IRT_Δb法和修正LR法对矩阵取样 DIF检验的有效性[J]. 心理学报, 2013, 45(8): 921-934.
[9]	蔡艳;涂冬波;丁树良. 五大认知诊断模型的诊断正确率比较及其影响因素：基于分布形态、属性数及样本容量的比较[J]. 心理学报, 2013, 45(11): 1295-1304.
[10]	刘红云,李冲,张平平,骆方. 分类数据测量等价性检验方法及其比较：项目阈值(难度)参数的组间差异性检验[J]. 心理学报, 2012, 44(8): 1124-1136.
[11]	郑蝉金,郭聪颖,边玉芳. 变通的题组项目功能差异检验方法在篇章阅读测验中的应用[J]. 心理学报, 2011, 43(07): 830-835.
[12]	涂冬波,蔡艳,戴海琦,丁树良. 一种多级评分的认知诊断模型：P-DINA模型的开发[J]. 心理学报, 2010, 42(10): 1011-1020.
[13]	罗,欢,丁树良,汪文义,喻晓锋,曹慧媛1. 属性不等权重的多级评分属性层级方法[J]. 心理学报, 2010, 42(04): 528-538.
[14]	曹亦薇. 项目功能差异在跨文化人格问卷分析中的应用[J]. 心理学报, 2003, 35(01): 120-126.
[15]	曹亦薇,张厚粲. 汉语词汇测验中的项目功能差异初探[J]. 心理学报, 1999, 31(4): 460-467.

改进的认知诊断模型项目功能差异检验方法 ——基于观察信息矩阵的Wald统计量

An improved method for differential item functioning detection in cognitive diagnosis models: An application of Wald statistic based on observed information matrix

PDF (PC)

评审附件

可视化

English Version

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价