认知诊断CAT中具有非统计约束选题方法的比较

doi:10.3724/SP.J.1041.2014.01910

心理学报 ›› 2014, Vol. 46 ›› Issue (12): 1910-1922.doi: 10.3724/SP.J.1041.2014.01910

认知诊断CAT中具有非统计约束选题方法的比较

毛秀珍¹ ;辛涛²

(¹四川师范大学教育科学学院, 成都 610066) (²北京师范大学发展心理研究所, 北京 100875)

收稿日期:2014-01-05 发布日期:2014-12-25 出版日期:2014-12-25
通讯作者: 辛涛, E-mail: xintao@bnu.edu.cn
基金资助:
四川省教育厅自然科学青年科研基金(13ZB0155)、国家自然科学青年项目(31400897)资助。

A Comparison of Item Selection Methods for Cognitive Diagnostic Computerized Adaptive Testing with Nonstatistical Constraints

MAO Xiuzhen¹; XIN Tao²

(¹ Institute of Education Science, Sichuan normal university, Chengdu 610066, China) (² Institute of Developmental Psychology, Beijing normal university, Beijing 100875, China)

Received:2014-01-05 Online:2014-12-25 Published:2014-12-25
Contact: XIN Tao, E-mail: xintao@bnu.edu.cn

摘要/Abstract

摘要：

项目曝光控制和内容约束关系到测验安全、测验的信度和效度, 是计算机化自适应测验(Computerized Adaptive Testing, CAT)中两类重要的非统计约束条件。本文在认知诊断CAT中针对内容约束和项目曝光控制要求, 运用5种方法选择测验项目。它们分别是：(1) Monte Carlo方法与项目合格方法相结合, 记为MC-IE; (2) Monte Carlo方法与最大优先指标方法相结合, 记为MC-MPI; (3) Monte Carlo方法与限制阈值方法相结合, 记为MC-RT; (4) Monte Carlo方法与限制进度指标方法相结合, 记为MC-RPG以及(5) Monte Carlo方法与最大后验概率方法相结合, 记为MC-PP。然后通过在线性、收敛、发散、无结构和独立五种属性结构下构建题库并运用重参化融融统和模型模拟被试反应比较它们的选题表现。研究发现, (1) 相同选题方法在不同属性结构下项目曝光率的分布类似, 测量精度按线性、收敛、发散、无结构和独立结构的顺序依次降低; (2) 相同属性结构下, 不同方法的测量精度高低依次为MC-PP、MC-IE、MC-RT、MC-MPI和MC-RPG方法; 项目曝光均匀性优劣依次为MC-RPG、MC-MPI、MC-RT、MC-IE和MC-PP方法。统一量纲值表明, MC-RPG方法的综合表现最好, MC-MPI方法的表现次之。

关键词: 认知诊断理论, 计算机化自适应测验, 测量精度, 项目曝光率, 内容约束。

Abstract:

It is well known that items in the bank of computerized adaptive testing (CAT) are always expected to be used equally. For one thing, a good deal of manpower and financial resources spent on constructing the item bank will surely be wasted if a large proportion of items are seldom exposed or even never be used. For the other, works for ensuring the test security and maintaining the item bank will become serious for test practitioners if items are exposed extremely skewed. In addition to controlling the item exposure, tests which assembled for different examinees are usually required to satisfy many constraints, such as (a) the well-proportional of each content domain; (b) the “enemy items” could not be appeared in the same test, and (c) the appropriate balance of item keys. Supposing some constraints are violated, it will give some unexpected reactions during the test and result in inaccuracy of trait estimates. Therefore, both item exposure control and content constraints are important non-statistical constraints. They have great influence on the test validity, measurement accuracy and comparability among examinees. So, they need to be incorporated into the designing of item selection for CAT in practical settings. When cognitive diagnostic theory is used in CAT, examinees can receive more detailed diagnostic information regarding their mastery of every attribute. Therefore, cognitive diagnostic CAT (CD-CAT) is a promising research area and has gained much attention because it integrates both the cognitive diagnostic method and adaptive testing. The present study compared the performances of five item selection methods in CD-CAT with item exposure control and content constraints. The item selection methods applied are (a) incorporating the Monte Carlo approach into the item eligibility approach (MC-IE); (b) incorporating the maximum priority index method into the Monte Carlo approach (MC-MPI); (c) incorporating the restrictive threshold method into the Monte Carlo approach (MC-RT); (d) incorporating the restrictive progressive method into the Monte Carlo approach (MC-RPG), and (e) incorporating the maximum post probability of knowledge states method into the Monte Carlo approach (MC-PP). The reparameterized unified model was implemented in the simulation experiments to generate item responses with respect to five item banks constructed according to attribute structures of linear, convergent, divergent, unstructured and independent, respectively. Results indicate that (a) the distributions of item exposure produced by the same item selection method in different item banks are similar, (b) the measurement precisions of each item selection method yield in attribute structures of linear, convergent, divergent, unstructured and independent are decreased gradually; (c) the performances of different item selection methods ordered by the measurement accuracy in each test condition are methods of the MC-PP, the MC-IE, the MC-MPI, the MC-RT, and the MC-RPG; their performances in terms of item exposure control are sorted in the opposite order. According to the value of uniformly dimensional, the MC-RPG method yields a best balance between item exposure control and test accuracy while satisfying some content constraints, and then followed by the MC-MPI method.

Key words: cognitive diagnostic theory, computerized adaptive testing, measurement accuracy, item exposure rate, content constraints.

毛秀珍;辛涛. (2014). 认知诊断CAT中具有非统计约束选题方法的比较. 心理学报, 46(12), 1910-1922.

MAO Xiuzhen; XIN Tao. (2014). A Comparison of Item Selection Methods for Cognitive Diagnostic Computerized Adaptive Testing with Nonstatistical Constraints. Acta Psychologica Sinica, 46(12), 1910-1922.

[1]	陈平. 两种新的计算机化自适应测验在线标定方法[J]. 心理学报, 2016, 48(9): 1184-1198.
[2]	郭磊; 郑蝉金; 边玉芳; 宋乃庆; 夏凌翔. 认知诊断计算机化自适应测验中新的选题策略：结合项目区分度指标[J]. 心理学报, 2016, 48(7): 903-914.
[3]	林喆;陈平;辛涛. 允许CAT题目检查的区块题目袋方法[J]. 心理学报, 2015, 47(9): 1188-1198.
[4]	罗照盛;喻晓锋;高椿雷;李喻骏;彭亚风;王睿;王钰彤. 基于属性掌握概率的认知诊断计算机化自适应测验选题策略[J]. 心理学报, 2015, 47(5): 679-688.
[5]	郭磊;郑蝉金;边玉芳. 变长CD-CAT中的曝光控制与终止规则[J]. 心理学报, 2015, 47(1): 129-140.
[6]	郭磊;王卓然;王丰;边玉芳. 结合a分层的兼具项目曝光和广义测验重叠率控制的选题策略[J]. 心理学报, 2014, 46(5): 702-713.
[7]	毛秀珍;辛涛. 认知诊断CAT中项目曝光控制方法的比较[J]. 心理学报, 2013, 45(6): 694-703.
[8]	罗芬,丁树良,王晓庆. 多级评分计算机化自适应测验动态综合选题策略[J]. 心理学报, 2012, 44(3): 400-412.
[9]	陈平,辛涛. 认知诊断计算机化自适应测验中的项目增补[J]. 心理学报, 2011, 43(07): 836-850.
[10]	陈平,辛涛. 认知诊断计算机化自适应测验中在线标定方法的开发[J]. 心理学报, 2011, 43(06): 710-724.
[11]	程小扬,丁树良,严深海,朱隆尹. 引入曝光因子的计算机化自适应测验选题策略[J]. 心理学报, 2011, 43(02): 203-212.
[12]	陈平,丁树良. 允许检查并修改答案的计算机化自适应测验[J]. 心理学报, 2008, 40(06): 737-747.
[13]	林海菁,丁树良. 具有认知诊断功能的计算机化自适应测验的研究与实现[J]. 心理学报, 2007, 39(04): 747-753.
[14]	戴海琦,陈德枝,丁树良,邓太萍. 多级评分题计算机自适应测验选题策略比较[J]. 心理学报, 2006, 38(05): 778-783.
[15]	陈平,丁树良,林海菁,周婕. 等级反应模型下计算机化自适应测验选题策略[J]. 心理学报, 2006, 38(03): 461-467.

认知诊断CAT中具有非统计约束选题方法的比较

A Comparison of Item Selection Methods for Cognitive Diagnostic Computerized Adaptive Testing with Nonstatistical Constraints

PDF (PC)

评审附件

可视化

English Version

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价