ISSN 0439-755X
CN 11-1911/B
主办:中国心理学会
   中国科学院心理研究所
出版:科学出版社

心理学报 ›› 2014, Vol. 46 ›› Issue (12): 1910-1922.doi: 10.3724/SP.J.1041.2014.01910

• 论文 • 上一篇    下一篇

认知诊断CAT中具有非统计约束选题方法的比较

毛秀珍1 ;辛涛2   

  1. (1四川师范大学教育科学学院, 成都 610066) (2北京师范大学发展心理研究所, 北京 100875)
  • 收稿日期:2014-01-05 发布日期:2014-12-25 出版日期:2014-12-25
  • 通讯作者: 辛涛, E-mail: xintao@bnu.edu.cn
  • 基金资助:

    四川省教育厅自然科学青年科研基金(13ZB0155)、国家自然科学青年项目(31400897)资助。

A Comparison of Item Selection Methods for Cognitive Diagnostic Computerized Adaptive Testing with Nonstatistical Constraints

MAO Xiuzhen1; XIN Tao2   

  1. (1 Institute of Education Science, Sichuan normal university, Chengdu 610066, China) (2 Institute of Developmental Psychology, Beijing normal university, Beijing 100875, China)
  • Received:2014-01-05 Online:2014-12-25 Published:2014-12-25
  • Contact: XIN Tao, E-mail: xintao@bnu.edu.cn

摘要:

项目曝光控制和内容约束关系到测验安全、测验的信度和效度, 是计算机化自适应测验(Computerized Adaptive Testing, CAT)中两类重要的非统计约束条件。本文在认知诊断CAT中针对内容约束和项目曝光控制要求, 运用5种方法选择测验项目。它们分别是:(1) Monte Carlo方法与项目合格方法相结合, 记为MC-IE; (2) Monte Carlo方法与最大优先指标方法相结合, 记为MC-MPI; (3) Monte Carlo方法与限制阈值方法相结合, 记为MC-RT; (4) Monte Carlo方法与限制进度指标方法相结合, 记为MC-RPG以及(5) Monte Carlo方法与最大后验概率方法相结合, 记为MC-PP。然后通过在线性、收敛、发散、无结构和独立五种属性结构下构建题库并运用重参化融融统和模型模拟被试反应比较它们的选题表现。研究发现, (1) 相同选题方法在不同属性结构下项目曝光率的分布类似, 测量精度按线性、收敛、发散、无结构和独立结构的顺序依次降低; (2) 相同属性结构下, 不同方法的测量精度高低依次为MC-PP、MC-IE、MC-RT、MC-MPI和MC-RPG方法; 项目曝光均匀性优劣依次为MC-RPG、MC-MPI、MC-RT、MC-IE和MC-PP方法。统一量纲值表明, MC-RPG方法的综合表现最好, MC-MPI方法的表现次之。

关键词: 认知诊断理论, 计算机化自适应测验, 测量精度, 项目曝光率, 内容约束。

Abstract:

It is well known that items in the bank of computerized adaptive testing (CAT) are always expected to be used equally. For one thing, a good deal of manpower and financial resources spent on constructing the item bank will surely be wasted if a large proportion of items are seldom exposed or even never be used. For the other, works for ensuring the test security and maintaining the item bank will become serious for test practitioners if items are exposed extremely skewed. In addition to controlling the item exposure, tests which assembled for different examinees are usually required to satisfy many constraints, such as (a) the well-proportional of each content domain; (b) the “enemy items” could not be appeared in the same test, and (c) the appropriate balance of item keys. Supposing some constraints are violated, it will give some unexpected reactions during the test and result in inaccuracy of trait estimates. Therefore, both item exposure control and content constraints are important non-statistical constraints. They have great influence on the test validity, measurement accuracy and comparability among examinees. So, they need to be incorporated into the designing of item selection for CAT in practical settings. When cognitive diagnostic theory is used in CAT, examinees can receive more detailed diagnostic information regarding their mastery of every attribute. Therefore, cognitive diagnostic CAT (CD-CAT) is a promising research area and has gained much attention because it integrates both the cognitive diagnostic method and adaptive testing. The present study compared the performances of five item selection methods in CD-CAT with item exposure control and content constraints. The item selection methods applied are (a) incorporating the Monte Carlo approach into the item eligibility approach (MC-IE); (b) incorporating the maximum priority index method into the Monte Carlo approach (MC-MPI); (c) incorporating the restrictive threshold method into the Monte Carlo approach (MC-RT); (d) incorporating the restrictive progressive method into the Monte Carlo approach (MC-RPG), and (e) incorporating the maximum post probability of knowledge states method into the Monte Carlo approach (MC-PP). The reparameterized unified model was implemented in the simulation experiments to generate item responses with respect to five item banks constructed according to attribute structures of linear, convergent, divergent, unstructured and independent, respectively. Results indicate that (a) the distributions of item exposure produced by the same item selection method in different item banks are similar, (b) the measurement precisions of each item selection method yield in attribute structures of linear, convergent, divergent, unstructured and independent are decreased gradually; (c) the performances of different item selection methods ordered by the measurement accuracy in each test condition are methods of the MC-PP, the MC-IE, the MC-MPI, the MC-RT, and the MC-RPG; their performances in terms of item exposure control are sorted in the opposite order. According to the value of uniformly dimensional, the MC-RPG method yields a best balance between item exposure control and test accuracy while satisfying some content constraints, and then followed by the MC-MPI method.

Key words: cognitive diagnostic theory, computerized adaptive testing, measurement accuracy, item exposure rate, content constraints.