ISSN 0439-755X
CN 11-1911/B
主办:中国心理学会
   中国科学院心理研究所
出版:科学出版社

心理学报 ›› 2022, Vol. 54 ›› Issue (9): 1137-1150.doi: 10.3724/SP.J.1041.2022.01137

• 研究报告 • 上一篇    

考虑题目选项信息的非参数认知诊断计算机自适应测验

孙小坚1,2,3, 郭磊3,4()   

  1. 1西南大学数学与统计学院, 重庆 400715
    2西南大学基础教育研究中心, 重庆 400715
    3中国基础教育质量监测协同创新中心西南大学分中心, 重庆 400715
    4西南大学心理学部, 重庆 400715
  • 收稿日期:2021-12-31 发布日期:2022-07-21 出版日期:2022-09-25
  • 通讯作者: 郭磊 E-mail:happygl1229@swu.edu.cn
  • 基金资助:
    国家自然科学基金青年项目(31900793);中央高校基本科研业务费专项资金(SWU2109222);中国科普研究所委托项目(210105ESR056);2020年度重庆市博士后特别资助项目资助

Nonparametric cognitive diagnostic computerized adaptive testing using multiple-choice option information

SUN Xiaojian1,2,3, GUO Lei3,4()   

  1. 1School of Mathematics and Statistics, Southwest University, Chongqing 400715, China
    2Basic Education Research Centre, Southwest University, Chongqing 400715, China
    3Southwest University Branch, Collaborative Innovation Center of Assessment for Basic Education Quality, Chongqing 400715, China
    4Faculty of Psychology, Southwest University, Chongqing 400715, China
  • Received:2021-12-31 Online:2022-07-21 Published:2022-09-25
  • Contact: GUO Lei E-mail:happygl1229@swu.edu.cn

摘要:

选择题中的作答选项能提供额外诊断信息, 为充分利用选项信息, 研究提出认知诊断计算机自适应测验(CD-CAT)中两种处理选择题选项信息的非参数选题策略和变长终止规则。模拟研究的结果发现:(1)定长条件下两种非参数选题策略的分类准确性整体要高于参数选题策略; (2)两种非参数选题策略较参数选题策略具有更加均衡的题库使用情况; (3)非参数选题策略在两种新的变长终止规则下具有更高的分类准确率; (4)两种非参数选题策略均适用于选择题CD-CAT情境, 使用者可任选其一进行测验分析。

关键词: 认知诊断计算机自适应测验, 题目选项信息, 非参数选题策略, 变长终止规则

Abstract:

Most existing cognitive diagnostic computerized adaptive testing (CD-CAT) item selection methods ignore the diagnostic information that distractors provide for multiple-choice (MC) items. Consequently, some useful information is missed and resources are wasted. To overcome this, researchers proposed the Jensen-Shannon divergence (JSD) strategy to select items with the MC-DINA model. However, the JSD strategy needs large samples to obtain reliable estimates of the item parameters before the formal test, and this could compromise the items in the bank. By contrast, the nonparametric method does not require any parameter calibration before the formal test and can be used in small educational programs.
The current study proposes two nonparametric item selection methods (i.e., HDDmc and JDDmc) for CD-CAT with MC items as well as two termination rules (i.e., MR and DR) for variable-length CD-CAT with MC items. Two simulation studies were conducted to examine the performance of these nonparametric item selection methods and termination rules.
The first study examined the performance of the HDDmc and JDDmc with fixed-length CD-CAT. In this study, six factors were manipulated: the number of attributes (K = 4 vs. 6), the structure of the Q-matrix (simple vs. complex), the quality of the item bank (high vs. low vs. mixed), the distribution of the attribute profile (multivariate normal threshold model vs. discrete uniform distribution), the test length (two vs. three vs. four times of K), and the item selection methods (HDDmc vs. JDDmc vs. JSD). Of these, item selection method was the within-group variable, and the rest were between-group variables. The results showed that: (1) the HDDmc and JDDmc produced higher attribute pattern matched ratios (PMRs) than the JSD method for most conditions; (2) the HDDmc and JDDmc produced similar PMRs for all conditions; (3) the HDDmc and JDDmc produced more even distributions of item exposure than the JSD method.
The second simulation study investigated the performance of the MR and DR with variable-length CD-CAT. Six factors were also manipulated in this study: the settings for the number of attributes, the structure of the Q-matrix, the quality of the item bank, and the distribution of the attribute profile were the same as in the first study; the other two factors were termination rules (MR, DR, D1, and D3) and item selection methods (HDDmc and JDDmc). Again, the first four were between-group variables, while termination rules and item selection methods were within-group variables. The results showed that: (1) the HDDmc and JDDmc yielded higher PMRs for MR and DR rules than for the D1 and D3 rules; (2) the HDDmc and JDDmc yielded longer test lengths for MR and DR rules than for the D1 and D3 rules, especially for the JDD rule.
In sum, both nonparametric item selection methods and the two new termination rules proved appropriate for CD-CAT with MC items, which means they can be used to balance the trade-off between measurement accuracy and item exposure rate.

Key words: cognitive diagnostic computerized adaptive testing, multiple-choice items, nonparametric item selection method, termination rule

中图分类号: