ISSN 0439-755X
CN 11-1911/B

中国科学院心理研究所

• 论文 •

### 结合a分层的兼具项目曝光和广义测验重叠率控制的选题策略

1. (1北京师范大学认知神经科学与学习国家重点实验室; 2中国基础教育质量评价与提升协同创新中心, 北京 100875)
• 收稿日期:2013-06-14 出版日期:2014-05-24 发布日期:2014-05-24
• 通讯作者: 边玉芳
• 基金资助:

高等学校博士学科点专项科研基金资助课题(20120003110002)的资助。

### a-Stratified Methods Combining Item Exposure Control and General Test Overlap in Computerized Adaptive Testing

GUO Lei;WANG Zhuoran;WANG Feng;BIAN Yufang

1. (1 National Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing 100875, China) (2 National Cooperative Innovation Center for Assessment and Improvement of Basic Education Quality, Beijing Normal University, Beijing 100875, China)
• Received:2013-06-14 Online:2014-05-24 Published:2014-05-24
• Contact: BIAN Yufang

Abstract:

Test security and item pool utilization rate are very important in computerized adaptive testing (CAT), especially in the high-stakes tests. Most existing methods only focus on the item exposure rate, but rarely control the test overlap rate. Way (1998) suggested that the item exposure and the test overlap rate be two indices of test security. Following this reasoning, Chen (2010) proposed an on–line version of the Sympson-Hetter procedure with general test overlap control (SHGT) that didn’t need iterative simulations. Although the SHGT method could control item exposure and general test overlap simultaneously without iterative simulations, the item pool utilization rate was not very ideal when the item exposure or test overlap rate was slightly high or the number of examinees who shared the information with another examinee was small. Thus, the test security was threatened. To address the limitation of the SHGT method, we combined the a-stratified method with the SHGT method, and proposed three new methods: SHGT_a method, SHGT_b method, and SHGT_c method. Simulation results indicated that: (1) Compared to the SHGT method, these three new methods could not only improve the item pool utilization rate, but also maintain a very high precision of ability estimate in the same experiment condition; (2) With the increase of maximum item exposure rate (rmax) and maximum general test overlap rate ( ), and the decrease of a, the precision of ability estimation increased. Compared to the SHGT method, these three new methods could maintain a higher item pool utilization rate; (3) SHGT_b and SHGT_c outperformed SHGT_a in the aspect of the precision of ability estimation when rab was high; (4) The three new methods had a good performance in respect of the precision of ability estimation; (5) Compared to the SHGT method, these three new methods could also effectively solve the problem of item exposure over-control. Some future directions of study were suggested at the end of this paper.