Item Selection Strategies for Computerized Adaptive Testing with the Generalized Partial Credit Model

Abstract

Abstract:

The objective of computerized adaptive testing (CAT) is to construct an optimal test for each examinee. Item Selection Strategy (ISS) is an important part of CAT research, whose quality is directly related to the reliability, efficiency, and security of the test.
Many researches and applications of CAT are based on a dichotomously scored model. It is highly evident that more information can be obtained from examinees using a polytomously scored model rather than a dichotomous model. Moreover, it is necessary for us to further explore CAT research based on a polytomously scored model.
Both the Generalized Partial Credit Model (GPCM) and the Graded Response Model (GRM) are within the range of a polytomously scored model. However, they differ from each other. In the GRM, the item grade difficulties ascend monotonously as the grades increase; while the GPCM shows the performing process of the item, which is separated into some line-steps to put forwards. In the GPCM, each item contains several step parameters, and there are no specific rules governing them. The posterior step cannot advance when the earlier step has not been completed, and the posterior’s step parameter may be lower than that of the previous one. Considerable research is already being conducted on CAT using the GRM; however, in our country, there are few reports pertaining to research on CAT using the GPCM.
This study investigated the four types of ISS in comparison with CAT in various circumstances, using the GPCM through computer simulated programs. They are implemented in four item pools, and each item pool has a capacity of 3000 items. Each item has five step parameters; further, the discrimination parameter and step parameters are distributed as follows: {(b~N(0,1), (lna~N(0,1)), (b~N(0,1)), (a~U(0.2,2.5)), (b~U(-3,3)), (lna~N(0,1)), (b~U(-3,3)), and (a~U(0.2,2.5)). Item parameters are generated based on the Monte Carlo simulation method. Responses to the items are generated according to the GPCM for a sample of 1000 simulatees ( ) whose trait level was also generated using the Monte Carlo simulation method in some types of ISS. During the course of responses, the simulatees’ ability is estimated based on the response obtained. In addition, after the four item pools are sorted by the discrimination parameter to complete the a-stratified design, the abovementioned process is performed repeatedly. Thirty-two simulated CATs are administered with the output evaluated with regard to the following measurements: precision, ISS steady, item used even, average use of item per person, χ2, efficiency, and item overlap.
The data in tables 1 and 2 include both the index values used for evaluation (which were obtained from the CAT process using four types of ISS when the item pool did not adopt the stratified design and instead adopted the a-stratified design) and values that are calculated after summing the weight of every index value. We can draw the following conclusions from the data in the tables: all the ability estimates are highly accurate and have fewer differences. Moreover, we compare the value by summing every means weight, we learn that the item step parameter distribution greatly influences the choices of ISS.
On the condition that the examinee’s trait level follows normal distribution, the application results of the ISS and the item step parameter distribution share a very close relationship. (1) If the item’s step parameters follow a normal distribution, the efficiency of the ISS for a random step parameter matching the trait level is much better than that for others. (2) If the item’s step parameters follow a uniform distribution, the efficiency of the item selection strategy for the item’s average step parameter matching the trait level is much better than that for others

Key words: IRT, polytomously scored model, GPCM, a-stratified design, item selection strategy

CLC Number:

B841

LIU Zhen,DING Shu-Liang,LIN Hai-Jing. (2008). Item Selection Strategies for Computerized Adaptive Testing with the Generalized Partial Credit Model. , 40(05), 618-625.

[1]	KE Xiaoxiao, QI Huizi, LIANG Jiahui, JIN Xinyuan, GAO Jie, ZHANG Mingxia, WANG Yamin. Situational assessment method of the Chinese people’s holistic thinking characteristics and their application [J]. Acta Psychologica Sinica, 2021, 53(12): 1299-1309.
[2]	LI Meijuan,LIU Yue,LIU Hongyun. Analysis of the Problem-solving strategies in computer-based dynamic assessment: The extension and application of multilevel mixture IRT model [J]. Acta Psychologica Sinica, 2020, 52(4): 528-540.
[3]	GUO Jichengsi,HUANG Jianping,WAN Xiaoang. The influence of target knowledge on path integration [J]. Acta Psychologica Sinica, 2019, 51(2): 188-195.
[4]	ZHANG Weiwei,HUANG Jianping,WAN Xiaoang. Influence of expectations on human path integration [J]. Acta Psychologica Sinica, 2019, 51(11): 1219-1228.
[5]	Xiaojun YUAN, Xiaoxia CUI, Zhengcao CAO, Hong KAN, Xiao WANG, Yamin WANG. Attentional bias towards threatening visual stimuli in a virtual reality-based visual search task [J]. Acta Psychologica Sinica, 2018, 50(6): 622-636.
[6]	HUANG Feng, DING Qian, WEI Hua, HONG Jianzhong. Effects of post thematic characteristics on knowledge sharing in the virtual community: The bystander effect perspective [J]. Acta Psychologica Sinica, 2018, 50(2): 226-234.
[7]	ZHOU Xi; WAN Xiaoang; DU Dikang; XIONG Yilei; HUANG Weixin. Reorientation in uncontinuous virtual reality space [J]. Acta Psychologica Sinica, 2016, 48(8): 924-932.
[8]	ZHANG Jing, CHEN Wei. Is body image plastic? The impact of synchrony and distance reference frame on body ownership [J]. Acta Psychologica Sinica, 2016, 48(8): 933-945.
[9]	YU Liutao; BAO Jianzhang; CHEN Qinghua; WANG Dahui. The effect of individual confidence on dyadic decision making [J]. Acta Psychologica Sinica, 2016, 48(8): 1013-1025.
[10]	GUO Lei; ZHENG Chanjin; BIAN Yufang; SONG Naiqing; XIA Lingxiang. New item selection methods in cognitive diagnostic computerized adaptive testing: Combining item discrimination indices [J]. Acta Psychologica Sinica, 2016, 48(7): 903-914.
[11]	JIAN Xiaozhu; DAI Buyun; DAI Haiqi. The weighted-score logistic model and Monte Carlo simulation study [J]. Acta Psychologica Sinica, 2016, 48(12): 1625-1630.
[12]	GUO Jichengsi; WAN Xiaoang. The Effect of Learning in Virtual Path Integration [J]. Acta Psychologica Sinica, 2015, 47(6): 711-720.
[13]	BIAN Yulong, HAN Lei, ZHOU Chao, CHEN Yingmin, GAO Fengqiang. The Proteus Effect in Virtual Reality Social Environments: Influence of Situation and Shyness [J]. Acta Psychologica Sinica, 2015, 47(3): 363-374.
[14]	HU Bin;FENG Chengzhi. The Different Effects of PM Types, Motivation, and Task Sequence on Prospective Memory [J]. Acta Psychologica Sinica, 2013, 45(9): 944-960.
[15]	LUO Fen,DING Shu-Liang,WANG Xiao-Qing. Dynamic and Comprehensive Item Selection Strategies for Computerized Adaptive Testing Based on Graded Response Model [J]. , 2012, 44(3): 400-412.

Item Selection Strategies for Computerized Adaptive Testing with the Generalized Partial Credit Model

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments