The Block Item Pocket Method to Allow Item Review in CAT

doi:10.3724/SP.J.1041.2015.01188

Abstract

Abstract:

Most computerized adaptive testing (CAT) do not allow examinees to review items because it will drastically decrease measurement precision and bring about extra cheating strategies (Wainer, 1993; Wise, 1996). Allowing item review is essential to make CAT comparable with traditional tests. It also matters in application. Item review enables examinees to correct mistakes due to carelessness, which can further improve the precision of ability estimation. No such option may cause some negative consequences for their overall performance especially in high-stake examinations, such as tension or anxiety (Vispoel, Henderickson, & Bleiler, 2000). Therefore, it is worth trying if allowing item review could alleviate problems mentioned at the beginning (Wise, 1996; Vispoel, 2000, 2005).

Several methods have been proposed, including the successive block method (Stocking, 1997) and the item pocket (IP) method (Han, 2013). However, both methods are limited in some ways. Stocking’s method does not allow examinees to skip items and requires a large number of blocks which may bring about some extra adverse effects because of frequent decision to go to next block. Han’s method can avoid limitations of Stocking’s. But it requires an appropriate IP size and may result in high bias in large IP size situation. The present study proposed the block item pocket (BIP) method which sets fewer but larger blocks with a proper total IP size. This method keeps advantages of Stocking’s and Han’s and overcomes their disadvantages.

Two simulation studies of two response strategies were conducted to evaluate validity of the BIP method. Item parameters were randomly drawn from uniform distribution (b ~ U (-3, 3)) and (α ~ U (0, 2)). Each examinee was administered a fixed-length CAT with 30 items. The initial item for each examinee was randomly drawn from θ ~ U (-0.5, 0.5). For the CAT administration, the Maximum Fisher Information method was adopted to select items. The interim and final scores were estimated using MLE method in most conditions. When responses were less than 5 or when all answers were correct or wrong, EAP method was adopted. Each study contained five conditions: non-review, 1 blocks IP method, 2 blocks, 3 blocks and 6 blocks BIP method. Statistics like BIAS, MAE, and RMSE were used as evaluation criteria.

Results indicated that: (1) BIP method had better estimate precision than IP method at low ability level under normal strategy; (2) When dealing with Wainer-like strategy, BIP method was far more precise than item pocket method at all ability levels; (3) As the number of blocks increased, estimate precision got closer to non-review condition. Advantages of this new method and future directions were discussed.

Key words: computerized adaptive testing, item review, item pocket method, answer change, block item pocket method

LIN Zhe; CHEN Pin; XIN Tao. (2015). The Block Item Pocket Method to Allow Item Review in CAT. Acta Psychologica Sinica, 47(9), 1188-1198.

[1]	TAN Qingrong, WANG Daxun, LUO Fen, CAI Yan, TU Dongbo. A high-efficiency and new online calibration method in CD-CAT based on information gain of entropy and EM algorithm [J]. Acta Psychologica Sinica, 2021, 53(11): 1286-1298.
[2]	CHEN Ping. Two new online calibration methods for computerized adaptive testing [J]. Acta Psychologica Sinica, 2016, 48(9): 1184-1198.
[3]	GUO Lei; ZHENG Chanjin; BIAN Yufang; SONG Naiqing; XIA Lingxiang. New item selection methods in cognitive diagnostic computerized adaptive testing: Combining item discrimination indices [J]. Acta Psychologica Sinica, 2016, 48(7): 903-914.
[4]	DAI Buyun; ZHANG Minqiang; JIAO Can; LI Guangming; ZHU Huawei; ZHANG Wenyi. Item Selection Using the Multiple-Strategy RRUM Based on CD-CAT [J]. Acta Psychologica Sinica, 2015, 47(12): 1511-1519.
[5]	GUO Lei; ZHENG Chanjin; BIAN Yufang. Exposure Control Methods and Termination Rules in Variable-Length Cognitive Diagnostic Computerized Adaptive Testing [J]. Acta Psychologica Sinica, 2015, 47(1): 129-140.
[6]	GUO Lei;WANG Zhuoran;WANG Feng;BIAN Yufang. a-Stratified Methods Combining Item Exposure Control and General Test Overlap in Computerized Adaptive Testing [J]. Acta Psychologica Sinica, 2014, 46(5): 702-713.
[7]	MAO Xiuzhen; XIN Tao. A Comparison of Item Selection Methods for Cognitive Diagnostic Computerized Adaptive Testing with Nonstatistical Constraints [J]. Acta Psychologica Sinica, 2014, 46(12): 1910-1922.
[8]	MAO Xiuzhen;XIN Tao. A Comparison of Item Selection Methods for Controlling Exposure Rate in Cognitive Diagnostic Computerized Adaptive Testing [J]. Acta Psychologica Sinica, 2013, 45(6): 694-703.
[9]	LUO Fen,DING Shu-Liang,WANG Xiao-Qing. Dynamic and Comprehensive Item Selection Strategies for Computerized Adaptive Testing Based on Graded Response Model [J]. , 2012, 44(3): 400-412.
[10]	WANG Wen-Yi,DING Shu-Liang,YOU Xiao-Feng. On-Line Item Attribute Identification in Cognitive Diagnostic Computerized Adaptive Testing [J]. , 2011, 43(08): 964-976.
[11]	CHEN Ping,XIN Tao. Item Replenishing in Cognitive Diagnostic Computerized Adaptive Testing [J]. , 2011, 43(07): 836-850.
[12]	CHEN Ping,XIN Tao. Developing On-line Calibration Methods for Cognitive Diagnostic Computerized Adaptive Testing [J]. , 2011, 43(06): 710-724.
[13]	CHENG Xiao-Yang,DING Shu-Liang,YAN Shen-Hai,ZHU Long-Yin. New Item Selection Criteria of Computerized Adaptive Testing with Exposure-Control Factor [J]. , 2011, 43(02): 203-212.
[14]	CHEN Ping,DING Shu-Liang . Research on Computerized Adaptive Testing that Allows Reviewing and Changing Answers [J]. , 2008, 40(06): 737-747.
[15]	Lin Haijing,Ding-Shuliang. An Exploration and Realization of Computerized Adaptive Testing with Cognitive Diagnosis [J]. , 2007, 39(04): 747-753.

The Block Item Pocket Method to Allow Item Review in CAT

Knowledge

Review File

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments