ISSN 0439-755X
CN 11-1911/B

心理学报 ›› 2016, Vol. 48 ›› Issue (12): 1625-1630.doi: 10.3724/SP.J.1041.2016.01625

• 论文 • 上一篇    下一篇


简小珠1,2; 戴步云2; 戴海琦2   

  1. (1井冈山大学教育学院, 江西 吉安 343009) (2江西师范大学心理学院, 江西省心理与认知科学重点实验室, 南昌 330022)
  • 收稿日期:2014-11-18 发布日期:2016-12-24 出版日期:2016-12-24
  • 通讯作者: 戴步云, E-mail:
  • 基金资助:


The weighted-score logistic model and Monte Carlo simulation study

JIAN Xiaozhu1,2; DAI Buyun2; DAI Haiqi2   

  1. (1 School of Education, Jinggangshan University, Ji’an 343009, China) (2 School of Psychology, Jiangxi Normal University; Jiangxi Key Laboratory of Psychology and Cognitive Science, Nanchang 330022, China)
  • Received:2014-11-18 Online:2016-12-24 Published:2016-12-24
  • Contact: DAI Buyun, E-mail:


试题难度、试题考查重要性程度加权是多级记分试题的两个基本属性, 因而在IRT项目特征函数中需用不同参数来表示。以往多级记分模型用多个难度参数来描述多级记分试题的难度, 不能有效的表达多级记分试题的分数权重作用。从多级记分试题的分数加权作用角度, 本文提出Logistic加权模型并论述了理论构建思想。在Logistic加权模型下对项目参数估计的EM算法进行推导并编写了相应的参数估计程序。在Logistic加权模型下进行测验模拟, 发现项目参数估计的模拟返真性能良好。

关键词: IRT, Logistic模型, Logistic加权模型, 多级记分模型


Item difficulty and item emphases are the two fundamental properties for the polytomously scored item. Thus, it is necessary to use a special parameter, the weighted-score parameter or the item full mark, to express the emphases of the polytomously scored item. In the previous studies, the researchers had proposed eight polytomous models, e.g., Graded-Response Model, Partial Credit Model, etc. In all the polytomous models, several item difficult parameters are used to represent the item difficulty based on the dichotomous Logistic model. Thus, the polytomous models may not effectively give expression to the item emphases of the polytomous item. A new polytomous model, the weighted-score logistic model (WSLM), is proposed in this study. On the basis of the item emphases of the polytomously scored item, the WSLM model adds the weighted-score parameters into the dichotomous logistic model. The WSLM includes only one difficulty parameter (i.e., the average difficulty parameter) to represent the overall item difficulty, which obviously differs from the other polytomous models. Moreover, in the WSLM, the probability of an examinee responding in category , is of certain functional relation with the average difficulty parameter, discrimination parameter, and the score that the examinee have obtained on this item. Thus, the probability of responding in category under the WSLM can be expressed as . According, the probabilities that an individual will receive the category scores of 0, 1, 2, …, under the WSLM are expressed by: respectively. And all the above probabilities add up to 1. Then, the probability that an examinee will receive a category score of or higher on a polytomously scored item is . It should be noted that, the WSLM reduces to the dichotomous logistic model if . Similarly, the probabilities of the WSLM can also be graphically represented via the category response curves and operating characteristic curves. What’s more, the shapes of the category response curves and operating characteristic curves of the WSLM are very similar to those of GRM. The item full mark is determined when designing the test, which can be considered as the indirect reflection of the common understanding of the weight of the item score. Thus, the item full mark of the polytomously scored item can be treated as the item weighted-score parameter, which does not need to be estimated. Just like the dichotomous logistic model, the discrimination parameter and average difficulty parameter of the WSLM can be estimated by using the classical MMLE/EM algorithm. For a mixed test containing both the dichotomously scored and polytomously scored items, the MMLE/EM algorithm can also be used to estimate all the item parameters. Following the basic procedure of MMLE/EM algorithm, we have written the item parameter estimation programs using the Visual Basic Program, and have successfully estimated the discrimination parameters and the average difficulty parameters of both the dichotomous and polytomous items under the WSLM. A Monte Carlo simulation study was conducted to investigate the performance of WSLM. The results of the simulation tests demonstrated that the ABS and RMSE of item parameters were relatively small. The numerical values of the item full marks can’t almost affect the ABS and RMSE of item parameters. When the scores on the items were not consecutive, the ABS and RMSE of item parameters were relatively small. Moreover, the ABS and RMSE of item parameters under the WSLM was as small as those under the dichotomous Logistic model. In summary, the item-parameter recovery in the simulation tests under the WSLM was effective and acceptable.

Key words: IRT, Logistic model, weighted-score Logistic model, polytomous models.