无领导小组讨论的多侧面Rasch模型应用

doi:10.3724/SP.J.1041.2013.01039

心理学报 ›› 2013, Vol. 45 ›› Issue (9): 1039-1049.doi: 10.3724/SP.J.1041.2013.01039

无领导小组讨论的多侧面Rasch模型应用

姚若松;赵葆楠;刘泽;苗群鹰

(¹广州大学教育学院, 广州 510006) (²广州大学外国语学院, 广州 510006)

收稿日期:2012-12-27 发布日期:2013-09-25 出版日期:2013-09-25
通讯作者: 姚若松
基金资助:
广东省哲学社会科学“十一五”规划项目(GD10CGL08)和广州市哲学社会科学发展“十二五”规划项目(13G59)资助。

The Application of Many-Facet Rasch Model in Leaderless Group Discussion

YAO Ruosong;ZHAO Baonan;LIU Ze;MIAO Qunying

(¹Department of Education, Guangzhou University, Guangzhou 510006, China) (² School of Foreign Studies, Guangzhou University, Guangzhou 510006, China)

Received:2012-12-27 Online:2013-09-25 Published:2013-09-25
Contact: YAO Ruosong

摘要/Abstract

摘要： 采用项目反应理论(IRT)的多侧面Rasch模型(MFRM), 分析评价中心技术中无领导小组讨论(LGD)的测评结果, 探讨被试能力水平、评委评分宽严度、评分内部一致性、维度难度和评定等级等问题, 进而讨论各种偏差。通过MFRM分析人事测评结果, 可深入了解被试能力的真实差异、甑别维度难度、探查测评误差源, 从而完善测评试题编制、评估或诊断评委合格性、提高测评维度与测评目的匹配性, 为拓展项目反应理论在人事测评中的应用提供独特视角。

关键词: 无领导小组讨论, 多侧面Rasch模型, 项目反应理论, 人事测评

Abstract: Many-Facet Rasch model (MFRM) of Item Response Theory (IRT) is applied to performance assessment. Domestic and foreign researches applied MFRM in many fields such as analysis of various examinations, medical diagnosis, judgments of life quality and so on. In these assessment tests, ratings were influenced by a variety of factors among which judges played the most important part. This thesis mainly probed into issues covering subjects, judges, rating scales and rating deviation in Leaderless Group Discussion (LGD) of personnel assessment center in personnel assessment to improve the effectiveness and stability of assessment. This study adopted the FACETS software, a MFRM computer statistics program, to establish 3 facets of subjects, judges and rating dimensions to analyze subjects’ abilities, rater severity, inter-rater reliability, dimension difficulty and rating scales. Meanwhile, this study got results of deviation analysis of subjects and judges, judges and dimensions, deviation among judges, subjects and dimensions. The results illustrated significant differences existed among levels of subjects’ ability, rater severity, dimension difficulty and the rating scale. Differences of rater severity generally did not affect the test scores of subjects. Except some judges, other judges’ ratings had good internal consistency. Dimension difficulty could better distinguish subjects’ ability but judges tended to concentrate on using an intermediate rating scale; The results of deviation analysis of judges and subjects, judges and dimension showed that untrained judges E, F had more rating deviations, so it was necessary to monitor their scores and strengthen the training of the two judges. The application of MFRM, IRT’s expansion, to assessment center evaluation could enable evaluators to make the employment decision by estimated ability level of subjects, design tests according to dimension difficulty, set the standards for training and selection referring to examine judges’ ratings rater severity and inter-rater reliability, improve the assessment process based on a variety of deviation analysis, and finally promote scientific, standardized and precise development of evaluation system of assessment center.

Key words: leaderless group discussion, many-facet Rasch model, item response theory, personnel assessment

姚若松;赵葆楠;刘泽;苗群鹰. (2013). 无领导小组讨论的多侧面Rasch模型应用. 心理学报, 45(9), 1039-1049.

YAO Ruosong;ZHAO Baonan;LIU Ze;MIAO Qunying. (2013). The Application of Many-Facet Rasch Model in Leaderless Group Discussion. Acta Psychologica Sinica, 45(9), 1039-1049.

[1]	付颜斌, 陈琦鹏, 詹沛达. 问题解决任务中行动序列的二分类建模：单/两参数行动序列模型[J]. 心理学报, 2023, 55(8): 1383-1396.
[2]	童昊, 喻晓锋, 秦春影, 彭亚风, 钟小缘. 多级计分测验中基于残差统计量的被试拟合研究[J]. 心理学报, 2022, 54(9): 1122-1136.
[3]	任赫, 陈平. 两种新的多维计算机化分类测验终止规则[J]. 心理学报, 2021, 53(9): 1044-1058.
[4]	罗芬, 王晓庆, 蔡艳, 涂冬波. 基于基尼指数的双目标CD-CAT选题策略[J]. 心理学报, 2020, 52(12): 1452-1465.
[5]	陈平. 两种新的计算机化自适应测验在线标定方法[J]. 心理学报, 2016, 48(9): 1184-1198.
[6]	孟祥斌;陶剑;陈莎莉. 四参数Logistic模型潜在特质参数的 Warm加权极大似然估计[J]. 心理学报, 2016, 48(8): 1047-1056.
[7]	汪文义; 宋丽红;丁树良. 复杂决策规则下MIRT的分类准确性和分类一致性[J]. 心理学报, 2016, 48(12): 1612-1624.
[8]	詹沛达;陈平;边玉芳. 使用验证性补偿多维IRT模型进行认知诊断评估[J]. 心理学报, 2016, 48(10): 1347-1356.
[9]	詹沛达;李晓敏;王文中;边玉芳;王立君. 多维题组效应认知诊断模型[J]. 心理学报, 2015, 47(5): 689-701.
[10]	杜文久;周娟;李洪波. 二参数逻辑斯蒂模型项目参数的估计精度[J]. 心理学报, 2013, 45(10): 1179-1186.
[11]	刘红云,李冲,张平平,骆方. 分类数据测量等价性检验方法及其比较：项目阈值(难度)参数的组间差异性检验[J]. 心理学报, 2012, 44(8): 1124-1136.
[12]	杜文久;肖涵敏. 多维项目反应理论等级反应模型[J]. 心理学报, 2012, 44(10): 1402-1407.
[13]	刘红云,骆方,王玥,张玉. 多维测验项目参数的估计：基于SEM与MIRT方法的比较[J]. 心理学报, 2012, 44(1): 121-132.
[14]	涂冬波,蔡艳,戴海琦,丁树良. 多维项目反应理论：参数估计及其在心理测验中的应用[J]. 心理学报, 2011, 43(11): 1329-1340.
[15]	吴,锐,丁树良,甘登文. 含题组的测验等值[J]. 心理学报, 2010, 42(03): 434-442.

无领导小组讨论的多侧面Rasch模型应用

The Application of Many-Facet Rasch Model in Leaderless Group Discussion

PDF (PC)

可视化

English Version

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价