ISSN 0439-755X
CN 11-1911/B
主办:中国心理学会
   中国科学院心理研究所
出版:科学出版社

心理学报 ›› 1998, Vol. 30 ›› Issue (2): 211-218.

• • 上一篇    下一篇

概化理论在作文评分中的应用研究

刘远我;张厚粲   

  1. 人事部人事考试中心,北京师范大学
  • 出版日期:1998-06-25 发布日期:1998-06-25
  • 通讯作者: 刘远我

APPLICATION OF GENERALIZABILITY THEORY IN COMPOSITION SCORING

Liu Yuanwo (Personnel Testing Authorities,Minisity of Personnel P.R.C.,Beijing 100054) Zhang Houcan (Beijing Normal University. Beijing 100875)   

  • Published:1998-06-25 Online:1998-06-25

摘要: 概化理论是现代心理测量理论之一。该文简要地介绍了该理论的基本思想,并用此理论对作文评分的误差控制问题进行了应用性探讨。研究中请6位评分员对20名学生每人三种文体的作文用分项评分法进行评定。然后用GENOVA软件估计了作文评分中的评分员效应和题目效应,并对各种误差构成进行了分析比较。结果表明:在作文评分中,评分员效应最大,题目效应不明显。同时还发现,不同文体对评分误差有重要影响,议论文的评分误差最大。在作文能力的三要素中,以内容的评分误差最大,结构的评分误差最小。最后,本文还对传统评分者一致性系数(W系数)的不足以及概化理论中系数的优越性进行了讨论。

关键词: 概化理论, 作文评分, 误差控制, W系数

Abstract: Generalizability Theory (GT) is a new trend in psychological measurement. First the basic ideas and methods of Generalizability Theory was introduced, Then an exploratory application of it was conducted on the problem of scoring in Chinese composition. Twenty high school students were asked to write three compositions of different forms in one month, all of the compositions were rated by six raters. Data was finally analysed by GENOVA. The results indicated that the influence of raters was large, while item effect was small It was also found that different forms of compositions had different scoring error, essay with problem discussion was likely to have the largest scoring error. As to the three essential factors of writing ability, the largest scoring error happened on the content, while the smallest scoring error happened on the structure. Furthermore, the author pointed out some defficiencies of the Kendall coefficient of concordance (W coefficient) and suggest that phi coefficient is a more effective index as rater consistency reliability.

Key words: Generalizability Theory, composition scoring, error controlling, W coefficient