ISSN 0439-755X
CN 11-1911/B
主办:中国心理学会
   中国科学院心理研究所
出版:科学出版社

心理学报 ›› 2011, Vol. 43 ›› Issue (04): 453-461.

• • 上一篇    下一篇

单维测验合成信度三种区间估计的比较

叶宝娟;温忠麟   

  1. (1华南师范大学心理应用研究中心, 广州 510631) (2香港考试及评核局, 香港)
  • 收稿日期:2010-11-23 修回日期:1900-01-01 发布日期:2011-04-30 出版日期:2011-04-30
  • 通讯作者: 温忠麟

A Comparison of Three Confidence Intervals of Composite Reliability of A Unidimensional Test

YE Bao-Juan;WEN Zhong-Lin   

  1. (1 Center for Studies of Psychological Application, South China Normal University, Guangzhou 510631, China)
    (2 Hong Kong Examinations and Assessment Authority, Hong Kong, China)
  • Received:2010-11-23 Revised:1900-01-01 Online:2011-04-30 Published:2011-04-30
  • Contact: WEN Zhong-Lin

摘要: 已有许多研究建议使用合成信度来估计测验信度, 并报告其置信区间。有三种方法或途径可以计算单维测验合成信度的置信区间, 包括Bootstrap法、Delta法和直接用统计软件(如LISREL)输出的标准误进行计算。本文通过模拟研究进行比较, 发现Delta法与Bootstrap法得到的置信区间相当接近, 但用LISREL输出的标准误计算的与Bootstrap法得到的结果相差很大。推荐用Delta法估计合成信度的置信区间(使用Mplus容易实现), 但不能直接用LISREL输出的标准误来计算。举例说明了如何计算单维测验的合成信度以及用Delta法计算其置信区间。

关键词: 合成信度, 置信区间, Bootstrap法, Delta法, LISREL

Abstract: The widely used coefficient a may underestimate or overestimate reliability when its premise assumption is violated and therefore is not a good index to evaluate reliability. Composite reliability can better estimate reliability by using confirmatory factor analysis (see e.g., Bentler, 2009; Green & Yang, 2009). As is well known, point estimate contains limited information about a population parameter and could not give how far it could be from the population parameter. The confidence interval of the parameter could provide more information. In evaluating the quality of a test, the confidence interval of composite reliability has received more and more attention in recent years.
There are three approaches to estimate the confidence interval of composite reliability of a unidimensional test: Bootstrap method, Delta method and directly using the standard error in the output of an SEM software (e.g., LISREL). Each of the three approaches produces a standard error of composite reliability. Then the confidence interval can be easily formed based on the standard error. Bootstrap method provides an empirical result of the standard error of composite reliability and is the most credible, but the method needs data simulation technique and is not be easily mastered by general applied researchers. Delta method computes the standard error of composite reliability by approximate calculation, and the method is much simpler than Bootstrap method. LISREL software can directly give the standard error of composite reliability, and this method is the simplest among the three methods.
To evaluate the standard errors of composite reliability obtained by Delta method and LISREL software, we compared them with that obtained by Bootstrap method, because the latter can be treated as the true value in theory. A simulation study was conducted to the comparison. Four factors were considered in the simulation design: (a) the number of items on each test (k=3, 6, 10, and 15); (b) factor loading (high, medium and low); (c) sample size (N=100, 300, 500, and 1000); (d) the method for calculating the standard error of composite reliability (Bootstrap, Delta, and LISREL). Totally, 48 treatment conditions were generated in terms of the above 4-factor simulation design (i.e., 48=4×3×4×3).
The simulation results indicated that the difference between the standard errors obtained by Delta method and Bootstrap method was ignorable under each designed condition, except when sample size was small (less than 200)and standardized factor loadings were not high (less than 0.7). However, there was substantial difference between the standard errors obtained by LISREL software directly and Bootstrap method under each designed condition. Noting that the result from Bootstrap method can be treated as the true value, we recommended that Delta method could be adopted to estimate the confidence interval of composite reliability of a unidimensional test. At the same time we revealed that the standard error directly obtained by LISREL software is severely biased.
We used an example of a unidimensional test to illustrate how to calculate composite reliability and its confidence interval by using Delta method based on LISREL output. We also showed that the same results could be directly obtained by using SEM software Mplus that automatically calculates the confidence interval with Delta method and presents the confidence interval.

Key words: composite reliability, confidence interval, Bootstrap method, Delta method, LISREL