ISSN 1671-3710
CN 11-4766/R
主办:中国科学院心理研究所
出版:科学出版社

心理科学进展 ›› 2024, Vol. 32 ›› Issue (4): 700-714.doi: 10.3724/SP.J.1042.2024.00700

• 研究方法 • 上一篇    

密集追踪研究中测验信度的估计:多层结构和动态特性的视角

罗晓慧, 刘红云()   

  1. 北京师范大学心理学部, 应用实验心理北京市重点实验室, 心理学国家级实验教学示范中心(北京师范大学), 北京 100875
  • 收稿日期:2023-08-29 出版日期:2024-04-15 发布日期:2024-02-29
  • 通讯作者: 刘红云 E-mail:hyliu@bnu.edu.cn
  • 基金资助:
    国家自然科学基金项目(32071091)

Estimating test reliability of intensive longitudinal studies: Perspectives on multilevel structure and dynamic nature

LUO Xiaohui, LIU Hongyun()   

  1. Beijing Key Laboratory of Applied Experimental Psychology, National Demonstration Center for Experimental Psychology Education (Beijing Normal University), Faculty of Psychology, Beijing Normal University, Beijing 100875, China
  • Received:2023-08-29 Online:2024-04-15 Published:2024-02-29
  • Contact: LIU Hongyun E-mail:hyliu@bnu.edu.cn

摘要:

随着密集追踪研究在心理学等社会科学领域的广泛运用, 密集追踪情境中测验信度的估计也受到越来越多研究者的关注。早期沿用横断研究中信度估计思想或基于概化理论的信度估计方法存在诸多局限, 并不适用于密集追踪的情境。针对密集追踪数据的多层结构和动态特性这两大特点, 可基于多层验证性因子分析、动态因子分析和动态结构方程模型估计密集追踪研究中测验的信度。通过实证数据的演示与比较, 讨论三种估计方法的特点和适用情境。未来研究可基于其它密集追踪模型探讨测验信度的估计, 也应重视测验信度的检验与报告。

关键词: 密集追踪研究, 信度, 多层结构, 动态特性, 动态结构方程模型

Abstract:

Intensive longitudinal data are increasingly used in empirical research in psychology and other social sciences. While such data provide a wealth of information for exploring the dynamics of variables, they also pose challenges for estimating the reliability of tests used in intensive longitudinal studies. A large number of previous intensive longitudinal studies have failed to reasonably and adequately assess the reliability of their tests. Earlier reliability estimation methods drawn from cross-sectional studies or based on generalizability theory (GT) have many limitations and are not applicable to intensive longitudinal data. Therefore, the main purpose of this study is to systematically introduce the reliability estimation methods suitable for intensive longitudinal studies and to provide methodological guidance for applied researchers.

Intensive longitudinal data have two main characteristics: multilevel structure and dynamic nature. The multilevel structure refers to the data structure of multiple repeated measures (level 1) nested within individuals (level 2); and the dynamic nature refers to the temporal dependency between consecutive observations. From the perspective of multilevel structure and dynamic nature, we first introduce two reliability estimation methods that consider one of these two characteristics. The reliability estimation method focusing on the multilevel structure is based on multilevel confirmatory factor analysis (MCFA). It can estimate test reliability at the between- and within-person levels separately. However, this method does not take into account individual differences in reliability and temporal dependency between consecutive observations in intensive longitudinal data. The reliability estimation method focusing on the dynamic nature is based on dynamic factor analysis (DFA). It establishes a dynamic factor model for each individual, and includes autoregressive processes to reflect the dynamic nature of intensive longitudinal data. It can estimate person-specific reliabilities, helping researchers to better understand individual differences in test reliability. However, this method confounds the trait and state components of observed scores, which may lead to biased estimates of person-specific reliabilities. In addition, it cannot estimate between-person reliabilities.

Considering the limitations of the above two methods, we next present another reliability estimation method (i.e., a method based on dynamic structural equation modeling (DSEM)) that integrates the multilevel structure and dynamic nature of intensive longitudinal data. This method builds factor models at the between- and within-person levels to estimate reliabilities at both levels, and includes autoregressive processes to reflect the temporal dependency between consecutive observations. It also allows for random effects of model parameters (e.g., factor loadings and autoregressive effects) to enable estimation of person-specific reliabilities. In summary, this method can simultaneously reflect the multilevel structure and dynamic nature of intensive longitudinal data, and examine individual differences in test reliability, which helps researchers to better estimate and understand reliability in intensive longitudinal studies.

Finally, we demonstrate and compare three reliability estimation methods with empirical data on daily procrastination. We summarize and discuss the main features and applicable contexts of these methods, and recommend researchers to examine the reliability of each item and consider individual differences in test reliability in intensive longitudinal studies. We also provide practical suggestions on how to better report reliabilities in applied research. Future research could explore the reliability estimation methods based on other models, and should also pay more attention to the testing and reporting of test reliability in intensive longitudinal studies.

This paper provides a systematic review of reliability estimation methods in intensive longitudinal studies from the perspectives of multilevel structure and dynamic nature. We illustrate the differences between the methods in estimating three types of reliabilities (i.e., between-person reliabilities, within-person reliabilities, and person-specific reliabilities), and interpret the reliability results at both the item and scale levels, which helps researchers to understand and evaluate the test reliability in intensive longitudinal studies more comprehensively. In addition, by analyzing and comparing the main features of different methods, we point out the contexts in which they are applicable and provide practical suggestions for estimating and reporting reliability in intensive longitudinal studies.

Key words: intensive longitudinal study, reliability, multilevel structure, dynamic nature, dynamic structural equation modeling

中图分类号: