密集追踪干预研究设计中的建模及其样本量规划——基于动态结构方程模型

doi:10.3724/SP.J.1041.2026.0773

摘要/Abstract

摘要： 密集追踪干预研究具有生态效度高、能够提供实时和个性化干预等优势。然而, 目前常用的数据分析方法未能充分反映密集追踪数据的特点, 而先进的数据分析模型又缺乏与之匹配的样本量规划方法, 极大地限制了这种范式的推广应用。本文在两种典型的密集追踪干预实验设计——单臂设计和随机对照设计下, 基于动态结构方程模型, 结合检验力和效应量估计准确性, 采用模拟研究方法开展样本量规划, 并从第一类错误率等方面对两种设计进行综合比较, 最后提出了实验设计和样本量规划建议。

关键词: 密集追踪干预, 动态结构方程模型, 检验力分析, 效应量, 样本量规划

Abstract: Intensive longitudinal interventions (ILIs) have emerged as powerful tools for understanding, treating and preventing mental and behavioral disorders. However, most existing ILI studies rely on traditional analytic methods such as ANOVA or linear mixed models, which overlook both individual differences and the inherent autocorrelation structure of time-series data. Moreover, intervention effects are often evaluated only through changes in the mean level of key variables (e.g., anxiety). This study demonstrates how dynamic structural equation modeling (DSEM) can be used to analyze ILI data and evaluate intervention effects across three dimensions—mean, autoregression, and intra-individual variability (IIV)—for two types of intervention designs: single-arm trial (SAT) and randomized controlled trial (RCT). We conducted two simulation studies to examine sample size recommendations for DSEM-based ILI studies, considering both statistical power and accuracy in parameter estimation (AIPE). In a third simulation, we compared the type I error rates of SAT and RCT designs when natural temporal changes occurred in the control group. Finally, we illustrated sample size planning using empirical data from a pre-ILI study targeting appearance anxiety reduction.
Simulation Studies 1 and 2 investigated the power and AIPE across varying sample sizes, as well as the required sample size for both SAT and RCT designs. The effect sizes of intervention effects for mean, autoregression and IIV were fixed at the medium level. Two factors regarding sample size were manipulated: number of participants (N = 30, 60, 100,150, 200, 300,400), number of time-points (T = 10, 20, 40, 60, 80, 100). The data-generating models and fitted models were identical, with analysis conducted using Mplus 8.10 and Bayesian estimation. Model performance was assessed in terms of convergence rate, power and AIPE for intervention effects, as well as bias in the standard errors of the intervention effects. Simulation Study 3 assessed the type I error rate for both designs when changes in the control group were different from zero, indicating a change (on average) due to time. Last, the empirical study conducted sample size planning based on a pre-study aimed at reducing appearance anxiety using an ILI design.
The results are as following. First, all simulation conditions achieved satisfactory convergence. Second, statistical power increased and credible interval width decreased with larger N or T. However, a minimum of 60 participants was required to achieve adequate power (i.e., ≥0.8). The relative bias in intervention effect was generally small. Except in the SAT design, the intervention effects on autoregression and IIV were underestimated when the number of time-points was low (T = 10 or 20). While in the RCT design, the intervention effect on mean was underestimated when sample size in both levels were small (N = 30 or 60, T = 10). Bias in the standard error was also negligible across conditions. Third, a credible interval width contours plot were useful for determining sample size under both power- and AIPE-based criteria. were useful for determining sample size under both power- and AIPE-based criteria. Fourth, when natural mean-level changes occurred between pre- and post-intervention phases, the SAT design exhibited inflated type I error rates compared to the RCT design, especially with larger samples.
In conclusion, DSEM provides a flexible framework for analyzing ILI data by simultaneously capturing intervention effects on mean, autoregression, and IIV. Researchers should choose between SAT and RCT designs based on theoretical and practical considerations: RCTs offer stronger control for time-related confounds but require larger samples, whereas SATs are more suitable for small-sample or preliminary studies. For Monte Carlo-based sample size planning, accurate specification of true parameter values is critical; these should be derived from pre-studies, similar empirical data, or meta-analytic evidence whenever possible. When such information is unavailable, the procedures described in this study offer practical guidance.

Key words: intensive longitudinal intervention, dynamic structural equation modeling, power analysis, effect size, sample size planning

刘玥, 何月翎, 刘红云. (2026). 密集追踪干预研究设计中的建模及其样本量规划——基于动态结构方程模型. 心理学报, 58(4), 773-792.

LIU Yue, HE Yueling, LIU Hongyun. (2026). Data analysis and sample size planning for intensive longitudinal intervention studies using dynamic structural equation modeling. Acta Psychologica Sinica, 58(4), 773-792.

参考文献

[1] Albers, C., & Lakens, D. (2018). When power analyses based on pilot data are biased: Inaccurate effect size estimators and follow-up bias. Journal of Experimental Social Psychology, 74, 187-195.
[2] Arend, M. G., & Schäfer, T. (2019). Statistical power in two-level models: A tutorial based on Monte Carlo simulation.Psychological Methods, 24(1), 1-19.
[3] Aschenbrenner, A. J., & Jackson, J. J. (2024). High-frequency assessment of mood, personality, and cognition in healthy younger, healthy older and adults with cognitive impairment.Aging, Neuropsychology, and Cognition, 31(5), 914-931.
[4] Asparouhov T., Hamaker E. L., & Muthén B. (2018). Dynamic structural equation models.Structural Equation Modeling: A Multidisciplinary Journal, 25(3), 359-388.
[5] Baey, C., & Le Deley, M. C. (2011). Effect of a misspecification of response rates on type I and type II errors, in a phase II Simon design.European Journal of Cancer, 47(11), 1647-1652.
[6] Balaskas A., Schueller S. M., Cox A. L., & Doherty G. (2021). Ecological momentary interventions for mental health: A scoping review.PloS One, 16(3), e0248152.
[7] Bell I. H., Fielding-Smith S. F., Hayward M., Rossell S. L., Lim M. H., Farhall J., & Thomas N. (2018). Smartphone-based ecological momentary assessment and intervention in a coping-focused intervention for hearing voices (SAVVy): Study protocol for a pilot randomised controlled trial.Trials, 19(1), 262.
[8] Bradley, J. V. (1978). Robustness? British Journal of Mathematical and Statistical Psychology, 31(2), 144-152.
[9] Chen, M., & Zhou, P. (2017). Ecological momentary assessment and intervention of substance use.Advances in Psychological Science, 25(2), 247-252.
[陈明瑞, 周萍. (2017). 成瘾物质使用的生态瞬时评估与干预.心理科学进展, 25(2), 247-252.]
[10] Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). New York: Erlbaum.
[11] Cuijpers P., Pineda B. S., Quero S., Karyotaki E., Struijs S. Y., Figueroa C. A.,. Muñoz R. F. (2021). Psychological interventions to prevent the onset of depressive disorders: A meta-analysis of randomized controlled trials.Clinical Psychology Review, 83, 101955.
[12] Fang, Y., & Wang, L. (2024). Dynamic structural equation models with missing data: Data requirements on N and T.Structural Equation Modeling: A Multidisciplinary Journal, 31(5), 891-908.
[13] Hamaker E. L., Asparouhov T., & Muthén, B. (2021). Dynamic structural equation modeling as a combination of time series modeling, multilevel modeling, and structural equation modeling. In R. H. Hoyle (Ed.), Handbook of Structural Equation Modeling (2nd ed., p. 31). Guilford Press.
[14] Hawes T., Zimmer-Gembeck M. J., & Campbell S. M. (2020). Unique associations of social media use and online appearance preoccupation with depression, anxiety, and appearance rejection sensitivity.Body Image, 33, 66-76.
[15] Hawker C. O., Merkouris S. S., Youssef G. J., & Dowling N. A. (2021). A smartphone-delivered ecological momentary intervention for problem gambling (GamblingLess: Curb Your Urge): Single-arm acceptability and feasibility trial.Journal of Medical Internet Research, 23(3), e25786.
[16] Hedeker D., Mermelstein R. J., & Demirtas H. (2008). An application of a mixed‐effects location scale model for analysis of ecological momentary assessment (EMA) data.Biometrics, 64(2), 627-634.
[17] Heron, K. E., & Smyth, J. M. (2010). Ecological momentary interventions: Incorporating mobile technology into psychosocial and health behaviour treatments.British Journal of Health Psychology, 15(1), 1-39.
[18] Hoffman, L., & Walters, R. W. (2022). Catching Up on Multilevel Modeling.Annual Review of Psychology, 73, 659-689.
[19] Hu C., Wang F., Guo J., Song M., Sui J., & Peng K. (2016). The replication crisis in psychological research.Advances in Psychological Science, 24(9), 1504-1518.
[胡传鹏, 王非, 过继成思, 宋梦迪, 隋洁, 彭凯平. (2016). 心理学研究中的可重复性问题: 从危机到契机.心理科学进展, 24(9), 1504-1518.]
[20] Kenny, D. A., & Judd, C. M. (1986). Consequences of violating the independence assumption in analysis of variance.Psychological Bulletin, 99(3), 422-431.
[21] Kowialiewski, B. (2025). The power of effect size stabilization.Behavior Research Methods, 57(1), 7.
[22] Kruschke, J. K., & Liddell, T. M. (2018). The Bayesian new statistics: Hypothesis testing, estimation, meta-analysis, and power analysis from a Bayesian perspective.Psychonomic Bulletin & Review, 25(1), 178-206.
[23] Lafit G., Sels L., Adolf J. K., Loeys T., & Ceulemans E. (2022). PowerLAPIM: An application to conduct power analysis for linear and quadratic longitudinal actor-partner interdependence models in intensive longitudinal dyadic designs.Journal of social and Personal Relationships, 39(10), 3085-3115.
[24] Li Y., Williams L., Muth C., Heshmati S., Chow S. M., & Oravecz Z. (2024). A growth of hierarchical autoregression model for capturing individual differences in changes of dynamic characteristics of psychological processes.Structural Equation Modeling: A Multidisciplinary Journal, 32(2), 237-250.
[25] Liu Y., Xu L., Liu H., Han Y., You X., & Wan Z. (2024). Confidence interval width contours: Sample size planning for linear mixed-effects models.Acta Psychologica Sinica, 56(1), 124-138.
[刘玥, 徐雷, 刘红云, 韩雨婷, 游晓锋, 万志林. (2024). 置信区间宽度等高线图在线性混合效应模型样本量规划中的应用.心理学报, 56(1), 124-138.]
[26] Mair J. L., Hayes L. D., Campbell A. K., Buchan D. S., Easton C., & Sculthorpe N. (2022). A personalized smartphone-delivered just-in-time adaptive intervention (JitaBug) to increase physical activity in older adults: Mixed methods feasibility study.JMIR Formative Research, 6(4), e34662.
[27] Maxwell S. E., Kelley K., & Rausch J. R. (2008). Sample size planning for statistical power and accuracy in parameter estimation.Annual Review of Psychology, 59, 537-563.
[28] Muthén, L. K., & Muthén, B. O. (1998-2017). Mplus user's guide. Muthén and Muthén.
[29] Nosek B. A., Hardwicke T. E., Moshontz H., Allard A., Corker K. S., Dreber A.,. Vazire S. (2022). Replicability, robustness, and reproducibility in psychological science.Annual Review of Psychology, 73, 719-748.
[30] Pek, J., & Park, J. (2019). Complexities in power analysis: Quantifying uncertainties with a Bayesian-classical hybrid approach.Psychological Methods, 24(5), 590-605.
[31] Rauschenberg C., Boecking B., Paetzold I., Schruers K., Schick A., van Amelsvoort T., & Reininghaus U. (2021). A compassion-focused ecological momentary intervention for enhancing resilience in help-seeking youth: uncontrolled pilot study.JMIR Mental Health, 8(8), e25650.
[32] Reininghaus U., Daemen M., Postma M. R., Schick A., Hoes-van der Meulen I., Volbragt N.,. van Amelsvoort T. (2024). Transdiagnostic ecological momentary intervention for improving self-esteem in youth exposed to childhood adversity: The SELFIE randomized clinical trial.JAMA Psychiatry, 81(3), 227-239.
[33] Rights, J. D., & Sterba, S. K. (2019). Quantifying explained variance in multilevel models: An integrative framework for defining R-squared measures.Psychological Methods, 24(3), 309-338.
[34] Schueller S. M., Aguilera A., & Mohr D. C. (2017). Ecological momentary interventions for depression and anxiety.Depression and Anxiety, 34(6), 540-545.
[35] Schultzberg, M., & Muthén, B. (2018). Number of subjects and time points needed for multilevel time-series analysis: A simulation study of dynamic structural equation modeling.Structural Equation Modeling: A Multidisciplinary Journal, 25(4), 495-515.
[36] Sherwood, S. N. (2022). Feasibility and efficacy of virtual darkness in reducing intra-individual sleep variability among young adults with insomnia [Unpublished Doctoral dissertation]. University of Nevada, Las Vegas.
[37] Shrier, L. A., & Spalding, A. (2017). “Just take a moment and breathe and think”: Young women with depression talk about the development of an ecological momentary intervention to reduce their sexual risk.Journal of Pediatric and Adolescent Gynecology, 30(1), 116-122.
[38] Smith, K. E., & Juarascio, A. (2019). From ecological momentary assessment (EMA) to ecological momentary intervention (EMI): Past and future directions for ambulatory assessment and interventions in eating disorders.Current Psychiatry Reports, 21(7), 53.
[39] Usami, S. (2020). Confidence interval‐based sample size determination formulas and some mathematical properties for hierarchical data.British Journal of Mathematical and Statistical Psychology, 73(S1), 1-31.
[40] Wang, L. P., & Maxwell, S. E. (2015). On disaggregating between-person and within-person effects with longitudinal data using multilevel models.Psychological Methods, 20(1), 63-83.
[41] Wilhelm P., Perrez M., & Pawlik, K. (2012). Conducting research in daily life: A historical review In M R Mehl & T S Conner (Eds), Handbook of research methods for studying daily life (pp 62-86) New York: Guilford Press.
[42] Wright C., Dietze P. M., Agius P. A., Kuntsche E., Livingston M., Black O. C., … Lim M. S. (2018). Mobile phone-based ecological momentary intervention to reduce young adults' alcohol use in the event: A three-armed randomized controlled trial.JMIR Mhealth Uhealth, 6(7), e149.
[43] Yi, Z. (2020). Intensive longitudinal data analyses and sample size considerations in intervention studies with dynamic structural equation modeling [Unpublished Doctoral dissertation]. University of South Florida.
[44] Zhang W., Xu L., Pan X., Yao L., Zhong W., & Li J. (2024). Application progress of ecological momentary intervention in health behavior promotion.Journal of Nursing Science, 39(2), 116-121.
[张祎, 徐岚, 潘习, 姚林, 仲伟莹, 李佳璇. (2024). 生态瞬时干预在健康行为促进中的应用进展.护理学杂志, 39(2), 116-121.]
[45] Zhou L., Wang M., & Zhang Z. (2021). Intensive longitudinal data analyses with dynamic structural equation modeling.Organizational Research Methods, 24(2), 219-250.
[46] Zimmer, F., & Debelak, R. (2025). Simulation-based design optimization for statistical power: Utilizing machine learning.Psychological Methods, 30(3), 513-536.