ISSN 0439-755X
CN 11-1911/B
主办:中国心理学会
   中国科学院心理研究所
出版:科学出版社

心理学报 ›› 2025, Vol. 57 ›› Issue (11): 2001-2021.doi: 10.3724/SP.J.1041.2025.2001 cstr: 32110.14.2025.2001

• 人工智能心理与治理专刊 • 上一篇    下一篇

数智时代工作紧张人群阈下抑郁的影响因素:基于机器学习的证据

邓丽芳1(), 裴蓓1, 高天艾2   

  1. 1 北京航空航天大学人文社会科学学院(公共管理学院), 心理与行为研究中心, 北京 100191
    2 国家体育总局体育科学研究所, 北京 100061
  • 收稿日期:2024-02-01 发布日期:2025-09-24 出版日期:2025-11-25
  • 通讯作者: 邓丽芳, E-mail: lifangdeng@buaa.edu.cn
  • 基金资助:
    北京自然科学基金面上项目(7202101)

The factors affecting subthreshold depression for people with occupational stress in the era of digital intelligence: Machine learning-based evidence

DENG Lifang1(), PEI Bei1, GAO Tian’ai2   

  1. 1 Research Center for Psychology and Behavior, School of Humanities and Social Sciences, Beihang University, Beijing, 100191, China
    2 China Institute of Sport Science, Beijing, 100061, China
  • Received:2024-02-01 Online:2025-09-24 Published:2025-11-25

摘要:

阈下抑郁作为抑郁症的前兆, 对抑郁风险有重要警示作用。本研究结合扎根理论和机器学习方法, 构建了中国工作紧张人群阈下抑郁影响因素的机器学习模型, 并对各因素之间的关联结构, 因素词频、职业差异和年代变化进行了分析。发现:(1)工作紧张人群阈下抑郁的表征包含5个主范畴, 其中意志减弱词频占比最高; 影响因素包含8个主范畴, 工作因素、评价适应和自主选择词频占比分列前三位。(2) 8类影响因素均与阈下抑郁表征紧密关联; 基于关联规则的网络分析结果显示出“自我认知”、“行为自由”、“环境适应”和“一般社交”副范畴的重要作用。(3)医护人员的躯体因素与其他职业具有显著差异。(4) 2011至2023年间, 工作因素呈上升趋势, 人际因素呈下降趋势。(5)基于BERT的机器学习模型可以用于判别工作紧张人群阈下抑郁的影响因素, 且判别结果能够对工作紧张人群的抑郁风险做出较好预测, 采用XGBoost算法的预测准确率为81.58%, 其中对阈下抑郁组预测的F1分数为0.90, AUC值为0.93。本文拓展了阈下抑郁在工作紧张人群中的研究, 为工作紧张人群阈下抑郁的识别与防治提供了新视角。

关键词: 阈下抑郁, 工作紧张, 机器学习

Abstract:

Depression is one of the most common psychological problems, and subthreshold depression, as a precursor to its occurrence, plays a vital warning role in the prevention and treatment of depression. However, there is currently a lack of in-depth analysis of the representations and influencing factors of subthreshold depression in people with occupational stress in China (SDPOSC). This study integrated grounded-theory research with machine learning methods to explore the manifestations and influencing factors of subthreshold depression among Chinese working population under stress. BERT technology was utilized to construct a discriminant model for identifying the factors influencing subthreshold depression within this population, and the model's effectiveness was subsequently studied and confirmed.

This research is composed of two studies. The first study involved the analysis of network texts harvested through web crawling, employing grounded theory for coding to establish a framework of factors influencing subthreshold depression in individuals under stress. The correlation structure between influencing factors and their representations was further explored, along with association rule analysis between influencing factors. Word frequency analysis and occupational difference tests were then conducted to analyze the characteristics of the influencing factors. Mann-Kendall test was subsequently applied to analyze the development trend of influencing factors. Based on the analysis of online text, the second study constructed a machine learning model using BERT technology. the influencing factors of subthreshold depression are distinguished and the effectiveness of the model is subsequently confirmed.

Results showed that (1) Manifestations of subthreshold depression in people with occupational stress have five categories, with weakened willpower the highest frequency of expression and daily behavioral changes the lowest frequency. (2) Main influencing factors consist of eight categories, with work factors, evaluation adaptation, and autonomous selection the highest frequency; and stress events the lowest. (3) Eight types of influencing factors were closely related to subthreshold depression symptoms, with stressful event the best single predictor. Network analysis based on association rules revealed that "self-awareness, " "behavioral freedom, " "environmental adaptation, " and "general social interaction" are the most important subcategories. (4) Healthcare professionals had a significant difference in somatic factors compared to other professions, identified by a difference test of word frequency distribution. (5) Words related to work factors has shown an upward trend from 2011 to 2023, while those related to interpersonal factors have shown a downward trend. (6) A BERT-based machine learning model is obtained and it works in identifying influencing factors of subthreshold depression in populations experiencing work-related stress, in particular, the XGBoost algorithm achieved a prediction accuracy of 81.58%, with particularly strong performance in subthreshold depression detection (F1-score = 0.90, AUC = 0.93).

This study provided an in-depth analysis of the representations and influencing factors of SDPOSC, enriching the localization research of subthreshold depression from an empirical perspective. Furthermore, a machine learning model by BERT can be utilized in subsequent research. The study of SDPOSC can help identify their depression risk and has important theoretical and practical significance for the prevention and treatment of SDPOSC.

Key words: subthreshold depression, occupational stress, machine learning

中图分类号: