Automatic Item Generation (AIG) technology has emerged as a transformative approach to address fundamental challenges in psychological and educational test development, including high item creation costs, low development efficiency, difficulty in maintaining large item banks, and security vulnerabilities from item exposure in high-stakes testing contexts. This review systematically examines the technological evolution from rule-driven methods to large language model (LLM)-based approaches and analyzes contemporary implementation challenges with corresponding solutions.
The technological evolution of AIG encompasses three distinct phases. Early rule-based methods, including cognitive design systems, item modeling approaches, and ontology-based techniques, relied heavily on expert knowledge and predefined templates. While producing structurally sound items, these approaches required substantial professional input and lacked flexibility in handling complex linguistic phenomena. Corpus-based methods subsequently introduced statistical approaches that leveraged large-scale language data to enhance linguistic authenticity, though they remained constrained by corpus coverage and domain specificity.

Deep learning technologies marked a paradigm shift in AIG capabilities. Classical techniques such as word embeddings, recurrent neural networks (RNN), long short-term memory (LSTM), and sequence-to-sequence (Seq2Seq) models progressively improved semantic representation and text coherence. The introduction of the Transformer architecture, with its self-attention mechanism, revolutionized natural language processing by effectively capturing long-range dependencies. Building on this foundation, pre-trained large language models such as BERT, T5, and the GPT series learned rich language representations from massive text corpora, enabling sophisticated understanding and generation capabilities that fundamentally transformed AIG approaches. Contemporary LLM-based AIG systems employ domain fine-tuning and prompt engineering strategies, with knowledge enhancement technologies, particularly retrieval-augmented generation (RAG) and knowledge graphs, addressing professional knowledge accuracy by integrating external structured knowledge sources. LLM-based AIG demonstrates significant advantages over traditional methods, including dramatically improved generation efficiency, enhanced linguistic fluency and diversity, reduced dependence on extensive manual template creation, and the ability to generate contextually rich items across multiple domains and languages.
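To illustrate the prompt engineering strategy in concrete terms, the following minimal Python sketch fills an item specification into a prompt template and requests a draft item from an LLM. The OpenAI SDK, the model name, and the prompt wording are illustrative assumptions rather than a prescribed pipeline.

```python
# Minimal prompt-engineering sketch for LLM-based item generation.
# Assumes the OpenAI Python SDK; model name and prompt wording are illustrative.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

ITEM_PROMPT = """You are an experienced test developer.
Write one multiple-choice item measuring: {construct}
Target difficulty: {difficulty}
Requirements:
- one unambiguously correct option and three plausible distractors
- avoid cultural or regional references
Return the stem, options A-D, and the answer key."""

def generate_item(construct: str, difficulty: str) -> str:
    """Fill the prompt template and request a single draft item."""
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user",
                   "content": ITEM_PROMPT.format(construct=construct,
                                                 difficulty=difficulty)}],
        temperature=0.7,
    )
    return response.choices[0].message.content

if __name__ == "__main__":
    print(generate_item("proportional reasoning", "moderate"))
```

In practice, the template would encode a fuller item model (construct definition, format constraints, and exclusion rules) agreed upon by content experts before any large-scale generation.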
Despite technological advances, LLM-based AIG implementation faces seven core challenges across quality assurance, functional expansion, and practical application dimensions:
First, content authenticity and professional accuracy issues, particularly in specialized domains like medicine and law where LLMs exhibit “hallucination” phenomena. Solutions include employing larger-scale models (e.g., GPT-4 over GPT-3.5), implementing human-AI collaborative frameworks where experts guide initial generation and validate outputs, deploying RAG systems with domain-specific knowledge bases and sophisticated retrieval mechanisms, and establishing comprehensive expert review protocols covering content validity, professional accuracy, conceptual depth, and structural integrity.
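A simplified sketch of the RAG strategy is given below: the item specification retrieves supporting passages from a domain knowledge base, and the generation prompt is explicitly restricted to that evidence. TF-IDF similarity stands in here for a production embedding index, and the clinical snippets are illustrative placeholders.

```python
# Simplified retrieval-augmented generation (RAG) sketch for a domain knowledge base.
# TF-IDF similarity is a stand-in for a production embedding index; the knowledge
# snippets and prompt wording are illustrative placeholders.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

KNOWLEDGE_BASE = [
    "Warfarin dosing is monitored with the INR; the usual therapeutic range is 2.0-3.0.",
    "Beta-blockers reduce myocardial oxygen demand by lowering heart rate and contractility.",
    "Metformin is contraindicated in severe renal impairment because of lactic acidosis risk.",
]

def retrieve(query: str, k: int = 2) -> list[str]:
    """Return the k knowledge snippets most similar to the item specification."""
    vectorizer = TfidfVectorizer()
    matrix = vectorizer.fit_transform(KNOWLEDGE_BASE + [query])
    scores = cosine_similarity(matrix[-1], matrix[:-1]).ravel()
    top = scores.argsort()[::-1][:k]
    return [KNOWLEDGE_BASE[i] for i in top]

def build_prompt(specification: str) -> str:
    """Ground the generation prompt in retrieved domain evidence to curb hallucination."""
    evidence = "\n".join(f"- {s}" for s in retrieve(specification))
    return (f"Using ONLY the evidence below, write one clinical multiple-choice item.\n"
            f"Evidence:\n{evidence}\n"
            f"Item specification: {specification}\n"
            f"Flag any required fact missing from the evidence instead of inventing it.")

print(build_prompt("anticoagulation monitoring in atrial fibrillation"))
```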
Second, ethical responsibility, cultural fairness, and construct validity concerns, especially in personality and social attitude assessments where models may perpetuate training data biases. Strategies encompass data augmentation to balance cultural representation, multi-stage psychometric validation progressing from expert screening through pilot testing to large-scale validation, Differential Item Functioning (DIF) analysis for cross-group measurement equivalence, and adoption of Exploratory Structural Equation Modeling (ESEM) over traditional confirmatory factor analysis for AI-generated items.
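As an example of the DIF analysis step, the sketch below screens a single item with the common logistic regression approach, comparing a model containing only the matching score against one that adds group and group-by-score terms. The data are simulated, and the procedure is a minimal illustration rather than a full psychometric workflow.

```python
# Minimal logistic-regression DIF screening sketch on simulated responses.
# Sample size, group effect, and the DIF magnitude are simulated for illustration.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf
from scipy.stats import chi2

rng = np.random.default_rng(0)
n = 1000
ability = rng.normal(size=n)
group = rng.integers(0, 2, size=n)              # 0 = reference group, 1 = focal group
logit = 1.2 * ability - 0.5 - 0.6 * group       # item simulated with uniform DIF
resp = rng.binomial(1, 1 / (1 + np.exp(-logit)))
data = pd.DataFrame({"resp": resp, "score": ability, "group": group})

# Nested models: matching variable only vs. adding group and group-by-score terms.
base = smf.logit("resp ~ score", data).fit(disp=False)
full = smf.logit("resp ~ score + group + score:group", data).fit(disp=False)

# Likelihood-ratio test with 2 df; a significant result flags the item for review.
lr_stat = 2 * (full.llf - base.llf)
print(f"LR = {lr_stat:.2f}, p = {chi2.sf(lr_stat, df=2):.4f}")
```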
Third, single-modality limitations restricting assessments to text-based items, excluding visual-spatial reasoning and multimodal content. Multimodal large language models (MLLMs) such as GPT-4o and Phi-3-vision offer solutions, though they require specialized prompt engineering frameworks that translate assessment requirements into multimodal representations while maintaining psychometric properties and disciplinary standards.
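A hedged sketch of such a multimodal prompt is shown below, assuming the OpenAI Python SDK and GPT-4o; the figure URL is a placeholder for an assessment stimulus, and the instruction wording would in practice be derived from the assessment framework.

```python
# Sketch of a multimodal item-generation prompt with an image stimulus.
# Assumes the OpenAI Python SDK; the figure URL and prompt wording are placeholders.
from openai import OpenAI

client = OpenAI()
FIGURE_URL = "https://example.org/stimulus_figure.png"  # placeholder stimulus image

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{
        "role": "user",
        "content": [
            {"type": "text",
             "text": ("Write one multiple-choice item assessing visual-spatial reasoning "
                      "based on the attached figure. The stem must require interpreting "
                      "the figure, not recalling facts. Provide options A-D and the key.")},
            {"type": "image_url", "image_url": {"url": FIGURE_URL}},
        ],
    }],
)
print(response.choices[0].message.content)
```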
Fourth, insufficient capability for generating open-ended items assessing higher-order thinking skills. Solutions involve integrating Bloom’s taxonomy or similar frameworks into generation prompts, developing synchronized scoring rubrics and exemplar responses, and creating evaluation criteria encompassing cognitive complexity, response diversity, and scoring feasibility.
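The sketch below illustrates one way a Bloom level and a synchronized rubric request might be embedded in a generation prompt; the level descriptors, verbs, and rubric dimensions are assumptions for demonstration only.

```python
# Illustrative prompt template that pins an open-ended item to a Bloom's taxonomy level
# and requests a synchronized scoring rubric and exemplar response. Level descriptors
# and rubric dimensions are assumptions for demonstration.
BLOOM_VERBS = {
    "analyze": "differentiate, organize, attribute",
    "evaluate": "critique, judge, justify",
    "create": "design, construct, formulate",
}

OPEN_ENDED_PROMPT = """Generate one open-ended item for the topic: {topic}
Target cognitive level (Bloom): {level} (expected verbs: {verbs})
Also produce, in the same response:
1. An analytic scoring rubric with 3 performance levels covering
   cognitive complexity, use of evidence, and coherence of argument.
2. One exemplar full-credit response.
Do not produce an item answerable by simple recall."""

def build_open_ended_prompt(topic: str, level: str) -> str:
    """Assemble the generation prompt for a given topic and Bloom level."""
    return OPEN_ENDED_PROMPT.format(topic=topic, level=level,
                                    verbs=BLOOM_VERBS[level])

print(build_open_ended_prompt("causes of urban heat islands", "evaluate"))
```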
Fifth, quality control's reliance on manual review, which contradicts efficiency goals. Emerging intelligent evaluation systems demonstrate promise, though they require training that is independent of the generation models, domain-specific evaluation models, and integration of final human expert review within hybrid quality assurance systems.
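One possible shape of such a hybrid pipeline is sketched below: an evaluator model, chosen independently of the generator, scores each draft against explicit criteria, and only items meeting a threshold bypass immediate human review. The criteria, model names, threshold, and JSON schema are illustrative assumptions.

```python
# Sketch of a hybrid quality-assurance step: an evaluator model, separate from the
# generator, scores each draft item and routes low-scoring items to human review.
# The criteria, evaluator model, JSON keys, and threshold are illustrative assumptions.
import json
from openai import OpenAI

client = OpenAI()
CRITERIA = ["content accuracy", "clarity of the stem", "plausibility of distractors"]

def evaluate_item(item_text: str) -> dict:
    """Ask an evaluator model (distinct from the generator) for criterion scores 1-5."""
    prompt = (f"Score the following test item from 1 (poor) to 5 (excellent) on: "
              f"{', '.join(CRITERIA)}. Respond as a JSON object with those keys.\n\n"
              f"{item_text}")
    response = client.chat.completions.create(
        model="gpt-4o-mini",          # evaluator chosen independently of the generator
        messages=[{"role": "user", "content": prompt}],
        response_format={"type": "json_object"},
    )
    return json.loads(response.choices[0].message.content)

def route(item_text: str, threshold: int = 4) -> str:
    """Accept automatically only when every criterion meets the threshold."""
    scores = evaluate_item(item_text)
    return "auto-accept" if all(v >= threshold for v in scores.values()) else "human review"
```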
Sixth, resource constraints limiting access to computational infrastructure. Parameter-efficient fine-tuning techniques (LoRA, QLoRA) reduce memory requirements while maintaining performance. Cloud computing services provide on-demand resources, and prompt engineering optimization offers low-resource alternatives for institutions unable to fine-tune models.
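As an illustration of the parameter-efficient route, the sketch below attaches LoRA adapters to a causal language model with the Hugging Face PEFT library; the base checkpoint, target modules, and hyperparameters are illustrative defaults rather than recommendations.

```python
# Minimal LoRA fine-tuning configuration sketch using Hugging Face PEFT.
# The base checkpoint, target modules, and hyperparameters are illustrative defaults.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base_model_name = "meta-llama/Llama-3.1-8B"   # any causal LM checkpoint
tokenizer = AutoTokenizer.from_pretrained(base_model_name)
model = AutoModelForCausalLM.from_pretrained(base_model_name)

lora_config = LoraConfig(
    r=8,                                   # low-rank dimension: small adapters, small memory cost
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],   # attention projections commonly adapted
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()         # typically well under 1% of total parameters
```

A QLoRA variant would additionally load the base model in 4-bit precision (for example, via bitsandbytes quantization) before attaching the adapters, further reducing memory requirements.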
Seventh, technical complexity creating barriers for non-technical domain experts. User-friendly interfaces should abstract API interactions, provide template libraries for common assessment types, offer real-time preview and editing capabilities, and integrate quality feedback mechanisms guiding users toward best practices.
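A minimal sketch of the template-library idea is given below: the interface exposes named assessment types and form fields, while prompt construction and API details stay hidden from the user. The template names, fields, and wording are illustrative.

```python
# Sketch of a small item-template library that a non-technical interface could expose,
# hiding prompt construction and API details behind named assessment types.
# Template names, fields, and wording are illustrative.
from dataclasses import dataclass

@dataclass
class ItemTemplate:
    name: str
    prompt: str          # prompt skeleton with named placeholders
    fields: tuple        # placeholders the interface asks the user to fill

TEMPLATE_LIBRARY = {
    "mcq_knowledge": ItemTemplate(
        name="Multiple-choice knowledge item",
        prompt="Write a multiple-choice item on {topic} for {audience}, with 4 options and a key.",
        fields=("topic", "audience"),
    ),
    "likert_attitude": ItemTemplate(
        name="Likert attitude statement",
        prompt="Write a 5-point Likert statement measuring {construct}, avoiding double-barreled wording.",
        fields=("construct",),
    ),
}

def build_request(template_key: str, **user_inputs: str) -> str:
    """Turn form inputs from the interface into a ready-to-send generation prompt."""
    template = TEMPLATE_LIBRARY[template_key]
    return template.prompt.format(**user_inputs)

print(build_request("mcq_knowledge", topic="photosynthesis", audience="grade 9 students"))
```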
This review reveals that successful LLM-based AIG implementation requires coordinated solutions addressing domain knowledge, technological, psychometric, and practical challenges. Key priorities include optimizing knowledge enhancement technologies for domain-specific accuracy, adapting validation methods for AI-generated content, and establishing human-AI collaborative workflows that leverage computational efficiency while maintaining professional standards. The convergence of these approaches—systematic RAG implementation, multi-stage psychometric validation, parameter-efficient training methods, and accessible user interfaces—provides a practical framework for advancing test development. Future directions should focus on empirical validation of these strategies across diverse assessment contexts and the establishment of standardized protocols that ensure both innovation and measurement quality in the evolving landscape of automated item generation.