Loading...
ISSN 0439-755X
CN 11-1911/B

Archive

    For Selected: Toggle Thumbnails
    Academic Papers of the 27 th Annual Meeting of the China Association for Science and Technology
    When AI “possesses” personality: Roles of good and evil personalities influence moral judgment in large language models
    JIAO Liying, LI Chang-Jin, CHEN Zhen, XU Hengbin, XU Yan
    2025, 57 (6):  929-946.  doi: 10.3724/SP.J.1041.2025.0929
    Abstract ( 186 )   HTML ( 38 )  
    PDF (836KB) ( 91 )  

    At the intersection of technology and morality, a critical question arises: Can large language models (LLMs) simulate good and evil personalities, and does this capacity influence their performance in moral judgment tasks? This study investigated the moral judgment characteristics of LLMs when simulating different good and evil personalities, as well as the similarities and differences between these patterns and those of humans. Across two studies, we analyzed moral judgment data generated by two LLMs—ERNIE 4.0 and GPT-4 (N = 4,832)—alongside responses from human participants (N = 370). The results revealed that: (1) LLMs are capable of successfully simulating varying levels of good and evil personalities; (2) the personality configuration significantly affects the moral judgments made by LLMs; and (3) a personality hierarchy emerges in the alignment between human and LLMs’ responses: good personality plays a more critical role than evil personality (inter-personality hierarchy), and within the good personality, conscientiousness and integrity dimension exerts the strongest influence (intra-personality hierarchy). This research constructed a theoretical model of good and evil personalities in LLMs under moral judgment tasks, contributing to a deeper understanding of how simulated personalities function in AI moral reasoning. The findings provided a theoretical foundation for promoting moral alignment in artificial intelligence systems.

    Figures and Tables | References | Related Articles | Metrics
    The linguistic strength and weakness of artificial intelligence: A comparison between Large Language Model (s) and real students in the Chinese context
    GAO Chenghai, DANG Baobao, WANG Bingjie, WU Michael Shengtao
    2025, 57 (6):  947-966.  doi: 10.3724/SP.J.1041.2025.0947
    Abstract ( 94 )  
    Previous research on generative artificial intelligence (AI) has been primarily conducted in the English context, but it remains unclear about linguistic strength and weakness of generative AI in the Chinese context. This study focuses on the accuracy and normativity, affectivity, and creativity of AI in generating language knowledge, and explores its cultural adaptability and ability to generate humanized and personalized content. Evaluating and analyzing these key indicators helps us gain a deeper understanding of the linguistic strengths and weaknesses of AI, as well as cultivating the unique advantages of humans in education.
    By combining quantitative and qualitative methods, we evaluated the differences in knowledge accuracy, normativity, affectivity, and creativity between large language models and real students. Specifically, using an explanatory sequential design in the mixed-methods framework, we first tested group differences in each indicator among GPT-4 and ERNIE-4 versus real students on knowledge accuracy, normativity, affectivity, and creativity to test the. Next, through content analyses, we explored the specific performance of large language models on each indicator and the mechanism of their linguistic strengths and weaknesses.
    Study 1 found that compared to real students, GPT-4 exhibited higher accuracy in modern text knowledge (especially conceptual knowledge), but lower accuracy in ancient poetry and language usage. The knowledge normativity of GPT-4 were comparable to those of real students, while its affectivity and creativity were lower than those of real students. Moreover, the highest individual scores of GPT-4 in normativity and emotionality were on comparable with the highest scores of real students. Study 2, based on ERNIE-4, confirmed the aforementioned results, and the accuracy in ancient poetry was still lower than that of real students. The results exhibited the advantages of artificial intelligence in the areas of modern knowledge and norms, its shortcomings in ancient poetry knowledge, and its potential in affective and creative expressions.
    Taken together, the current findings demonstrate the linguistic strength of generative AI in the knowledge accuracy of modern Chinese literary, and the weakness regarding ancient Chinese poetry and affective and creative writings, as well as generative AI’s potential in normative and affective expressions. This sheds light on the field of the cultural adaptability, affective and creative expressions of generative AI, and has valuable implications for the AI-assistant teaching practice in the Chinese context.
    Related Articles | Metrics
    Academic Papers of the 27th Annual Meeting of the China Association for Science and Technology
    Cognitive outsourcing based on generative artificial intelligence: An Analysis of interactive behavioral patterns and cognitive structural features
    WANG Fancong, TANG Xiaoyu, YU Shengquan
    2025, 57 (6):  967-986.  doi: 10.3724/SP.J.1041.2025.0967
    Abstract ( 249 )   HTML ( 12 )  
    PDF (3482KB) ( 82 )  

    Humans can enhance task efficiency and quality by delegating part of their cognitive tasks to generative artificial intelligence (AI), a process referred to as cognitive outsourcing. However, individuals’ effectiveness in using AI varies. To identify the key characteristics and inherent requirements of effective cognitive outsourcing, this study designed a cognitive outsourcing activity for graduate students. Participants wrote articles on open-ended topics with the assistance of a generative AI system and were divided into high-performance and low-performance groups based on their article scores. Differential analysis of knowledge pre-tests revealed that the high-performance group exhibited significantly higher prior domain knowledge compared to the low-performance group. Through lag sequential analysis and epistemic network analysis of interaction process data, differences in interactive behavioral patterns and cognitive structural features between the two groups were identified: participants in the high-performance group demonstrated more diversified behavioral transitions, forming a pattern characterized by “rapid and autonomous task comprehension and planning, efficient and precise human-computer interaction, selective information extraction and deep processing”; the cognitive structure of the high-performance group was balanced and comprehensive, primarily engaging with higher-level cognitive processing, while the low-performance group's cognitive structure was unbalanced and fragmented, primarily engaging with lower-level cognitive processing. In conclusion, effective cognitive outsourcing is a multifaceted process that necessitates active participation and profound cognitive processing. It demands proficient integration between internal cognitive frameworks and external technological tools.

    Figures and Tables | References | Related Articles | Metrics
    Academic Papers of the 27 th Annual Meeting of the China Association for Science and Technology
    Suicidal ideation data augmentation and recognition technology based on large language models
    ZHANG Yanbo, HUANG Feng, MO Liuling, LIU Xiaoqian, ZHU Tingshao
    2025, 57 (6):  987-1000.  doi: 10.3724/SP.J.1041.2025.0987
    Abstract ( 84 )   HTML ( 6 )  
    PDF (384KB) ( 27 )  

    Suicide has become a global public health challenge. Traditional methods for identifying suicidal ideation primarily rely on patients actively seeking help, while automated identification models based on text analysis are limited by the scarcity of annotated data. This study innovatively proposes a data augmentation method based on large language models (LLMs) to improve the accuracy of suicidal ideation text recognition. The research employs a two-stage design: Study 1 focuses on data augmentation, and Study 2 validates the enhancement effect. In Study 1, ChatGLM3-6B and Qwen-7B-Chat were selected as the underlying models, combining supervised learning strategies with zero-shot and few-shot learning methods to optimize training dataset quality. Through eight rigorous comparative experiments, the results show that the two self-developed models demonstrated excellent performance in data augmentation, with comprehensive scores of 0.90 and 0.92 for their processed datasets, significantly outperforming baseline models (p < 0.001). Study 2 further evaluated the impact of data augmentation on recognition model performance, showing that the enhanced models comprehensively outperformed the best baseline models in terms of recognition accuracy and true negative rate (p < 0.001). This study not only validates the effectiveness of LLM-based data augmentation methods in improving the performance of suicidal ideation recognition models but also opens new directions for artificial intelligence applications in the field of mental health. This approach has the potential to provide timely and effective early warning of suicide risk while protecting user privacy, offering important technical support and research ideas for suicide prevention work. Future research could focus on expanding data heterogeneity, optimizing prompt engineering design, and introducing human-computer interaction paradigms to further extend the application of this method in promoting clinical psychological diagnosis.

    Figures and Tables | References | Related Articles | Metrics
    Reports of Empirical Studies
    Object category differences regulate the sensory dominance of the response level in an audiovisual cross-modal conflict
    ZHOU Heng, WANG Ai-Jun, YUAN Xiang-Yong, JIANG Yi
    2025, 57 (6):  1001-1012.  doi: 10.3724/SP.J.1041.2025.1001
    Abstract ( 63 )   HTML ( 8 )  
    PDF (1311KB) ( 22 )  

    The sensory dominance refers to the phenomenon where the brain prioritizes the processing of information from a specific sensory modality when faced with multisensory modalities. The cognitive processing level hypothesis posits that the sensory dominance effect is determined by different levels of cognitive processing, with early perceptual processing favoring visual dominance and late response processing favoring auditory dominance. However, existing research has not focused on how the intermediate processing levels between early and late stages of cognitive processing influence the sensory dominance effect. This study manipulated object category differences at the intermediate processing level, employing a 2-1 mapping paradigm to examine how object category representation between early perceptual and late response levels affects cross-modal sensory dominance effects through three experiments. Experiment 1 found that object category differences can modulate the sensory dominance effect at the response level, with visual dominance when category differences are small and auditory dominance when they are large. Experiment 2 indicated that this effect is not related to different processing depths of visual stimuli, confirming that the effect is specific to the visual channel. Experiment 3, using transcranial direct current stimulation (tDCS) to inhibit the brain area responsible for category processing, the left anterior temporal lobe, found that the auditory dominance at the response level disappeared. The study demonstrates that the object category representation at the intermediate processing level within cognitive processing levels regulates the sensory dominance effect, thereby refining the cognitive processing level hypothesis's explanation of cross-modal sensory dominance effects.

    Figures and Tables | References | Related Articles | Metrics
    The multi-cue influence mechanism of spatial communication across different perspectives
    JIAN Jinhan, ZHANG Junheng, YAN Bihua, JI Ming
    2025, 57 (6):  1013-1040.  doi: 10.3724/SP.J.1041.2025.1013
    Abstract ( 45 )   HTML ( 6 )  
    PDF (332KB) ( 19 )  
    Figures and Tables | References | Related Articles | Metrics
    The processing bias of verbal emotional information in depression prone individuals
    LI Yutong, LI Xuan, YUE Zeming, LI Yahong, SUI Xue
    2025, 57 (6):  1041-1055.  doi: 10.3724/SP.J.1041.2025.1041
    Abstract ( 85 )   HTML ( 24 )  
    PDF (202KB) ( 42 )  
    Figures and Tables | References | Related Articles | Metrics
    The joint role of childhood emotional abuse and bullying victimization in the development of adolescent depressive symptoms: Sequential mediation or enhanced moderation?
    LI Xi, ZHANG Chang, YU Ruize, YIN Yijia, ZHOU Tong, LIU Wei, CHEN Ning
    2025, 57 (6):  1056-1069.  doi: 10.3724/SP.J.1041.2025.1056
    Abstract ( 74 )   HTML ( 13 )  
    PDF (638KB) ( 71 )  

    To examine the longitudinal joint role of two types of interpersonal risk factors (childhood emotional abuse and adolescent bullying victimization) in the development of adolescents' depressive symptoms, 521 middle school students were followed up for 2 years and 3 rounds. The results of the analysis based on the latent growth curve model and its variants showed that: (1) both childhood emotional abuse and adolescent bullying victimization significantly and positively predicted adolescents' depressive symptoms in terms of initial intercept and growth slope; (2) the initial intercept and growth slope of bullying victimization mediated the prediction of adolescents' depressive symptoms by childhood emotional abuse; and (3) with respect to the initial intercept, bullying victimization weakened the prediction of adolescents' depressive symptoms by childhood emotional abuse; and (4) in terms of initial intercept, bullying victimization weakened the prediction of adolescents' depressive symptoms. Childhood emotional abuse weakened the positive predictive effect of bullying victimization on depressive symptoms, but there was no significant moderating effect on the growth slope. These results suggest that childhood emotional abuse and adolescent bullying victimization not only independently predicted the development of depressive symptoms in adolescents, but also played a joint role, which was mainly manifested in a longitudinal sequential mediation model (rather than an enhanced moderation model). Based on this conclusion, this paper integrates the interpersonal risk model of depression with the cumulative risk model to form the cumulative interpersonal risk model, and identifies the longitudinal pattern of interpersonal risk factors across developmental stages and relational systems as sequential mediators of depression in adolescents.

    Figures and Tables | References | Related Articles | Metrics
    Is the Bystander Truly Objective? The Moderation of Third-Party Moral Judgment by Perspective Taking in Moral Scenarios
    SONG Ru, WU Jun, LIU Caixia, LIU Jie, CUI Fang
    2025, 57 (6):  1070-1082.  doi: 10.3724/SP.J.1041.2025.1070
    Abstract ( 86 )   HTML ( 7 )  
    PDF (745KB) ( 22 )  

    In moral scenarios, individuals often exhibit divergent interpretations and judgments of the same moral event due to varying prior experiences, making true “bystander objectivity” challenging. This study investigates how prior experiences influence perspective selection and subsequently moderate moral judgment and its neural underpinnings by activating different moral role perspectives (decision-maker vs. receiver) using Event-Related Potentials (ERPs). Results reveal that activating the receiver's perspective leads to stricter moral judgments, whereas activating the decision-maker's perspective results in more lenient judgments. Furthermore, the moderating effect of perspective on moral judgment weakens as the decision-maker's gains from immoral choices decrease. At the neural level, activating different moral role perspectives affects early processing and emotional arousal during moral judgment, manifesting as larger N1 and P2 components for the decision-maker perspective, and larger FRN components related to expectation violation for the receiver perspective. These findings indicate that prior moral experiences significantly shape an individual's moral judgment preferences as a bystander, primarily by modulating early processing of others' moral decisions.

    Figures and Tables | References | Related Articles | Metrics
    The double-edged sword effect of proactive helping behavior on coworker relationships
    CHEN Liangyong, CAO Zhonghuai, WAN Wenhua, ZHANG Weiting
    2025, 57 (6):  1083-1097.  doi: 10.3724/SP.J.1041.2025.1083
    Abstract ( 71 )   HTML ( 7 )  
    PDF (315KB) ( 28 )  
    Figures and Tables | References | Related Articles | Metrics
    Commentary
    The so-called influence relationship requires caution: Commentary on Wen et al. (2024)
    GE Xiaoyu
    2025, 57 (6):  1098-1107.  doi: 10.3724/SP.J.1041.2025.1098
    Abstract ( 99 )   HTML ( 13 )  
    PDF (545KB) ( 30 )  

    This is a commentary on a paper entitled “The influence relationship among variables and types of multiple influence factors working together” by Wen et al., published in Acta Psychologica Sinica in October 2024. They proposed a new concept called the “influence relationship.” This new concept is problematic. First, Wen et al. provided no definition for the “influence relationship,” which is unacceptable for a new-conception paper. Second, according to their proposed inference requirement, if researchers fail to disprove alternative explanations that threaten causal inferences, then they can use the term, “influence relationship,” when reporting their studies. However, this argument is a manifestation of the misunderstanding of inference requirements of causal relationships. Third, “influence” is a term that poses causal meanings according to Chinese and English dictionaries, previous academic articles, and empirical evidence. Thus, the suggestion by Wen et al. to describe a noncausal relationship using “influence” can result in an overstatement of research significance and misunderstanding among fellow academics and public readers. This scenario is contradictory to the increasing expectations of researchers of more rigorous scientific language. Fourth, Wen et al. were confused with goals and the realization of such goals. Failure to disprove alternative explanations is a compromise or a limitation in methods instead of a unique goal.

    Wen et al. stressed that a “directional correlation” lacked an appropriate name in academia. Therefore, they called it the “influence relationship.” This stance is seemingly an unfair description of the academic status quo because researchers typically adopt the word, “predict,” to describe a directional correlation. Based on previous articles, this commentary proposes another framework for the categorization of variable relationships. At the goal level, causal goals—in which researchers hypothesize a difference in Y if X is deliberately changed—can be distinguished from noncausal goals. Furthermore, noncausal goals can be classified as predictive goals (e.g., using texts to predict mental disorder risks or test scores to predict future performance) and purely correlational goals (e.g., a shopping basket analysis or a correlation analysis between a newly proposed personality construct and the Big Five). Neither is concerned with alterations to X. At the realization level, if a researcher opts for a causal goal but fails to provide sufficient evidence to support causal relationships, then they are expected to avoid causal language (e.g., “influence”) when reporting results and key conclusions. Alternatively, they can use terms such as “be associated with” and “predict” if appropriate.

    Moreover, this commentary provides authors and reviewers with several practical suggestions. (A) Clearly define research goals because the different criteria to evaluate causal, predictive, and purely correlational studies should be followed. (B) Enable researchers to discuss causal meanings conveyed by their results even if they fail to offer sufficient causal evidence when targeting causal goals. This statement does not mean an encouragement of overstatement; conversely, only if researchers clearly define their causal goals can they admit the extent to which they are realizing such goals. (C) Use noncausal language to report noncausal results frankly rather than using euphemisms as a strategy for impression management. (D) Avoid an all-or-none attitude toward causal evidence; instead, value every effort that helps disprove alternative explanations and provides more confidence in causal propositions. (E) Do not rely on a single study (even a randomized experiment) to provide conclusive answers to causal questions; instead, value the accumulation of evidence and triangulation.

    Figures and Tables | References | Related Articles | Metrics
    Distinguishing between causality, influence, correlation, and prediction*
    WEN Zhonglin, MA Peng, MENG Jin, WANG Yifan
    2025, 57 (6):  1108-1118.  doi: 10.3724/SP.J.1041.2025.1108
    Abstract ( 94 )   HTML ( 2 )  
    PDF (193KB) ( 21 )  

    In response to Wen et al.’s (2024b) proposal of the new term called influence relationship, Ge (2025) raised the following doubts: (1) the new term lacks a clear definition; (2) influence relationship could not be distinguished from causal relationship; (3) one should not create a new objective (or relationship) simply because causality is not achieved; (4) the so-called influence relationship should be termed prediction instead. This article attempts to resolve his doubts as follows: (1) influence relationship has been defined rigorously, employing the “genus and differentia” method; (2) a causal relationship is necessarily an influence relationship, but an influence relationship is not necessarily a causal relationship; (3) establishing an influence relationship can be a goal for non-experimental research, which is superior to merely establishing a correlational relationship; (4) prediction is an application of variable relationships but is not the relationship itself and is not equivalent to the influence relationship.

    Figures and Tables | References | Related Articles | Metrics