ISSN 0439-755X
CN 11-1911/B
主办:中国心理学会
   中国科学院心理研究所
出版:科学出版社

心理学报 ›› 2026, Vol. 58 ›› Issue (3): 381-398.doi: 10.3724/SP.J.1041.2026.0381 cstr: 32110.14.2026.0381

• 第二十七届中国科协年会学术论文 •    下一篇

从显性威慑到隐性内化:AI监管和黑暗三联征人格对诚实行为的影响

王健树1, 姜啸威2, 陈亚楠1, 王明辉1, 杜峰3,4   

  1. 1河南大学心理学院, 开封 475004;
    2澳大利亚悉尼科技大学计算智能和脑机接口实验室, 悉尼 2007;
    3中国科学院心理研究所认知科学与心理健康国家重点实验室, 北京 100101;
    4中国科学院大学心理学系, 北京 100049
  • 收稿日期:2025-05-10 发布日期:2025-12-26 出版日期:2026-03-25
  • 通讯作者: 陈亚楠, E-mail: chenyn@henu.edu.cn; 王明辉, E-Mail: wmhwang@163.com
  • 作者简介:王健树和姜啸威同为第一作者。
  • 基金资助:
    教育部人文社会科学研究青年基金项目(22YJCZH021), 国家社科基金(24AGL037)资助

From overt deterrence to covert internalization: Moral effects of AI regulation and the moderating role of personality traits

WANG Jianshu1, JIANG Xiaowei2, CHEN Yanan1, WANG Minghui1, DU Feng3,4   

  1. 1School of Psychology, Henan University, Kaifeng 475004, China;
    2Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney 2007, Australia;
    3State Key Laboratory of Cognitive Science and Mental Health, Institute of Psychology, Chinese Academy of Sciences, Beijing 100101, China;
    4Department of Psychology, University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2025-05-10 Online:2025-12-26 Published:2026-03-25

摘要: 随着生成式人工智能逐步演变为具备自主影响力的社会代理, 其对人类道德决策的影响愈加显著。然而, 传统伦理监管模式基于理性人假设, 忽视了人格特质对道德选择的调节作用, 导致监管效果因人而异, 难以实现最优效率。为突破这一局限, 本研究聚焦于道德研究中的诚实维度, 构建了监管类型 × 人格特质交互模型, 系统探讨显性监管、隐性激励与道德反馈三类AI驱动的干预策略对自恋、马基雅维利主义和精神病态人格的差异化影响。结果显示:(1)显性监管有效提升诚实行为, 尤其对高马基雅维利主义与高精神病态个体具有显著约束力; (2)监管即使隐藏也能规范诚实行为, 其不确定性加速了决策反应; (3)隐性激励促进诚实行为并具内化效应, 但高马基雅维利主义者在奖励撤除后的诚实率显著降低, 揭示了该策略在此类人群中的脆弱性; (4)道德反馈对整体诚实率提升有限, 仅对高马基雅维利主义者具有一定影响, 而高自恋者表现出持续不诚实倾向。本研究创新性地揭示了人格特质(黑暗三联征)在AI伦理监管中的关键作用, 并为传统的理性人监管范式提供了一个基于人格差异化的视角, 为未来设计情境化与个性化AI伦理干预策略提供了重要的理论与实践依据。

关键词: AI监管, 诚实行为, 黑暗三联征, 人格特质, 情境化干预

Abstract: As generative artificial intelligence (GenAI) evolves into social agents with autonomous influence, its impact on human moral decision-making is becoming increasingly significant. Current regulatory models are often grounded in the "rational person hypothesis, " which assumes uniform responses to ethical constraints. This perspective, however, overlooks the profound moderating role of personality traits in moral choices, leading to divergent regulatory effects and a loss of efficiency. The Dark Triad of personality (narcissism, Machiavellianism, and psychopathy) is a robust predictor of moral deviation. To address this gap, we constructed a "Regulation Type × Personality Trait" interaction model. We hypothesized that the effectiveness of different AI-driven intervention strategies—namely explicit regulation, implicit incentives, and moral feedback—would be significantly moderated by individuals' Dark Triad traits when making decisions about honesty.
A series of experiments were conducted to test our hypotheses. The study utilized a modified coin-flip task where participants privately guessed and reported outcomes, a paradigm designed to create opportunities for dishonest behavior for personal gain. Participants' honesty rates and reaction times were recorded as the primary dependent variables. Across the experiments, we manipulated the AI-driven intervention strategies. These strategies included: (1) explicit (visible) versus implicit (invisible) AI surveillance which involved potential penalties for dishonesty; (2) implicit monetary incentives which rewarded consistent honesty; and (3) moral feedback which provided textual messages in response to honest or dishonest reports. Prior to the behavioral tasks, participants' personality traits were measured using the validated Short Dark Triad (SD3) scale.
The results supported our hypotheses, demonstrating significant interactions between intervention types and personality traits. In Experiment 1, explicit AI surveillance significantly increased honest reporting (t(45) = 4.59, p < 0.001), particularly among individuals with high levels of Machiavellianism (t(25) = 4.60, p = 0.005) and psychopathy (t(28) = 4.44, p < 0.001). In Experiment 2, invisible AI surveillance also enhanced honesty but was less effective than visible AI surveillance, F(2, 90) = 18.10, p < 0.001. Notably, invisible surveillance resulted in the shortest reaction time (RT = 0.49), F(2, 90) = 34.10, p < 0.001. High Machiavellian participants displayed greater honesty under visible surveillance (OR = 0.70, p = 0.013) but were more dishonest without or under invisible AI surveillance. In Experiment 3, potential financial rewards increased reaction time (F(2, 118) = 58.59, p < 0.001), while high Machiavellian individuals showed reduced honesty during the internalization stage, t(57.98) = -2.04, p = 0.044. In Experiment 3a and 3b, financial incentives promoted honesty more effectively than moral messaging during the reward stage (t(120) = 3.07, p = 0.003) and maintained this effect into the internalization stage (t(120) = 2.06, p = 0.041), demonstrating the robustness of monetary influence. High Machiavellian participants sustained higher honesty levels in the internalization stage (OR = 1.96, p < 0.001). In contrast, narcissistic participants showed resistance to moral messaging, especially during the reward stage, t(49.95) = -2.55, p = 0.013.
This study was the first to systematically reveal the critical moderating role of the Dark Triad personality traits in AI ethical regulation. The findings challenge the traditional 'rational person' paradigm by empirically demonstrating the significant personality-based heterogeneity of regulatory effects. The core contribution of this research is the proposal of an innovative concept: 'personality-regulated regulation.' This framework provides a vital theoretical and practical foundation for designing future AI ethical intervention strategies that are contextualized and personalized. Such an approach allows for the optimization of regulatory resource allocation and enhances overall regulatory efficacy, moving beyond one-size-fits-all models.

Key words: AI regulation, honest behavior, Dark Triad, personality traits, contextualized intervention

中图分类号: