ISSN 1671-3710
CN 11-4766/R
Sponsored by: Institute of Psychology, Chinese Academy of Sciences
Published by: Science Press

Comparing the Mechanisms of Level-1 and Level-2 Visual Perspective Taking: Theoretical Controversies, Behavioral and Neuroscientific Evidence

Wang Jiayin (王镓茵), Li Jing (李晶)

  1. School of Psychology, Nanjing Normal University, Nanjing 210097, China
  • Received: 2025-06-06 Revised: 2025-12-18 Accepted: 2026-01-09
  • Supported by:
    General Project of Humanities and Social Sciences Research of the Ministry of Education of China (24YJA190007); General Program of the National Natural Science Foundation of China (4237010315)

Abstract: Visual perspective taking (VPT) is commonly divided into level-1 and level-2. Current theories fundamentally disagree about how the two relate: the two-systems account holds that they rely on independent internal mechanisms, whereas the single-system account holds that they share one system. Integrating both perspectives, this article proposes a three-stage processing model in which level-1 and level-2 VPT each pass through three stages: information processing, perspective simulation, and information integration and response selection. Behavioral and neural evidence suggests that the mechanisms of level-1 and level-2 VPT may both differ and overlap at each stage. In the information processing stage, the two share basic spatial information encoding, but level-2 VPT involves more fine-grained representations. In the perspective simulation stage, level-1 VPT relies on rapid, non-embodied gaze tracking, whereas level-2 VPT requires embodied self-rotation that transforms the reference frame. In the information integration and response selection stage, the two may share an understanding of others' intentions, but level-2 VPT demands stronger cognitive control. The three-stage model offers a unified framework for understanding level-1 and level-2 VPT. Future research should develop experimental paradigms that dissociate the stages, use high-temporal-resolution techniques to test the model's time course, and further investigate the conditions that trigger embodied mechanisms in level-2 VPT, as well as cross-modal integration.

Key words: visual perspective taking, two-systems account, single-system account, spatial cognition