ISSN 0439-755X
CN 11-1911/B
主办:中国心理学会
   中国科学院心理研究所
出版:科学出版社

心理学报 ›› 2025, Vol. 57 ›› Issue (10): 1832-1848.doi: 10.3724/SP.J.1041.2025.1832 cstr: 32110.14.2025.1832

• 研究报告 • 上一篇    下一篇

迫选测验中虚假作答行为建模及其在人格测评中的应用:基于RES理论框架

何翠婷1, 彭思韦2, 朱怡安1, 汪大勋1(), 蔡艳1(), 涂冬波1()   

  1. 1江西师范大学心理学院, 南昌 330022
    2浙江师范大学心理学院,金华 321004
  • 收稿日期:2024-05-30 发布日期:2025-08-15 出版日期:2025-10-25
  • 通讯作者: 汪大勋, wangda.xun@163.com;
    涂冬波, E-mail: tudongbo@aliyun.com;
    蔡艳, E-mail: cy1979123@aliyun.com
  • 作者简介:第一联系人:

    彭思韦和朱怡安为文章共同第一作者

  • 基金资助:
    国家自然科学基金(32160203);国家自然科学基金(62167004);国家自然科学基金(62467002);国家自然科学基金(32300942)

Faking modeling for forced choice measures in personality assessment based on RES theoretical framework

HE Cuiting1, PENG Siwei2, ZHU Yian1, WANG Daxun1(), CAI Yan1(), TU Dongbo1()   

  1. 1School of Psychology, Jiangxi Normal University, Nanchang 330022, China
    2School of Psychology, Zhejiang Normal University, Jinhua 330022, China
  • Received:2024-05-30 Online:2025-08-15 Published:2025-10-25

摘要:

与Likert自评量表相比, 虽然迫选测验因对项目进行社会称许性匹配而具一定的抗作假功效, 但大量研究表明项目的称许性会由于与不同的项目匹配成block发生改变, 并在不同的测评情境下也会发生改变, 因此迫选测验仍不可避免地存在虚假作答行为, 进而严重降低并危害测量结果的准确性与公平性。鉴于此, 本研究基于瑟斯顿IRT模型(TIRT)以及Böckenholt (2014)的RES作假理论模型, 针对迫选测验中虚假作答行为进行统计建模(简记为RES-TIRT), 以期解决上述问题。本文通过两项模拟研究探讨了新模型的性能并与传统的模型进行比较, 随后通过实证研究深入探讨了新模型在大五人格测评中的具体应用及其优势。模拟研究结果表明:(1)在不同模拟条件下RES-TIRT模型估计情况良好; (2)不论是项目参数还是被试参数, 新模型RES-TIRT的参数估计精度均明显优于传统的TIRT模型。实证研究将新模型应用于真实的大五人格测评, 通过对比分析诚实作答组和虚假作答组的结果, 结果表明:与传统的TIRT模型相比, 新模型RES-TIRT能有效地降低乃至消除虚假作答对测量结果的负面影响, 并进一步提升了迫选测验的抗作假功效, 有力地证明了RES-TIRT模型的优势及其应用前景。

关键词: 瑟斯顿IRT模型, 迫选测验, 虚假作答, 人格测评

Abstract:

Although forced-choice (FC) assessments with social desirability matching reduce faking compared to Likert scales, the desirability of items may shift when matched in blocks and vary across contexts. Consequently, faking remains a persistent issue in FC assessments, compromising measurement accuracy and fairness. To address this, we propose a statistical model for detecting and mitigating faking in FC assessments, integrating Böckenholt’s (2014) model of RES faking theory with the Thurstone Item Response Theory (TIRT) model (Brown & Maydeu-Olivares, 2011). Our approach aims to minimize the adverse effects of faking and enhance the robustness of FC measures.

Two simulation studies were conducted to evaluate the proposed RES-TIRT model. Simulation Study 1 examined model performance under varying conditions (sample size, FC scale format, item direction, trait correlation, and dimensionality). Results indicated optimal estimation accuracy when using 3-item blocks, 3 dimensions, a correlation of 0 between dimensions, and a mix of positive and negative item descriptions. Simulation Study 2 compared trait estimation accuracy between TIRT and RES-TIRT models under increasing faking prevalence. While the TIRT model performed better in faking-free conditions, its accuracy declined more sharply than the RES-TIRT model as faking increased—particularly for item parameters—demonstrating the RES-TIRT model’s superior resistance to faking.

An empirical analysis further validated the model’s applicability in real-world settings. Comparing honest responses with faked responses (simulating lawyer job applications), we found that applicants strategically inflated traits like openness, agreeableness, and extraversion to meet job requirements. The RES-TIRT model effectively detected these distortions, showing significant discrepancies in these dimensions compared to the TIRT model. Additionally, the RES-TIRT model effectively captured response distortion tendencies, as evidenced by significantly elevated latent trait values θjE under faking conditions compared to honest responses. This indicates that, the faking behavior of the applicants can be successfully captured by the RES-TIRT model. Moreover, the difficulty parameter βEim triggering fake answers can be observed to determine whether a FC block is prone to faking. These empirically derived parameters enable targeted refinements in FC measure development, allowing test constructors to strategically modify or eliminate items with low faking thresholds, thereby enhancing the scale's overall resistance to response biases.

In conclusion, both simulation and empirical studies have demonstrated that the RES-TIRT model is a viable alternative to the TIRT model. It can be employed to address the issue of faking in FC scales, particularly in high-stakes situations such as talent selection.

Key words: Thurstonian IRT model, forced-choice measures, faking responses, personality assessment

中图分类号: