ISSN 0439-755X
CN 11-1911/B
Sponsored by: Chinese Psychological Society
   Institute of Psychology, Chinese Academy of Sciences
Published by: Science Press

Acta Psychologica Sinica ›› 2023, Vol. 55 ›› Issue (12): 1903-1916. doi: 10.3724/SP.J.1041.2023.01903

• Research Report •

The role of stroke nodes in the recognition of handwritten Chinese characters

ZHU Yiming, ZHAO Yang, TANG Ning, ZHOU Jifan, SHEN Mowei

  1. Department of Psychology and Behavioural Sciences, Zhejiang University, Hangzhou 310058, China
  • Received: 2022-01-14 Online: 2023-10-16 Published: 2023-12-25
  • Corresponding authors: ZHOU Jifan, E-mail: jifanzhou@zju.edu.cn; SHEN Mowei, E-mail: mwshen@zju.edu.cn
  • Funding:
    National Natural Science Foundation of China, General Program (32071044); National Natural Science Foundation of China, General Program (31871096); Fundamental Research Funds for the Central Universities (2021FZZX001-06)

Abstract:

Generative theory holds that recognizing a visual pattern amounts to inverse inference about the process that generated it. Chinese characters are pictographic characters formed by strokes that interlace and connect according to orthographic rules, so recognizing a handwritten Chinese character can be regarded as reverse inference about how the character was produced. According to a typical generative model, the Bayesian program learning model, generative recognition of a character starts from its strokes: nodes are first extracted at the intersections of line segments, and all stroke combinations that could produce each node are then enumerated to recover how the character was generated. This account predicts that the number of nodes and node complexity are important factors in handwritten Chinese character recognition. The present study investigated the role of nodes in Chinese character recognition through three experiments.
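The node-extraction step described above can be illustrated with a toy sketch. It is not the authors' procedure or the Bayesian program learning model itself: it assumes, purely for illustration, that each stroke is a single straight line segment, that a node is any point where two strokes meet, and that node complexity is the number of strokes meeting there. The function names, stroke labels, and coordinates are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def segment_intersection(seg_a, seg_b, eps=1e-9):
    """Return the point where two straight segments meet, or None."""
    (x1, y1), (x2, y2) = seg_a
    (x3, y3), (x4, y4) = seg_b
    denom = (x2 - x1) * (y4 - y3) - (y2 - y1) * (x4 - x3)
    if abs(denom) < eps:
        # Parallel or collinear strokes are treated as not forming a node here.
        return None
    t = ((x3 - x1) * (y4 - y3) - (y3 - y1) * (x4 - x3)) / denom
    u = ((x3 - x1) * (y2 - y1) - (y3 - y1) * (x2 - x1)) / denom
    if -eps <= t <= 1 + eps and -eps <= u <= 1 + eps:
        # Round so that numerically close meeting points map to one node.
        return (round(x1 + t * (x2 - x1), 6), round(y1 + t * (y2 - y1), 6))
    return None


def extract_nodes(strokes):
    """Map each node (meeting point) to the set of strokes that pass through it."""
    nodes = defaultdict(set)
    for (name_a, seg_a), (name_b, seg_b) in combinations(strokes.items(), 2):
        point = segment_intersection(seg_a, seg_b)
        if point is not None:
            nodes[point].update((name_a, name_b))
    return dict(nodes)


# Hypothetical, simplified layouts: three strokes forming a crossing and a
# T-junction, and four strokes that all meet at a single point.
three_strokes = {
    "top_horizontal": ((0.5, 1), (1.5, 1)),
    "vertical": ((1, 2), (1, 0)),
    "bottom_horizontal": ((0, 0), (2, 0)),
}
four_strokes = {
    "horizontal": ((0, 1), (2, 1)),
    "vertical": ((1, 2), (1, 0)),
    "left_falling": ((1, 1), (0, 0)),
    "right_falling": ((1, 1), (2, 0)),
}

for label, strokes in (("three strokes", three_strokes), ("four strokes", four_strokes)):
    nodes = extract_nodes(strokes)
    complexities = sorted(len(members) for members in nodes.values())
    print(label, "node count:", len(nodes), "node complexities:", complexities)
```

In this toy, the first layout yields two nodes of complexity 2, and the second yields a single node of complexity 4. That is the kind of contrast the node number and node complexity manipulations rely on, although the actual stimuli and node definitions are those given in the paper.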

If nodes provide guidance information for stroke segmentation, then the more nodes a character has, the better it should be recognized. Experiment 1 tested whether characters with more nodes have a recognition advantage, using a 2×2 within-subjects design with 76 single characters as materials. The materials comprised two groups of true characters (high node-count and low node-count) and two groups of fake characters (high node-count and low node-count). Each character was presented briefly (10, 20, 30, 40, 50, or 60 ms) and appeared once at each presentation time, in completely random order, so each participant completed 456 trials. Twenty-six participants took part. After viewing each character, participants reported whether it was a true or a fake character. If high-complexity nodes in a larger stroke space provide more information, then covering them should interfere more with character recognition. Experiment 2 tested whether characters whose high-complexity nodes are covered are harder to recognize, using a 2×2×4 within-subjects design with 160 compound characters, each with one node covered, as materials. The materials comprised four groups of true characters (covering a first node of high or low complexity, or a fifth node of high or low complexity) and four groups of fake characters under the same conditions. The procedure was the same as in Experiment 1 except for the presentation times (60, 70, 80, and 90 ms). Twenty-nine participants took part, and each completed 640 trials. Experiment 3 used a task similar to that of Experiment 2 and added two variables: component type and node generation method. The presentation time was 60 ms. The materials comprised eight groups of true characters and eight groups of fake characters under the same conditions, and each stimulus was presented once. Twenty-six participants took part. In all experiments, accuracy and reaction time (RT) for true characters were analyzed.
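The trial counts reported for Experiments 1 and 2 follow from fully crossing the stimuli with the presentation times (76 × 6 = 456 and 160 × 4 = 640). A minimal sketch of such a randomized schedule is shown below. The stimulus identifiers, the function name `build_trials`, and the seed are placeholders, since the abstract does not list the actual characters or the presentation software.

```python
import random
from itertools import product


def build_trials(stimuli, durations_ms, seed=None):
    """Pair every stimulus with every presentation time, then shuffle the order."""
    trials = [
        {"stimulus": stimulus, "duration_ms": duration}
        for stimulus, duration in product(stimuli, durations_ms)
    ]
    random.Random(seed).shuffle(trials)
    return trials


# Placeholder IDs standing in for the 76 and 160 characters used in the study.
exp1 = build_trials([f"char_{i:03d}" for i in range(76)], [10, 20, 30, 40, 50, 60], seed=1)
exp2 = build_trials([f"char_{i:03d}" for i in range(160)], [60, 70, 80, 90], seed=1)

print(len(exp1), len(exp2))  # 456 640
```

Each character appears exactly once at each presentation time, which matches the description of the design. Details such as blocking or practice trials are not given in the abstract and are not modeled here.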

The results showed that participants recognized characters with more nodes better (node number effect), and that covering high-complexity nodes significantly impaired performance (node complexity effect). In Experiment 1, characters with more nodes were recognized with higher accuracy and shorter reaction times. Repeated-measures ANOVAs on accuracy and RT found a significant main effect of the number of nodes and a significant interaction between the number of nodes and presentation time: the node number effect was more pronounced at presentation times of 40 ms and 50 ms. In Experiment 2, accuracy was lower for characters whose high-complexity nodes were covered. A repeated-measures ANOVA on accuracy found a significant main effect of node complexity and a significant interaction between node complexity and node order: the node complexity effect was more pronounced when the fifth node was covered. In Experiment 3, the main effect of node complexity was again significant, as was the interaction between node complexity and node generation method.

These findings support the view that nodes provide bottom-up guidance information for stroke separation. Stroke separation is performed in parallel: the more nodes a character has, the more information is available for stroke segmentation, and the easier the character is to recognize. Stroke separation begins with the extraction and analysis of nodes: the more complex a node is, the greater its impact on recognition. This study deepens the understanding of the early visual processing involved in Chinese character recognition and supports the view that Chinese character recognition is a generative reverse-inference process, which could contribute to the development of a complete cognitive model of Chinese character recognition.

Key words: handwritten Chinese character recognition, node, stroke, generative model
