ISSN 0439-755X
CN 11-1911/B
Sponsored by: Chinese Psychological Society
   Institute of Psychology, Chinese Academy of Sciences
Published by: Science Press

Acta Psychologica Sinica ›› 1982, Vol. 14 ›› Issue (4): 3-8.


Statistical Analysis of the Structure of Chinese Characters

Peng Rui-xiang (彭瑞祥)

  1. Institute of Psychology, Chinese Academy of Sciences
  • Published: 1982-12-25 Online: 1982-12-25
  • Corresponding author: Peng Rui-xiang

A PRELIMINARY REPORT ON STATISTICAL ANALYSIS OF THE STRUCTURE OF CHINESE CHARACTERS

Peng Rui-xiang Institute of Psychology, Academia Sinica   

  • Published:1982-12-25 Online:1982-12-25

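Summary: Three thousand printed Chinese characters from elementary school Chinese-language textbooks served as the statistical material. Each character was divided into four quadrants, and the shapes formed by the strokes in the upper-left quadrant and in the bottom-right quadrant were tallied separately; shapes that were identical or similar were grouped into one class, called a subpattern. The statistics show that upper-left subpatterns have a greater character-forming capacity than bottom-right subpatterns. However, upper-left subpatterns are more complex in shape: besides the strokes of the radical that constitutes the subpattern, they always carry other strokes. Bottom-right subpatterns are simpler in shape and contain no strokes other than their own. Comparing the two, when designing a multi-step automatic recognition system for Chinese characters, it is more advantageous to use the bottom-right subpatterns as the basis for the initial classification.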


Abstract: The structure of 3000 printed Chinese characters selected from the Chinese textbooks for elementary school was analysed. Each character was divided into four quadrants, i.e. upper-left (UL), upper-right (UR), bottom-left (BL) and bottom-right (BR). The shapes formed by the strokes in UL and in BR were examined separately; shapes that were the same or similar were assigned to one group and called subpatterns. The results showed that the subpatterns of BR were simpler than those of UL, and that the former had fewer junctions than the latter. The author argued that if the subpatterns of BR are used as group masks in the early stage of multi-stage matching, the computation time may be reduced.
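
The quadrant-based grouping described in the abstract can be illustrated with a minimal sketch. It is not the author's procedure: it assumes each character is available as a small binary bitmap, it treats only pixel-identical quadrants as one subpattern (the paper also merges similar shapes), and the names `quadrants`, `group_by_quadrant` and the toy glyphs in `TOY` are hypothetical.

```python
from collections import defaultdict

def quadrants(bitmap):
    """Split a binary bitmap (list of equal-length rows) into UL, UR, BL, BR."""
    h, w = len(bitmap), len(bitmap[0])
    mh, mw = h // 2, w // 2

    def crop(r0, r1, c0, c1):
        # Tuples are hashable, so a quadrant can serve as a dictionary key.
        return tuple(tuple(row[c0:c1]) for row in bitmap[r0:r1])

    return {
        "UL": crop(0, mh, 0, mw),
        "UR": crop(0, mh, mw, w),
        "BL": crop(mh, h, 0, mw),
        "BR": crop(mh, h, mw, w),
    }

def group_by_quadrant(chars, key="BR"):
    """Group character labels whose chosen quadrant is pixel-identical.

    Assumption: exact equality stands in for the paper's
    "same or similar" criterion.
    """
    groups = defaultdict(list)
    for label, bitmap in chars.items():
        groups[quadrants(bitmap)[key]].append(label)
    return list(groups.values())

# Hypothetical 4x4 toy glyphs, not real characters from the study.
TOY = {
    "A": [[1, 1, 0, 0], [1, 0, 0, 0], [0, 0, 1, 1], [0, 0, 0, 1]],
    "B": [[0, 1, 0, 0], [1, 1, 0, 0], [0, 0, 1, 1], [0, 0, 0, 1]],
    "C": [[1, 1, 0, 0], [1, 0, 0, 0], [0, 0, 1, 0], [0, 0, 1, 1]],
}

br_groups = group_by_quadrant(TOY, "BR")  # first-stage groups keyed on BR
ul_groups = group_by_quadrant(TOY, "UL")  # the same grouping keyed on UL
```

In a multi-stage matcher of the kind the abstract mentions, an unknown character would first be compared against one mask per group and only then against the members of the matching group, which is where a reduction in computation time could come from.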
