ISSN 0439-755X
CN 11-1911/B
Sponsors: Chinese Psychological Society
   Institute of Psychology, Chinese Academy of Sciences
Publisher: Science Press

Acta Psychologica Sinica ›› 1994, Vol. 26 ›› Issue (2): 147-152.


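Establishing a Database of Chinese Character Component Information: A Statistical Analysis of the Frequencies of Components and Component Combinations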

Han Buxin

  1. Institute of Psychology, Chinese Academy of Sciences
  • Release date: 1994-06-25 Publication date: 1994-06-25
  • Corresponding author: Han Buxin

DEVELOPMENT OF A DATABASE OF CHINESE CONSTITUENT INFORMATION: STATISTICAL ANALYSIS OF THE FREQUENCY OF THE CONSTITUENTS AND THEIR COMBINATIONS

Han Buxin (Institute of Psychology, Chinese Academy of Sciences, Beijing, 100012)

  • Online: 1994-06-25 Published: 1994-06-25

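Abstract: Using FoxBASE, frequency information was computed for the components and component combinations in the set of 6763 basic Chinese characters, and a "component database" and a "component combination database" were built. The former contains 567 components; the latter contains the 7583 two-component combinations that actually occur in Chinese characters. The statistics show that both components and component combinations follow a skewed distribution, with the great majority having very low frequencies. These two databases can be applied to research on the relation between whole and part in Chinese character cognition and on the learning and memory of Chinese characters, and can also serve as a reference for quantitative studies of Chinese characters and for research on computer processing of Chinese information.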

Keywords: component, component combination, number of characters formed, frequency, database

Abstract: Frequency parameters of Chinese character constituents and their combinations in GB2312-80 were computed using FoxBASE. "The Database of Character Constituents" and "The Database of Character Constituent Combinations" were produced as the result. The former consists of 576 character constituents; the latter consists of 7583 two-constituent combinations. Every constituent or combination has two attributes: one is the number of Chinese characters formed with that constituent or combination, the other is its frequency. The character constituents and their combinations show a similar pattern of uneven distribution, and most of them have low frequencies. These two databases can be applied in experimental research on Chinese character cognition, learning, and memory; they can also be used in the quantitative analysis of Chinese characters and in computer processing of Chinese information.

Key words: Chinese character constituent, combination of Chinese character constituents, number of Chinese characters formed by character constituents or their combinations, frequency, database
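
The two attributes named in the abstract (the number of characters a constituent or combination forms, and its frequency) can be illustrated with a minimal Python sketch. This is not the original FoxBASE procedure. The decomposition table `DECOMPOSITIONS` is a hypothetical toy sample, the function names are illustrative, a "two-constituent combination" is assumed to mean any unordered pair of distinct constituents occurring in the same character, and "frequency" is assumed to mean the share of characters in the set that contain the item. The paper may define either differently.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy decomposition table (character -> constituents).
# The study itself covered the 6763 characters of GB2312-80.
DECOMPOSITIONS = {
    "好": ["女", "子"],
    "妈": ["女", "马"],
    "吗": ["口", "马"],
    "字": ["宀", "子"],
    "安": ["宀", "女"],
}


def component_stats(decompositions):
    """Count, for each constituent and each constituent pair,
    how many characters in the set contain it."""
    component_counts = Counter()
    pair_counts = Counter()
    for components in decompositions.values():
        # Count each constituent once per character; sort so that
        # every unordered pair has a single canonical key.
        unique = sorted(set(components))
        component_counts.update(unique)
        pair_counts.update(combinations(unique, 2))
    return component_counts, pair_counts


def relative_frequency(counts, total_characters):
    """Assumed definition: share of characters containing the item."""
    return {key: n / total_characters for key, n in counts.items()}


component_counts, pair_counts = component_stats(DECOMPOSITIONS)
component_freq = relative_frequency(component_counts, len(DECOMPOSITIONS))

# List constituents from most to least productive.
for component, n in component_counts.most_common():
    print(component, n, component_freq[component])
```

Even in this toy sample one constituent accounts for most of the character formations while every pair occurs only once, which is the kind of uneven pattern the abstract reports for the full databases.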