Download as pdf or txt
Download as pdf or txt
You are on page 1of 19

Behavior Research Methods, Instruments, & Computers

2000, 32 (3), 482-500

A Japanese logographic character


frequency list for cognitive science research
NOBUKO CHIKAMATSU
DePaul University, Chicago, Illinois

SHarCHI YOKOYAMA
National Language Research Institute ofJapan, Tokyo, Japan

HIRONARI NOZAKI
Aichi University ofEducation, Kariya, Japan

ERIC LONG
National Language Research Institute ofJapan, Tokyo, Japan
and
SACHIO FUKUDA
Yokohama National University, Yokohama, Japan

This paper describes a Japanese logographic character (kanji) frequency list, which is based on an
analysis of the largest recently available corpus of Japanese words and characters. This corpus com-
prised a full year of morning and evening editions of a major newspaper, containing more than 23 mil-
lion kanji characters and more than 4,000different kanji characters. This paper lists the 3,000most fre-
quent kanji characters, as well as an analysis of kanji usage and correlations between the present list
and previous Japanese frequency lists. The authors believe that the present list will help researchers
more accurately and efficiently control the selection of kanji characters in cognitive science research
and interpret related psycholinguistic data.

In many empirical psycholinguistic studies, word fre- In the past, compiling linguistic corpora was an ex-
quency is used as an independent variable to select ma- tremely labor-intensive process, plagued by reliability
terials having desired frequency characteristics or as a concerns caused by human error. However, as computer
control variable to match two or more sets of materials in technology continues to develop, researchers are obtain-
order to minimize performance differences attributable ing more reliable linguistic corpora and compiling word
to word frequency effects in word recognition, memory, or character frequency lists on the basis of these corpora
or retrieval performance. It is sometimes important in for linguistic or cognitive science research. For American
psycholinguistic research to focus on the frequency effects English, some widely used word frequency lists are the
oflinguistic units smaller than words, such as letter clus- Brown corpus (Kucera & Francis, 1967), the American
ters (e.g., bigrams or trigrams), syllabic-type units (sylla- Heritage Word Frequency Book (Carroll, Davies, & Rich-
ble vs. nonsyllable), morpheme units, as well as position man, 1971), and the Thorndike-Lorge count (Thorndike
(i.e., word-initial, word-middle, or word-final positions; & Lorge, 1944; see the summary in Solso, Juel, & Rubin,
Appleman & Mayzner, 1981; Grainger & Jacobs, 1993; 1982). Many of these corpora and lists are available in
Srinivas, Roediger, & Rajaram, 1992). For example, logo- computer database form and/or through the Internet. Con-
graphic character frequency is a crucial factor to consider sequently, researchers may use these corpora and lists to
in word experiments using languages with logographic control word frequency in empirical language research
scripts, such as Chinese, Japanese, or Korean, where each more easily, efficiently, and accurately than in the past.
logographic character may function as a word (Matsunaga, Corpora for non-English languages, including Japanese,
1996). In short, it is important to carefully control the fre- remain limited in number or are still under development
quency of printed characters and/or words when empiri- (Edwards, 1993; Leech & Fligelstone, 1992).
cal psycho linguistic studies are conducted.
Word and Character
Frequency Lists in Japanese
The authors thank the reviewers of the paper for their valuable sug- Over the last two decades, researchers in the area of
gestions. Correspondence concerning this article may be sent to
N. Chikamatsu, Department of Modern Languages, DePaul University,
experimental psychology, especially word recognition and
802 West Belden Avenue, Chicago, IL 60614 (e-mail: nchikama@ memory, have increasingly focused on the Japanese lan-
condor.depaul.edu). guage, owing to the uniqueness ofits writing system (Kess

Copyright 2000 Psychonomic Society, Inc. 482


JAPANESE CHARACTER FREQUENCY LIST 483

& Miyamoto, 1994; Paradis, Hagiwara, & Hildebrandt, of 83 hiragana and 86 katakana forms encoded in JIS and
1985; Yokoyama, 1997). In particular, kanji (characters EUe. Hiragana and katakana share the same syllabic-
in a logographic script that is one of three scripts used in sound representation and can be transcribed one by the
writing Japanese) has been widely used in experimental other (e.g., a syllable I sa I is transcribed as ~ in hiragana
materials in order to examine new aspects of cognitive and '!t in katakana). The third Japanese orthography,
processing, relative to alphabetic language systems, in kanji, is a logographic script adopted from the Chinese
word recognition and memory and hemispheric involve- language, in which each symbol represents meaning and
ment in the acquisition and usage oflanguage. However,
although many studies have been conducted that use
Japanese words, the development of Japanese word fre- * B*
part ofa word (e.g., in
*
functions as a morpheme. A single kanji character may
represent an independent word (e.g., I hon I, book) or
Inihon/,Japan). The mean-
quency lists or kanji character frequency lists has not kept ing of each constituent (i.e., a single character) in a kanji
up with the demand for such lists. As a result, for exam- word is sometimes less clear or transparent than that of an
ple, the kanji character or word frequency of selected independent word. Owing to the manner in which kanji
kanji items has often not been controlled or mentioned in characters were transferred from the Chinese to the Japa-
Japanese word recognition studies when frequency has not nese language over the centuries, a single kanji charac-
been used as a dependent variable (e.g., Eko & Nakamizo, ter may have obtained more than one reading and may
1989; Flores d'Arcais & Saito, 1993; Flores d'Arcais, be pronounced in several different ways. For instance,
Saito, Kawakami, & Masuda, 1994; Kikuchi, 1996; Mor- the character Wi, which means head, is read as Ito/,
ton, Sasanuma, Patterson, & Sakuma, 1992; Nagae, 1994; Idol, Izul, Iju/, /saki/, latama/, Ikashira/, Ikobe/,
Naito & Komatsu, 1989; Osaka, 1992; Sekiguchi & Abe, I kaburi I, and /tsumuri/ (Coulmas, 1989). Furthermore,
1992; Wang, 1988; Yokosawa & Shimomura, 1993). In a great number of homophones (i.e., kanji characters that
many other studies, the frequency of kanji characters or
words is controlled on the basis of(1) the researcher's sub-
jective, intuitive judgment (e.g., Flores d' Arcais, Saito,
share a common pronunciation but represent different
meanings) occur in Japanese kanji usage. For instance,
(tree), ~ (feeling), • (chance), 1tl(toglow), WI (period),
*
& Kawakami, 1995; Hatta, Koike, & Langman, 1994; and many others are all pronounced Ikil. Thus, in con-
Shimomura & Yokosawa, 1991), (2) the examinee'sjudg- trast to both hiragana and katakana, kanji characters do
ment, such as subjects' rating on selected items (e.g., Ya- not have a systematic sound representation or a one-to-
mada, Mitarai, & Yoshida, 1991), (3) the categorization one relationship between sound and symbol. The number
of kanji characters standardized by the Japanese Min- of kanji characters is quite large and practically uncount-
istry of Education (e.g., Kyoiku kanji or Gakushu kanji 1; able (i.e., kanji dictionaries may contain between 12,000
see, e.g., Hayashi, 1988; Hirose, 1992; Nakagawa, 1994; and 50,000 entries; Kindaichi, 1991; Morohashi, 1989).
Sakuma, Itoh, & Sasanuma, 1989), (4) lists compiled by Among hiragana, katakana, and kanji, usually only one
examiners themselves (e.g., Wydell, Butterworth, & Pat- is conventionally chosen and used to write a given Japa-
terson, 1995; Wydell, Patterson, & Humphreys, 1993), or nese word. Hiragana is used primarily for words that have
(5) the National Language Research Institute's (NLRI's) a grammatical function, such as particles or case-makers,
1962 or 1976 word/character frequency lists (Cabeza, and for some native Japanese content words. Katakana is
1995; Morikawa, 1985; Naito & Komatsu, 1988; Sasa- used for loan words (i.e., words mainly borrowed from
numa, Sakuma, & Kitano, 1992; Tsuzuki, 1993). western languages, such as English, French, and Por-
One of the main impediments to the development of tuguese). Kanji is used for content words, such as nouns,
Japanese word frequency lists is that the electronic rep- verbs, and adjectives.? Thus, a single Japanese sentence
resentation of Japanese characters is more complicated is usually written with all three scripts combined.
than that of alphabetical languages. At present, there are However, choice of script is not always consistent and
three primary electronic coding systems for Japanese may vary, depending on a writer's intention or a publish-
characters (i.e., kana and kanji): Japanese Industrial er's guidelines for style. For instance, the Japanese word
Standards (JIS), Shift-JIS (SJIS), and Extended Unix Code meaning egg, pronounced Itamago I, could be written as
(EUC). Generally, EUC is used in Unix workstations on 'k"iJ;.Z:: in hiragana in one context, but as j~ in kanji in
the Internet, whereas JIS is used for Japanese electronic another context. In their study ofthe subjectivefrequency
mail. However, SJIS has been adopted for use with per- ofscript type, Ukita, Sugishima, Minagawa, Inoue, and
sonal computers. Consequently, if different coding sys- Kashu (1996) studied 750 Japanese words that can be
tems are used across tasks or methods, one must transfer written in more than one Japanese script. The study was
one character code to another, using a converter such as conducted by asking 836 Japanese college students to
the network kanji code conversion filter (NKF). judge whether a given word (written in a given script) is
Another factor impeding the development of Japanese seen often, occasionally, or rarely. The results showed
frequency lists is the Japanese writing system itself, that more than half of the tested words were identified as
which comprises three types of orthographies-hira- words seen in more than one script. This inconsistency in
gana, katakana, and kanji. Hiragana and katakana are orthographic representation makes word counting in Japa-
syllabic scripts in which each symbol represents a sound nese extremely difficult and complicated. In addition,
unit (a syllable). These scripts each contain 46 basic forms, word segmentation in Japanese is more complex than that
with additional diacritic and historical forms giving a total in English, since word boundaries are not separated with
484 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

spaces in written texts. A single kanji character could be through the Internet. In consideration of the word seg-
a morpheme of a part of a word or a word by itselfand may mentation and other problems mentioned above, the au-
be pronounced differently, depending on the context. With- thors decided to start by developing frequency lists ofkanji
out clear word boundaries, compound words are easily characters before attempting kanji word frequency lists.
formed. Thus, complications in word counting and seg- Furthermore, the current usage of kanji in printed mass
mentation present nontrivial challenges for those compil- media is analyzed on the basis of a comparison with 1966
ing word frequency lists in Japanese. frequencies from the NLRI 1976 list.
Owing to these problems, few have attempted to make
word and kanji character frequency lists in Japanese. The Source of Data, Method of Analysis, and Results
NLRI in Japan published a word frequency list in 1962, The present corpus is available on a CD-ROM entitled
based on a corpus derived from 90 different journals and CD-HIASK'93 (Nichigai Associates & Asahi, 1994),
magazines with five different genres, all published in which contains the text of articles appearing in a major
1956. A total of 140 million tokens, consisting of 40,000 newspaper, covering 1 year of morning and evening edi-
different words (i.e., types), were analyzed in order to de- tions published in 1993. The data analysis was conducted
velop a frequency list of words possessing a frequency of on Unix workstations, with the program written by the
at least nine. In 1976, the NLRI also published a kanji authors in Perl and awk. First, headlines were excluded-
(character) frequency list based on a corpus published in from the approximately 110,000 articles> in the data. Sec-
1966 derived from three major newspapers, Asahi, Yomi- ond, all the printable characters were counted and ranked
uri, and Mainichi. This corpus provided a total of by frequency from highest to lowest in each category (e.g.,
991,375 kanji tokens and a frequency list of 3,213 dif- kanji, hiragana, or katakana). Frequency ratio (%) and cu-
ferent kanji characters. This was the first attempt to an- mulative frequency ratio (%) were also calculated.
alyze a Japanese corpus with computers, and the results The corpus provided a total of 56,563,595 printable
were used to standardize and regulate the use of kanji characters,v making it, at the time, the largest corpus
characters for mass media and education in Japan. For used for the compilation of Japanese word/character fre-
the past three decades, researchers have used these lists as quency lists. Ideally, in addition to newspapers, corpora
an informative resource for many language-related re- would be collected from other printed texts, such as mag-
search projects. In 1997, the NLRI published its 1962 list azines and novels of various genres. However, most of
in floppy disk format (NLRI, 1997). these printed materials are not yet available in computer-
However, several problems are associated with the use readable form. As a result, it is not currently feasible to
of the NLRI lists for empirical studies. First, the lists are gather sufficient amounts of data from these materials.
based on dated media samples. Almost three or four Furthermore, copyright issues complicate the process of
decades have passed since the corpora of the 1962 and copying and studying printed mass media in Japan and
1976 lists were collected. Consequently, the reliability of may pose an obstacle to making lists based on such ma-
these lists is open to question, since the use of words or terials freely available to the public.
kanji in mass media and education may have changed. The 56,563,595 characters in the 1993 collection ofthe
Second, the 1976 list does not identify low-frequency Asahi newspaper form the basis of the present character
kanji. The 1962 word list does not contain words possess- frequency list and break down into the general categories
ing a frequency less than nine, and the 1976 kanji list does shown in Table I. A total of23,408,236 kanji tokens were
not provide characters possessing a frequency less than found, making up 4,476 different types. Kanji covered
nine. Low-frequency words or characters with the fre- 41% of all the printable characters in the newspapers.
quency of one are required for many empirical language The central result of this survey is a list ofthe kanji char-
studies. Third, the lists are not easily accessed, since acters found, sorted according to their frequencies. The
both were available only in hard copy form until recently. first 3,000 characters are listed in Appendix A with their
Although the NLRI 1962 list is now available in floppy ranking, raw frequency, frequency ratio (%), and cumu-
disk format' it is not as accessible, especially to re- lative frequency ratio (%). In addition, frequency lists of
searchers outside Japan, as other language lists that are hiragana and katakana characters are also included in Ap-
available over the Internet. It is crucial that these lists be pendices Band C,7 respectively, with the same headings.
accessible to researchers in computer database format to Cumulative percentage of kanji character use. The
help make the control and selection ofword and kanji fre- cumulative frequency ratio of kanji characters, ranked
quency simpler, more effective, and more accurate. from high to low frequency, is shown in Table 2. Although
In 1994, the situation for Japanese corpus linguistics it has been conventionally said among learners and
changed for the better, when CD-ROM databases ofnews- teachers of Japanese that knowledge of 2,000~3,000
paper articles became available at a relatively low cost. kanji characters is required for one to read Japanese news-
These computerized corpora of newspapers made it pos- papers, this conventional wisdom was not supported by
sible to develop an updated kanji frequency list accessible the present data. According to the results of the present
on computers and over the Internet. analysis, the top 500 most frequent kanji characters ac-
The purpose ofthe present project is to develop a new counted for approximately 80% oftotal kanj i use. Further-
kanji character frequency list to be made accessible more, the top 1,600 most frequent characters covered 99%
JAPANESE CHARACTER FREQUENCY LIST 485

Table 1 developed from a single newspaper. Although the news-


Printable Character Type Totals paper used in the present project is a major Japanese news-
Characters Types Tokens Frequency Ratio (%J paper that currently publishes 10 million copies every
Kanji 4,476 23,408,236 41.38 day for 120 million people in Japan (one paper is usually
Hiragana 83 20,711,361 36.62 read by more than one person), there may be some bias
Katakana 86 3,608,288 6.38
in terms of the selection of topics and style. Goto (1995)
Punctuation and symbols 99 7,406,035 13.09
Arabic numerals 10 1,169,902 2.07 warned ofthis problem for researchers using a corpus de-
Latin alphabet 52 259,773 0.46 rived from newspapers, arguing that newspapers usually
focus heavily on politics, economics, and foreign policy.
As a result, vocabulary related to those areas may appear
of total use and the rest-that is, the next 3,000 charac- in newspapers more often than in other types ofpublished
ters-made up only 1% of the total kanji use. materials. In addition, each newspaper has its own regu-
The cumulative frequency ratio obtained from the lations or standards for choosing words and expressions.
1966 corpus (i.e., the NLRI 1976 list) is also indicated in Furthermore, the number of newspaper writers, compared
Table 2. The results from the two corpora, which are with the number of readers, is limited, and consequently,
30 years apart, indicate almost identical ratios for each the effects of idiosyncratic writing styles and word selec-
frequency level. Furthermore, according to the current tion may be exaggerated, causing a bias in any analysis
frequency list, of the 500 most frequent characters ranked based exclusively on such data. Since the present char-
in 1966, 445 characters are also ranked among the 500 acter list was made from a corpus of newspapers, future
most frequent characters in the 1993 corpus, but all of research should be aimed at employing a variety of genres
those 55 characters lower than the first 500 fall within the and types of publications to develop linguistic corpora.
1,000 most frequent characters in the 1993 corpus. Thus, To examine this issue of the bias in the present news-
the high-frequency kanji characters that account for over paper corpus, a correlation analysis ofkanji frequency was
80% of the total kanji use have not changed much in the conducted among the present list developed with 1993
last 30 years. newspapers, the NLRI 1962 list with 1956 magazines,
Correlation coefficient of chronological character and the NLRI 1976 list with 1966 newspapers. The re-
use. To examine change in kanji usage over the past 30 sults indicated a significant and high correlation be-
years, a correlation analysis was conducted between the tween the 1956 magazine list and the 1966 newspaper
present 1993 list and the NLRI 1976 list based on the 1966 list (r = .92,p < .01), and between the 1956 magazine list
newspaper corpus. The raw frequencies of the 3,000 and the 1993 newspaperlist, (r= .89,p < .01; see Table 3).
highest frequency characters in the present list'' and the The higher correlation between the 1956 and 1966 lists
equivalent characters in the NLRI 1976 list were con- was probably due to the closer timing of data collection,
verted to their base-1 0 logarithmic equivalents? and sub- compared with that between the 1956 and 1993 lists.
mitted to a Pearson correlation coefficient analysis. The However, these high correlation results may indicate that
result indicates a high correlation (r = .95) and was sig- the data collected from a single newspaper in this present
nificant [F(l,3029) = 28,037.67,p < .01]. In Figure 1, analysis may be a reliable resource for predicting kanji
although low-frequency characters scatter more than high- character frequency in printed mass media in general.
frequency characters, the distribution is close to linear.t? To examine the reliability of the present list, an addi-
Thus, on the basis ofthe analysis, it appears that the over- tional correlation analysis was conducted with the kanji
all pattern of kanji usage has not significantly changed characters grouped according to frequency level. The
over the past 30 years. 3,000 ranked characters were divided into three groups:
(l) high-frequency characters ranked from 1 through
Discussion 1,000, (2) mid-frequency characters ranked from 1,000
In the present project, the authors introduced the first
Table 2
computer-based kanji character frequency lists derived Cumulative Frequency Ratio of Kanji Character Use
from the largest corpus of Japanese newspaper texts. It
Cumulative Frequency Ratio (%J
is important to control the frequency of kanji characters
Frequency Rank 1993 1966
in psycholinguistic studies, although researchers have
often neglected this variable, owing to the unavailability 10 10.00 10.61
50 27.41 27.64
of reliable frequency lists. Using the present list, the au- 100 40.71 40.15
thors have compared kanji usage observed in data from 200 57.02 56.05
1966 and 1993. Although changes in language usage are 500 80.68 79.42
often discussed in Japanese linguistic studies in terms of 1,000 94.56 93.88
1,500 98.63 98.40
syntax, phonology, lexicography, and pragmatics, the 2,000 99.72 99.63
usage of kanji characters did not show any significant 2,500 99.92 99.90
variation over the past 30 years. 3,000 99.97 99.98
However, there are several issues left to discuss in the Note-Frequency rank ~ size of the group of highest frequency char-
present study. First, the corpus ofthe present word list was acters.
486 CHlKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

s
.....,
,.-..
.--
to 4
cQ)
:::;
zr +
....
Q)
+ +
-.-
S 3 +
.+. .,.
++

* +!J *
...

+..
..J.
+

*.
....
(Q

'-"'
0
0'1
* dpl
+ + ++

0
Co
(Y)
2 ~ ::: ·
III ++:~:+ +
+ +

!'t>*1tt
(J)
(J) .,...
.-- .~ *i
~

~. ..: +++ ~+

O:}----+----+----+----+------I
o 2 3 4 s
1966 [log 10 (r aw frequency+ 1)]
Figure 1. Kanji usage correlation distributions of top 3,000 characters between 1966
and 1993.

though 2,000, and (3) low-frequency characters ranked The size of the 1993 kanji corpus was much greater than
from 2,001 through 3,000. Although the correlations vary, the other two, containing 23,408,236 tokens and 4,476
depending on the levels and lists, all were significant at types. The 1966 list was compiled on the basis of 99 1,375
the p < .01 level (see Table 4). kanji tokens and 3,213 types, and the 1956 list was based
The results indicate high correlations at the high- on 280,094 kanji tokens and 3,328 types. Since the pres-
frequency levels between the present list and the other ent correlation analysis was conducted with the 3,000
two lists. Notably, a high correlation of the 1993 news- highest ranked characters in the 1993 list, some charac-
paper list with the 1966 newspaper list at the high- ters did not even exits in the 1966 or 1956 list. For in-
frequency level (r = .92) indicates again that the present stance, among the 38 characters ranked at 2,996 in the
list can be a reliable resource, especially at the high- 1993 list, 17 characters were not included in the 1956
frequency levels, although the data were collected from a list, and 23 characters were not included in the 1966 list.
single newspaper. Thus, the size difference in the corpus seems to be the
In contrast, at the low-frequency levels, the correla- most probable reason for the low correlation between the
tions of the present list are low with the 1966 list (r = .43) lists. If that is the case, the largest corpus could be the
and with the 1956 list (r = .28). This may imply a drastic most reliable information for low-frequency characters-
change of kanji usage over the decades or some unrelia- that is, the present list can be plausibly used as the most
bility of the present list at the low-frequency level. The
first interpretation is easily dismissed, considering the Table 3
overall high correlations with other lists, as indicated in Correlation of Kanji Frequency Among 1993, 1966,
Table 3, and the similar low correlation between the 1966 and 1956 Lists With All 3,000 Ranked Characters
and 1956 lists at low-frequency level (r = .40), which 1993 N 1966 N 1956 M
were developed relatively close in time. Does this mean List r r r
that the present list is not reliable at the low-frequency 1993 N 1.00
list? That does not have to be the conclusion. The most 1966 N .95 1.00
likely interpretation is that the low correlations are due 1956M .89 .92 1.00
to the differences in size of the corpora of the three lists. Note-N, newspapers; M, magazines.
JAPANESE CHARACTER FREQUENCY LIST 487

Table 4 sources for language researchers. In 1. A. Edwards & M. D. Lampert


Correlation of Kanji Frequency Among 1993, (Eds.), Talking data: Transcription and coding in discourse research
1966, and 1956 Lists With High-, Mid-, (pp. 263-310). Hillsdale, NJ: Erlbaum.
and Low-Frequency Ranked Characters EKO,R., & NAKAMIZO, S. (1989). Coded representations of kanji, kana
1966N 1956M and figures. Japanese Journal ofPsychology, 60, 265-268.
FLORES D'ARCAIS, G. B., & SAITO, H. (1993). Lexical decomposition of
List Frequency r r complex kanji characters in Japanese readers. Psychological Re-
1993N high .92 .80 search, 55, 52-63.
mid .68 .55 FLORES D'ARCAIS, G. B., SAITO, H., & KAWAKAMI, M. (1995). Phono-
low .43 .28 logical and semantic activation in reading kanji characters. Journal
1966N high 1.00 .86 of Experimental Psychology: Learning, Memory, & Cognition, 21,
mid 1.00 .60 34-42 .
low 1.00 .40 FLORES D'ARCAIS, G. B., SAITO, H., KAWAKAMI, M., & MASUDA, H.
Note-N, newspapers; M, magazines. High = characters ranked from (1994). Figural and phonological effects in radical migration with
1st through 1,000th. Mid = characters ranked from 1,001st through kanji characters. Advances in the Study of Chinese Language Pro-
2,000th. Low = characters ranked from 2,001st through 3,000th. cessing, 1, 241-254.
GOTO, H. (1995). Gengo kenkyu no tame no data toshite no corpus no
gainen ni tsuite [Corpus for linguistic research]. Tohoku University
Linguistics Journal, 4, 71-87.
reliable resource. Thus, the correlation analysis among GRAINGER, J., & JACOBS, A. M. (1993). Masked partial-word priming in
visual word recognition: Effects of positional letter frequency. Jour-
the three lists can support the high reliability of the pres- nal ofExperimental Psychology: Human Perception & Performance,
ent list. 19,951-964.
Another issue relating to the present project is the re- HATTA, T, KOIKE, M., & LANGMAN, P. (1994). Laterality of mental im-
liability of a computerized Japanese corpus (i.e., CD- agery generation and operation: Testing with brain-damaged patients
and normal adults. Journal of Clinical & Experimental Neuropsy-
HIASK'93), transformed from the original hard copy
chology, 16,577-588.
form (i.e., the Asahi newspaper). There are few instances HAYASHI, R. (1988). The role of semantic attributes of the distractor
in which kanji characters appearing in the newspaper did word in the script type effect on Stroop color-word interference task.
not appear encoded as kanji in the current CD-ROM cor- Japanese Journal ofPsychology, 59, 1-8.
pus. The original locations of those characters in the HIROSE, H. (1992). An investigation of the recognition process for
jukugo by use of priming paradigms. Japanese Journal ofPsychol-
newspaper were marked by symbol characters, such as ogy, 63, 303-309.
the "geta mark" (=) in the electronic format (Yokoyama, KESS, J., & MIYAMOTO, T. (1994). Japanese psycholinguistics: A clas-
Sasahara, Nozaki, & Long, 1998). This is due to several sified and annotated research bibliography. Amsterdam: Benjamins.
revisions made in JIS, 11 which is an electronic character KIKUCHI, T (1996). Detection of kanji words in a rapid serial visual
presentation task. Journal ofExperimental Psychology: Human Per-
encoding standard and assigns a code to approximately ception & Performance, 22, 332-341.
6,300 kanji characters. Although only a few hundred of KINDAICHI, K. (1991). Shimeikai kokugojiten [New Japanese word dic-
these cases were observed out of23 million kanji tokens tionary]. Tokyo: Sanseido.
in the current CD-ROM corpus and these were among KUCERA, H., & FRANCIS, W. N. (1967). Computational analysis ofpresent-
the lowest frequent characters, this inconsistency between day American English. Providence RI: Brown University Press.
LEECH, G., & FLlGELSTONE, S. (1992). Computers and corpus analysis.
printed and electronic data formats should be kept in mind In C. S. Butler (Ed.), Computers and written texts (pp. 115-140). Ox-
when low-frequency characters are used or focused on. ford: Blackwell.
In the present project, a new kanji character frequency MATSUNAGA, S. (1996). The linguistic nature of kanji reexamined: Do
list was developed with the largest newspapers corpus in kanji represent only meaning? Journal ofthe Association of Teach-
ing ofJapanese, 30, 1-22.
Japanese corpus linguistic history. Although there are MORIKAWA, Y. (1985). Stroop phenomena in the Japanese language:
still some issues left, the present list provides useful in- II. Effects of character-usage frequency and number of strokes. In
formation on kanji usage in the current printed media. H. S. R. Kao & R. Hoosain (Eds.), Linguistics, psychology & Chinese
Furthermore, the present list has become available and language (pp. 73-80). Hong Kong: University of Hong Kong Centre
accessible in computer database over the Internet, 12 which of Asian Studies.
MOROHASHI, T (1989). Dai kanwajiten [Japanese kanji character dic-
helps researchers control and interpret logographic psy- tionary]. Tokyo: Taishuu-kan.
cholinguistic studies more accurately and efficiently. MORTON, J., SASANUMA, S., PATTERSON, K., & SAKUMA, N. (1992). The
organization of the lexicon in Japanese: Single and compound kanji.
REFERENCES British Journal ofPsychology, 83, 517-531.
NAGAE, S. (1994). Semantic processing functions of kanji and kana
ApPLEMAN, I. B., & MAYZNER, M. S. (1981). The letter-frequency ef- words in the right hemisphere. Japanese Journal of Psychology, 65,
fect and the generality offamiliarity effects on perception. Perception 144-149.
& Psychophysics, 30, 436-446. NAITO, M., & KOMATSU, S. (1988). Attributes of memory that mediate
CABEZA, R. (1995). Investigating the mixture and subdivision of per- priming effects in perceptual identification. Japanese Journal ofPsy-
ceptual and conceptual processing in Japanese memory tests. Mem- chology, 58, 352-358.
ory & Cognition, 23, 155-165. NAITO, M., & KOMATSU, S. (1989). Effects of conceptually driven pro-
CARROLL, J. B., DAVIES, P., & RICHMAN, B. (1971). The American Her- cessing on perceptual identification. Japanese Psychological Re-
itage wordfrequency book. New York: American Heritage. search, 31, 45-56.
COULMAS, F. (1989). The writing systems ofthe world. Oxford: Black- NAKAGAWA, A. (1994). Visual and semantic processing in reading kanji.
well. Journal of Experimental Psychology: Human Perception & Perfor-
EDWARDS, J. A. (1993). Survey of electronic corpora and related re- mance, 20, 864-875.
488 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

NATIONAL LANGUAGE RESEARCH INSTITUTE OFJAPAN (1962). Gendai YOKOSAWA, K., & SHlMOMURA, M. (1993). On the role of stimulus sim-
zasshi 90shi no yogo yoji [The total vocabulary and their written ilarity and segmentation in misprint detection. In D. Brogan, A. Gale,
forms in ninety magazines of today]. Tokyo: Shuuei-sha. & K. Carr (Eds.), Visual search 2 (pp. 371-378). London: Taylor &
NATIONAL LANGUAGE RESEARCH INSTITUTE OFJAPAN (1976). A study Francis.
ofthe use ofChinese characters in modern newspapers (The National YOKOYAMA, S. (1997). Hyoki to kioku [Orthography and free recall]
Language Research Institute Report 56). Tokyo: Shuuei Shuppan. (Japanese Psychological Monographs 26). Tokyo: Japanese Psycho-
NATIONAL LANGUAGE RESEARCH INSTITUTE OFJAPAN (1997). Gendai logical Association.
zasshi 90shi no yogo yoji: FD format [The total vocabulary and their YOKOYAMA, S., SASAHARA, H., NOZAKI, H., & LONG, E. (1998). Shin-
written forms in ninety magazines of today]. Tokyo: Sanseido. bun denshi media no kanji [A study of kanji in electronic newspaper
NICHIGAI ASSOCIATES, & AsAHI [NEWSPAPER] (1994). CD-HIASK'93. media]. Tokyo: Sanseido.
Tokyo: Kinokuniya.
NOZAKI, H., CHIKAMATSU, N., & YOKOYAMA, S. (1997). Compiling NOTES
katakanafrequency listsfrom Japanese newspaper corpus. Unpub-
lished manuscript. Aichi University of Education, Nagoya. I. The set of Kyoiku kanji contains 881 characters to be taught in el-
OSAKA, M. (1992). Effect of memory set-size upon event related po- ementary schools. These characters were selected by the Ministry of
tentials for concrete and abstract kanji stimuli. Perceptual & Motor Education in 1948. This set was revised, and 1,006 characters were se-
Skills, 75, 401-402. lected and defined as Gakushu kanji for the same purpose in 1989.
PARADIS, M., HAGIWARA, H., & HILDEBRANDT, N. (1985). Neurolinguis- 2. Verbs, adjectives, and adverbs are often written in a combination
tic aspects ofthe Japanese writing system. New York: Academic Press. of kanji and kana characters. The stem of those words, which does not
SAKUMA, N., ITOH, M., & SASANUMA, S. (1989). Recognition units of change in conjugation, is written in kanji, and the inflectional endings
kanji words: Priming effects on kanji recognition. Japanese Journal in hiragana. For instance, 1t""oQ Ita-berul(to eat), *~L' loo-kiil
ofPsychology, 60, 1-8. (big), and !!!< Ihaya-kul (early) are all written with the first character
SASANUMA, S., SAKUMA, N., & KITANO, K. (1992). Reading kanji with- in kanji and the rest in hiragana.
out semantics: Evidence from a longitudinal study of dementia. Cog- 3. Owing to Japanese copyright ownership issues, it may not be fea-
nitive Neuropsychology, 9, 465-486. sible to provide the 1976 NLRI kanji list in computer database form in
SEKIGUCHl, H., & ABE, I. (1992). Functional hemisphere differences in the near future.
recognition of words expressed in kanji. Japanese Journal ofEduca- 4. Headlines were excluded from the present analysis because incon-
tional Psychology, 40, 3 I 5-322. sistencies were observed in headlines in hard copy versions ofthe news-
SHIMOMURA, M., & YOKOSAWA, K. (1991). Processing of kanji and kana papers, as compared with headlines downloaded from the CD-ROM.
characters within Japanese words. Perception & Psychophysics, 50, 5. In addition to the articles downloaded from CD-ROM, 114 articles
19-27. were input manually into the database, since they were not included in
SOLSO, R. L., JUEL, C., & RUBIN, D. C. (1982). The frequency and ver- the CD-ROM.
satility of initial and terminal letters in English words. Journal of Ver- 6. These printable characters include kanji, hiragana, katakana, Latin
bal Learning & Verbal Behavior, 21, 220-235. alphabets, Arabic numerals, punctuation, and symbols (commas, peri-
SRINIVAS, K., ROEDIGER, H. L., III, & RAJARAM, S. (1992). The role of ods, quotation markers, etc.).
syllabic and orthographic properties of letter cues in solving word 7. Appendix C, katakana frequency list, is cited from Nozaki, Chika-
fragments. Memory & Cognition, 20, 219-230. matsu, and Yokoyama (1997).
THORNDIKE, E. L., & LORGE, I. (1944). The teacher 50 word book of 8. The first 3,000 ranks included 3,031 characters in the 1993 list.
30,000 words. New York: Columbia University, Teachers College 9. Logarithmic transformation was used because the data were posi-
Press. tively skewed-that is, there were more low-frequency characters and
TSUZUKl, T. (1993). Effects ofcontext-dependent and context-independent fewer high- frequency characters. A value of I was added before the log-
associative strength between prime and target words on the processing of arithmic transformation,-that is, log I0 (raw frequency+ 1) instead of
lexical ambiguity. Japanese Journal ofPsychology, 64, 191-198. 10gi0 (raw frequency). This is because many oflowest frequency char-
UKITA, J., SUGISHIMA, I., MINAGAWA, N., INOUE, M., & KASHU, K. acters in the 1993 list were not found at all in the 1966 list (i.e., zero oc-
(1996). Nihongo no hyoki keitai ni kansuru shinrigaku kenkyuu [Re- currence), which would make the logarithmic transformation impossi-
search on written forms of Japanese words] (Japanese Psychological ble.
Monographs 25). Tokyo: Japanese Psychological Association. 10. No value of logarithmic transformation was plotted between 0
WANG, J. (1988). Do phonological and semantic processing of kanji fin- and I in the y-axis because all top 3,000 ranked characters occurred
ish at the same time? Japanese Journal ofPsychology, 59, 252-255. more than 13 times in the 1993 list and the value oflog IO(raw frequency
WYDELL, T. N., BUTTERWORTH, B., & PATTERSON, K. (1995). The in- + I) was more than I.
consistency of consistency effects in reading: The case of Japanese II. lIS has been revised four times over the last two decades, in 1978,
kanji. Journal of Experimental Psychology: Learning, Memory, & 1983, 1990, and 1997. In each revision, a few new codes were added or
Cognition, 21,1155-1168. replaced with old codes for kanji characters. Consequently, although it
WYDELL, T. N., PATTERSON, K. E., & HUMPHREYS, G. W. (1993). is very rare, some characters may appear different, depending on the
Phonologically mediated access to meaning for kanji: Is a rows still version used for decoding on computers. For instance, four kanji char-
a rose in Japanese kanji? Journal of Experimental Psychology: acters that are part of the 1983 JIS standard were replaced by symbol
Learning, Memory, & Cognition, 19,491-514. characters on the CD-ROM. For further discussion, see Yokoyama et al.
YAMADA,J., MITARAI, Y., & YOSHIDA, T. (1991). Kanji words are easier (1998).
to identify than katakana words. Psychological Research, 53, 136- 12. The presently developed kanji character frequency list is available
141. at http://nozaki-Iab.ics.aichi-edu.ac.jp/on the World-Wide Web.
JAPANESE CHARACTER FREQUENCY LIST 489

APPENDIX A
Kanji Character Frequency List
R C r R C F R C F R C F R C F R C F
1 B 336465 51 1't 79543 101 1& 50477 151 M 36776 201 [Ii] 29488 251 W 25212
2 285089 52 13 79040 102 ffl 50447 152 3t 36750 202 *li 29372 252 m 24992
3 + 254534 53 iI1li 76965 103 N1 50357 153 f1:J 36728 203 Ili'iJ 29329 253 ~ 24944
4 223075 54 pq 76927 104 !JiI. 49844 154 1JQ 35822 204 ~ 29150 254 m 24852
5 A Ei
*
218967 55 76402 105 $ 49730 155 rfij 35672 205 29075 255 ~ 24761
6
* 218693 56 75885 106 ~ 49272 156 35583 206 28933 256
I=l
~ ~ uu ~ 24643
7 4p 216931 57 -t; 75735 107 *£ 49270 157 Jf 35472 207 ffl 28881 257 ~ 24601
8 ~ 214989 58 ;<E 72917 108 0 48710 158 '!X. 35185 208 ,Iiij 28645 258 11 24598
9
10
00
=
199502
173495
59
60
~
I't-J
72711
72485
109
IlO
~iS
:Iii
48152
47347
159
160
il!
"'"
'Ji}
34984
34845
209
210
~
is
28644
28343
259
260 *i1' 24517
24395
Jl :$: 170127 61 ~ 72438 1Il !I!f 47088 161 ~
~ 34842 211 ~ 28332 261 Itk 24379
/2 :& 141025 62 Cf 71638 112 )J[ 46819 162 71< 34761 212 '31 28252 262 1ilIi 24237
.....
13 r:j:t
J.+ Ii
138362
129146
63
64
si.
EEl
71475
70301
113
114 *
~
46750
46373
163
164
~
I
34483
34475
213
214
~
Ii.
28078
28047
263
264
1ft€
rr
24203
24143
15 l±l 128173 65 @.] 69078 115 lliiJ 46263 165 ili 34449 215 11:.
28035 265 at: 24010
16 :$ 117146 66 ~ 66881 Il6 ;;. 46208 166 3"1i 34355 216
'"
flJ 28026 266 :(£ 23961
17 t± 116636 67 4- 66659 Il7 IK 45537 167 'I: 34184 217 itt 27919 267 ~ 23799
18 rTi 116094 68 it 65066 Il8 11= 45442 168 J~\ 34036 218 Idf 27543 268 'It 23693

* it!:
114768 64462 119 45322 169 34030 219 ;X 27526 269 IJ
""00
19 69 ~ 23422
20 J3 114266 70 ft; 64343 120 Wl 45294 170 ~ 33863 220 # 27462 270 riO 23384
21 1m 113331 71 64144 121 ~ 44641 171 B 33828 221 *t 27355 271 m 23300
22 :fL 113226 72 f-J 62107 122 ~ 44302 172 ~t 33755 222 ~ 27353 272 ~ 23215
23 [jiJ 107910 73 tJ 61533 123 ~ 44160 173 j(; 33499 223 ;f)t 27286 273 ~ 23190
24
25
13
J&
107594
104317
74
75
~
~
60381
58480
124
125 *
~
43502
43500
174
175
X
if
33456
33206
224
225
~
jffi
27200
27115
274
275
~
iliili
23153
23025

-
26 ~ 103394 76 ajj 58291 126 1J 43371 176 ~a 33090 225 iIit 27115 276 {)): 22841
27 ~ 103093 77 UJ 58290 127 #!t 43047 177 1JJ 32808 227 1£ 27108 277 ~ 22768
28 71 99180 78 !Ilb 57575 /28 -g 42392 /78 32752 228 M 27081 278 19- 22672
29 1: 98454 79 li 57238 /29 i!£ 42157 /79 ~ 32680 229 ~ 26996 279 Et 22402
30 Jlij 96880 80 im 57128 130 ,eX; 41775 /80 Jf.f 32674 230 ~ 26698 280 '1: 22371
31 ~ 95740 81 § 56649 131 ~ 41498 18/ fl 32619 231 {JIIJ 26673 281 $ 22244
32 A
'-' 95483 82 T 56562 132 f4 41321 /82 § 32609 232 fA 26386 282 1/i 22213
33 fi 95351 83 ::£ 56089 133 #i 40315 183 IllT 32408 233 m 26328 283 :}H 22108
34 it~ 95054 84 Jj1: 55727 134 ~ 39517 184 t!!: 32407 234 ~' 26272 284 l£ 22071
35 it!!. 94297 85 ~ 55624 135 fI1i 39433 185 9:. 32208 235 ~ 25994 285 1m 22038
m
36
37
~
III
88040
87500
86
87
r,,~
i;k;
54953
54695
136
137
jfi
{*
39407
38994
186
187 *
~
31999
31607
236
237 im
25973
25858
286
287
~
~
21999
21864
38
39
}'t
/\
87133
86828
88
89

*=
54614
54562
138
139
~
1m
38775
38645
188
189
1f:
:tt
31539
31519
238
238
#
'go
25779
25779
288
289 *
W.
21843
21808
40 .IX 86051 90 ~ 54077 140 ~ 38639 190 IN. 31240 240 >1< 25754 290 ~ 21612
41
..i..
1\ 85900 9/ ~ 54048 /4/ DR 38404 191 En 31087 241 ¥ 25640 29/ ~ 21533
42 ~ 85767 92 ~ 54047 /42 ,~, 38272 192 ~ 30621 242 }JIJ 25521 292 11 21530
43 ro~ 84934 93 ~ 53796 /43 J;J. 37777 193 T 30617 243 ~ 25465 293 'ff.,. 21472
44 tIT 84073 94 l!!1 52743 {44 5i 37577 /93 ~ 30617 244 If: 25377 294 ~ 21456
1f. /45 3jZ 1*


45 ~ 83847 95 52129 37547 195 Ii!JI 30581 245 ~ 25330 295 21338
46 A 82282 96 {t 52076 146 t~ 37433 196 it 30563 246 25292 296 :m 21211
47 ~ 81652 97 ltIIJ 51942 147 m 37309 /97 !f!IJ 30441 247 @: 25281 297 ~ 20973
48 fI3 80909 98 jiJf 51314 148 8b 37052 198 'Ji:. 30439 248 ~ 25279 298 -g 20969
49 =
'T 80845 99 Ij\ 51107 149 tiS 36971 /99 jl§ 30405 249 *lE 25273 299 1ljl 20783
m:
50
CF * 79738
27.4104
---
100
CF
/F 50768
40.7119
150
CF
~ 36947
49.9366
200
CF
30219
57.0228
250
CF
II 25253
62.8109
300
CF
A 20690
67.7131
490 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

APPENDIX A (Continued)
R C F R C F R C F R C F R C F R C F
301 L!: 20667 357 l& 17216 407 IT 14694 451 I: 12884 501 ~ 11241 551 1< 9865
302 Ii: 20579 352 & 17108 402 it 14668 452 :Wi 12865 502 fjj 11203 552 ff!!. 9828
m
303
304 '*
$
20557
20507
353
354 ,@
17106
17040
403
404
iiI
{t:
14659
14601
453
454
:Ibi.
~
12820
12773
503
504
~
~
11197
11174
553
554
i'I.tJ
ij(
9827
9805
305 jIfIJ 20281 355 bi!i 16982 405 1'j- 14566 455 #Ii 12765 505 ~ 11130 555 n 9796
306 ~ 20234 356 f;t 16925 406 om 14564 456 I1f 12713 506 ~ 11 I 15 556 :w: 9792
307 :P 20100 357 *It 16851 407 Ii~ 14471 457 ~ 12706 507 t~ 11066 557 llli..
~~ 9780
± W.

308 20027 358 iE9 16798 408 ~ 14383 458 ~ 12685 508 11047 558 ~2 9768

* *
309 iJii 19987 359 16746 409 14369 459 12631 509 W 11046 559 jj. 9762
310 1f 19917 360 k1< 16712 410 IlR 14335 460 ~ 12610 510 WI 11037 560 {~ 9722
311 ~ 19884 361 lm 16709 411 ~ 14264 461 ffej 12585 511 5!l! 11020 561 fpJ 9711
372
373
:0
~
19633
19561
362
363
~
~
16660
16657
412
413
~
:fit
14262
14202
462
463
$
{!!;
12552
12544
512
513 *
1IIl
11012
10955
562
563
t-t
.ijj.
9708
9699
314 ~ 19517 364 ~ 16610 414 ~ 14129 464 l!i 12482 514 ltt 10871 564 j! 9684
375 ~ 19363 365 ~ 16602 415 a 14120 465 '¥ 12461 515 tl 10822 565
**jl}J 9642
316 ~\ 19359 366 M 16563 416 11}] 14113 466 )il 12443 516 1$ 10796 566 9634
317 ilii 19320 367 W 16520 417 1m 14053 466 £ 12443 517 1i 10795 567 ~ 9624
378 lit 19254 368 ~j£ 16463 418 ~ 13949 468 1Ii! 12393 518 Vi- 10790 568 ~ 9595

* :k *-:
319 jiIff 19246 369 ~ 16324 419 13933 469 fj 12334 519 10752 569 9546
320 ~ 19104 370 ~ 16308 420 ~ 13911 470 ~ 12259 520 ~ 10695 570 ~~ 9532
321 :l* 18976 371 iR 16280 421 m 13818 471 Jil 12175 527 1i 10644 571 ~ 9444
322 ~ 18815 372 1N 16259 422 ~ 13817 472 $ 12148 522 S 10564 572 ~ 9436
323 m 18796 373 ~ 16201 423 JfIi 13803 473 ~ 12137 523 ~ 10563 573 !A 9377
324 t'l. 18746 374 .fF 16165 424 ~ 13788 474 § 12125 524 IDi 10557 574 ~ 9337
±:
325 iiJf 18714 375 1Jl, 16136 425 ~ 13711 475 Tr.
13 11987 525 ill 10535 575 <=t 9317
326 ~ 18702 376 ~ 16093 426 !i~ 13708 476 )J! 11983 525 im 10535 576 /.iii 9314
327 JfX 18605 377 ~ 16063 427 P 13646 477 'iT 11968 527 n>¥ 10451 577 ~ 9311
328 W 18596 378 ~ 16052 428 1:t 13624 478 it 11946 528 I4l 10447 578 :i1[ 9310
329 Wl 18557 379 :If 16023 429 1'iIJ 13525 479 ~ 11923 529 ~ 10345 579 ~ 9281
330 'fjl: 18521 380 ~ 16017 430 iJt 13490 480 ~ 11863 530 ftiji 10289 580 l3€ 9254
331 1l~ 18513 381 M 16005 431 tt 13480 487 ~ 11808 531 5E 10288 581 ~ 9160
332
333
334
It
iR
ffil
18385
18379
18302
382
383
384
*
1;
fftl
15978
15916
15729
432
433
434
~
J.f!
~
13479
13458
13379
482
483
483
::t:
X-
$
11804
11756
11756
532
533
534
!In:
~
U
10249
10228
10224
582
583
584 •
~

i#
9104
9101
9083
335 ~ 18194 385 {Iffl 15708 435 ~ 13312 485 ~ 11739 535 Wi 10220 585 ~ 9081
336 I/tf 18109 386 11]" 15703 436 t5t 13292 486 M 11721 536 ~ 10197 586 i'& 9009
337 ff
m
18075 387 Wf
.fJt
15625 437 :I: 13285 487
*m 11681 537 ~ 10107 587 00
JpJ:
9000


338 18056 388 15566 437 p~ 13285 488 11648 538 fJiX 10094 588 8998
W, fJ<

339 EB 17958 389 15514 439 13241 489 ~ 11624 539 {~ 10093 589 8989
340 Ilt 17877 390 5ll 15401 440 m 13219 490 It 11516 540 ft 10057 590 8975
Jll fti llJ
,.
44/ 10029 8951
".,.
341 ~ 17855 391 F 15245 ~ 13194 491 11465 541 591
342 U; 17850 392 ~ 15241 442 itt 13150 492 ~ 11440 542 ~ 10023 592 )6J 8940
343 'Ff 17845 393 ~ 15190 443 13125 493 ~ 11433 543 !it 9992 593 ~ 8902

•m
344 Ji 17785 394 1'8. 15101 444 ~ 13069 494 W 11398 544 [lW 9991 594 ti 8892
345 *- 17655 395 9'J 15078 445 ~ 13056 495 :ffi 11388 545 m 9986 595 8867
±
346
346 ft
17596
17596
396
397
3i
~
14952
14941
446
447
~
1Jf
13043
13009
496
497
1l
J(f.
11306
11296
546
547 ...
~ 9967
9944
596
597
~ 8856
8847
348 !? 17527 398 ~ 14846 448 "f 13004 498 .it£: 11293 548 71f; 9908 598 M 8815
349 12 17505 399 IW 14835 449 ~ 13000 499 frJ 11289 549 :IX 9882 599 ~ 8737
350 %" 17362 400 ~ 14780 450 ~ 12958 500 £. 11273 550 ~ 9873 600 -1m 8733
CF 71.7398 CF 75.1727 CF 78.1088 CF 80.6841 CF 82.9314 CF 84.9251
JAPANESE CHARACTER FREQUENCY LIST 491

APPENDIX A (Continued)
R C F R C F R C F R C F R C F R C F
601 t: 8727 65/ ffl 7701 701 ~ 6760 751 ~ 6064 801 f.&, 5313 85/ ii 4810
602 ~ 8705 652 fj 7693 702 fj 6748 752 ~¥ 6063 801 ~ 5313 852 t: 4798
603 7E 8703 653 ~ 7692 703 W 6740 753 W 6062 803 ~ 5300 853 it 4796
604 rI 8700 654 J!! 7691 704 *& 6736 754 ~ 6021 803 ~ 5300 854 m 4787
605 ~ 8656 655 i1t 7654 705 it 6735 755 ~ 6009 805 :l1Ji 5293 855 j'jIJ 4780
605 $) 8656 656 -=-
;n:; 7624 706 m 6728 756 II 6003 806 ~ 5292 856 @; 4760
ri!i

607 8652 657 ~ 7616 707 ~ 6725 757 tb 5980 807 i1IU 5280 857 ~! 4759
608 Jffi 8614 658 ~ 7594 708 ~ 6689 758 m 5949 808 5273 858 llt 4754
609 tw 8547 659 t.:. 7545 709 :f.t 6683 759 f'€f 5939 809 fjJ 5270 859 ~ 4747
610 14k 8543 660 ~ 7507 710 ~ 6664 760 'W 5920 809 ilJ 5270 860 1# 4724
s: $ c±r


611 8533 661 7503 711 :0: 6632 761 IlB 5845 811 ~
A 5254 861 }f 4720
612 tv 8517 662 ~ 7492 712 6627 762 ~ 5831 812 !lEt 5251 862 ~ 4716
6/3 ~ 8505 663 ~ 7454 713 ~ 6625 763 J.'R 5817 813 f'? 5231 863 ~ 4715
6/4 f@ 8477 664 gt 7440 714 ~ 6623 764 i!t 5811 814 ~ 5217 864 m 4710
615 ft\ 8473 665 ][ 7423 715 T 6590 765 1ft 5768 815 ij#' 5208 865 ~ 4701
616 ~ 8472 666 :j:f 7422 716 tit 6556 766 lilri 5766 816 $A 5201 866 i:1iil.
,..,
4675
617 7$ 8468 667 ~ 7402 717 ;pJ 6552 767 :jffI 5747 817 $! 5196 867 fl. 4674
618 ~ 8448 668 ~ 7382 718 L: 6545 768 lfu 5742 8/8 ~ 5172 868 *:g 4672
619 ~ 8444 669 ~ 7376 719 J!g 6536 769 J$ 5738 8/9 1m 5169 869 tlJ 4663
619 J6J 8444 670 Hi: 7357 720 Wt 6529 770 ~ 5727 820 m 5163 870 *f1l 4637
621 if 8443 671 ~ 7351 721 1* 6527 771 JIll 5724 82/ rjfj 5158 87/ jp 4627
W Wi! M W 4623

622 {df1 8430 672 ~ 7339 722 6524 772 5703 822 5143 872
623 1l' 8422 673 ~ 7285 723 M 6507 773 1lfl 5686 823 5141 873 % 4622
624 Mr 8327 674 *C 7224 723 # 6507 774 }tIJ 5650 824 ~U 5120 874 ~ 4612
625 il 8320 675 JriS 7201 725 jff1 6505 775 ~ 5647 825 m 5118 875 ??r 4595
626 m 8298 676 f~ 7188 726 ~ 6501 776 ~ 5635 825 ,!t 5118 876
*~ 4585 4591
627 ~ 8248 677 !i& 7176 727 ~ 6488 777 ~ 5618 827 ~ 5109 877 sfj
628 ~ 8194 678 ~ 7175 728 tL 6480 778 1m 5601 828 jJ 5106 878 JIll 4553
629 @: 8165 679 if 7168 729 th 6472 779 ~ 5590 829 m 5094 879 Ij\ 4532
630 tit 8156 680 llt 7135 730 ;6;.
m 6447 780 ~ 5587 830 ~ 5082 880 1m 4521

* £
631 8129 68/ tli 7106 731 6380 780 fflJ 5587 831 [{ili 5080 881 fll; 4513
632 ~ 8125 682 fB 7083 732 rF 6372 782 i1J! 5584 832 f~ 5064 882 m 4512
633 ~ 8106 683 j)!; 7069 733 E§ 6355 783 ~ 5572 833 fl 5056 883 ~ 4499
634
.",.
,." 8102 684 ;'fi 7061 734 f$ 6332 784 m~ 5568 834 lilt 5037 884 ,~ 4492
635 ~j;\ 8101 685 ~ 7036 735 ~ 6327 785 iM 5547 835 JSL 5024 885 ~ 4466
636 Jl 8098 685 ~ 7036 736 ~ 6324 786 ill 5499 836 ~ 5021 886 *PJ 4452

* iE!
637 18 8090 687 ,~ 7023 737 6310 787 ji! 5496 837 5018 887 ~ 4449
638 Me 8069 688 J! 6959 738 m 6276 788 ~ 5477 838 ~ 4984 888 ~ 4446
639 it 8047 689 x: 6921 739 :E 6251 789 tr 5458 839 it 4981 889 )l 4439
640
64/
*L
itt
8007
8002
690
690
'J
~
6910
6910
740
74/
It
::K
6237
6233
790
79/
~
JlJ
5439
5433
840
84/
~~
~
4967
4961
890
891
Jill
~
4433
4432
642
643
M
~
7982
7968
692
693 *fi 6902
6891
742
743
rtlI
~
6225
6224
792
793
~
IIIIi
5427
5388
842
843
~
Ilt1
4930
4918
892
893
~
!l
4431
4407
644 ,~ 7943 694 &. 6874 744 !iii 6217 794 X 5346 844 ~ 4904 894 1: 4404
645 'If. 7915 695 ~ 6836 745 ~ 6210 795 W 5344 845 it 4887 895 ~ 4400
646 § 7908 695 1t 6836 746 is 6186 796 m 5333 846 ,~ 4874 896 ~ 4397
647 ~ 7901 697 JliJj 6796 747 m. 6159 797 ~ 5326 847 ~ 4862 897 ili1i 4387
648 m 7881 698 ~ 6791 748 ~p 6099 797 fit 5326 848 Z 4848 898 f'rl 4379
649 =Itf 7851 699 Wi 6780 749 ~ 6094 799 ~ 5324 849 tl 4837 899 ~ 4378
649 #t 7851 700 ~ 6779 750 '$ 6067 800 m 5314 850 Jf!. 4823 900 ~~ 4370
CF 86.6954 CF 88.2397 CF 89.6189 CF 90.8281 CF 91.9183 CF 92.8971
492 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

APPENDIX A (Continued)
R C F R C I' R C F R C F R C F R C F
90/
902 ttJ
m 4368 951 :l't
1l
3878 1000
*
#.<
3387 1051 ~
ID
2893 1/01 ~
f!
2556 1/51 ~ 2262


4362 952 3875 1002 3376 1052 2886 1/02 2554 1/52 ~ 2260
903 ~ 4346 953 3866 1003 i!I! 3372 1053 JlI 2884 1/03 ~ 2552 1/53 4>
'" 2259
904 e: 4345 954 ~ 3858 /004 m: 3362 1054 1Ji 2883 1/04 m 2548 1/54 tx 2244
905 i2't aD 3854 m 3358 IJ& 2537 1/55 Ml
~
4304 955 ~±
1n05 1055 :j!f 2882 1/05 2240
.-
906 ~ 4298 956 i1i 3853 1006 li'P 3355 1056 ~ 2876 //06 ~ 2528 1/55 "l! 2240
907 it 4293 957 IlJ 3840 1007 1$ 3344 1057 :J$ 2874 //06 m 2528 1/55 11( 2240
908 1'11 4284 958 7 3835 1007 Ii 3344 1058 ~ 2857 //08 r.t 2519 1/58 w!l 2239
909 ~ 4278 958 ~ 3835 1009 ~ 3329 1059 q: 2849 //09 C. 2515 1/59 )If 2237
909 ~ 4278 960 M; 3831 1010 ~ 3308 1060 ~ 2848 1110 1>] 2513 1/60 =§ 2236
9// i'l] 4271 961 iliIil 3827 1011 trf 3290 1061 ~ 2846 1111 ~ 2509 1/61 *L 2235
9/2 frj1 4262 962 r;h 3820 1012 Ilfi1 3287 1062 nl'l 2814 1/12 ~ 2501 1162 j;j,t 2232
913 fro 4261 963 ~ 3818 1013 m 3260 1063 ~ 2810 1113 ~ 2496 1163 ~ 2230
9/4 11- 4245 964 ~ 3817 1014 ~ 3254 1064 M 2807 11/4 1m 2495 1164 ~ 2229
915 tJj 4239 965 ~ 3802 1015 ~ 3253 1065 ml 2806 11/5 ~ 2480 1165 ~ 2201
916 Wi 4230 966 !jJ 3793 1016 m 3245 /066 ~ 2796 1/15 $ 2480 1/66 ~ 2189
917 :;; 4198 967 ~ 3773 1017 ~ 3238 1067 ~'J 2790 1/15 'flU 2480 1/67 1:!1 2183
918 fir 4190 968 ~ 3749 1018 lJf 3220 /068 ~ 2789 1/18 #f 2472 1168 m 2179
9/9 *,J 4188 969 ;(p 3731 10/9 ~ 3217 1069 lli 2788 11/9 ~ 2468 1/69 J£ 2162
920 #f 4174 970 ~ 3728 1020 AA 3194 1070 ~ 2787 1/20 ~ 2460 1/70 Jitif 2151
92/ ~ 4113 971 ji;j 3714 1021 fll; 3186 /071 ~ 2782 1/21 m 2457 1/7/ E2: 2149
92/ W. 4113 972 ~ 3707 1022 ~ 3185 1072 ~ 2770 1/22 ~ 2452 1/72 m 2148
92/
924
~
itt
4113
4109
973
974
!7
~
3704
3691
1023
1024
fa
{IE
3183
3149
1073
/074
IlX
jill
2767
2764
1/23
1/24 *
~
2448 1/73
2441 1/74 i** 2143
2141
925 W: 4095 975 fa; 3649 1025 J1 3131 1074 ,~ 2764 1125 {~ 2439 1175 {= 2139
926 ffl' 4077 976 M 3645 /026 ~ 3128 1074 !Iff 2764 1/26 z 2437 1/76 ~ 2135
927
928
'f
n
4069
4059
977
978
~
~
3637
3634
1027
1n28
i'i
16
3114
3113
1077
1078 *£ 2752
2751
1127
1/28
~
~
2428 1I77 ~...
2413 1178 ~
~c.. 2133
2129
929
930 ~
Jij 4053
4036
979
980
~
!X
3598
3591
/029
1030 *'WlJ 3106
3079
1079
1080
~
fr{
2744
2739
1/29 ~
1/30 tpp
2405 1179 ~
2397 1/80 ~
2117
2108
93/ t«I: 4034 98/ $: 3586 103/ 1ll: 3078 108/ ltf 2737 1/31 ft 2390 118/ :Wl 2089
932 11' 4033 982 i!t 3572 /032 ~ 3077 1082 tfi. 2712 1132 I/l} 2388 1182 1l. 2081
933
934
m.
fJT
4031
4024
983
984
~
5lIt
3564
3556
1033
1034
fiib
~
3066
3055
1083
/084
~
1ft
2687
2679
1133
//34 *
Il
2381 1183 fg;
2375 1184 7E
2079
2075


935 lltl 4019 985 ~ 3553 1035 iii 3042 1085 JFjq 2670 1135 ~ 2372 //85 flJ. 2062
936 Wi 4010 986 ~ 3548 1036 3033 1086 13: 2667 1136 ~ 2349 //86 ~ 2061
937 N.iIi 4009 987 t'2 3538 1037 {JiJ 3032 1087 !Ii 2659 1137 Ji[ 2341 1/87 ~ 2057
938 ~ 3995 988 1W 3531 1038 III 3015 1088 ~ 2655 1138 ME 2335 1188 l:f 2056
939 m 3969 989 tit 3515 1039 ~ 3006 1089 Jijj 2650 1I38 ~ 2335 1189 ~ 2052
940 fl1fl 3963 990 fl 3511 1040 fOC 3004 1090 iii 2645 1140 ~ 2334 1190 ~X; 2043


941 aR 3961 99/ ~(!j 3509 1041 ~ 2993 109/ ~ 2639 /14/ ~J 2325 1/9/ ~ 2029
942 n 3957 992 ~ 3496 1042 2962 1092 ~ 2632 1142 ilii< 2320 119/ ~ 2029
943 nt 3953 993 {tj 3467 1043 ~ 2959 1093 ~ 2631 1142 ~ 2320 1193 l PJ 2028
944 5J, 3923 994 m 3465 1044 ~ 2948 /094 4:i 2622 1144 ~ 2312 1194 ~ 2025
945 M 3919 995 1W 3452 /045 tilQ 2925 1095 5£ 2621 1145 l'i!i 2303 1194 Ml 2025
946 ~~ 3913 996 {l.. 3451 1046 ~ 2923 1096 ~ 2599 ll46 ~ 2294 1196 ~ 2022
947 tR 3900 997 JJ! 3430 1046 itt 2923 1097 7/lf 2595 ll47 m 2281 1197 ~ 2019
948 W 3893 998 ~ 3412 1048 f,Ij 2918 1097 ~ 2595 1148 ~ 2274 ll98 llli 2011
949 W 3887 999 ~ 3393 1049 .Ii 2912 1099 lIT 2580 1148 ~ 2274 ll99 :it 2008
950
CF
~ 3880 1000
93.7751 CF * 3387
94.5564
1050
CF
fI3 2902
95.2278
1100
CF * 2559
95.8138
1150
CF
:till 2272 /200 ~
96.3309 CF
2002
96.7869
JAPANESE CHARACTER FREQUENCY LIST 493

APPENDIX A (Continued)
R C F R C F R C F R C F R C F R C F
1201 liF 2000 1251 *- 1774 1301 m 1616 1351 ~ 1425 1401 M 1222 1451 ~ 1075
r¥ *,J

1202 if 1996 1252 ~ 1771 1301 ~ 1616 1352 1424 1402 ~ 1221 1452 1070
1203 l!'P 1991 1253 1767 1303 Mi 1615 1353 R1l. 1408 1403 Jli 1212 1453 ifIJ 1067
1204 ~ 1982 1254 fJt 1758 1304 iJil 1614 1354 ~ 1406 1404 #3 1210 1454 JJL 1063
1205 r:i 1959 1255 rAe 1756 1305 ~ 1612 1354 ~ 1406 1405 ~ 1204 1455 m 1060
1205 ~ 1959 1256 j:j[ 1754 1306 .Ii 1611 1356 ~ 1404 1405 :ill! 1204 1456 ~ 1056
1207 ~ 1957 1257 ~ 1748 1307 lit 1608 1357 it 1395 1407 tt 1201 1456 r\Jl 1056
1208 ".
m 1954 1258 ""1"1 1747 1308 ~ra 1607 1357 m 1395 1408 ~ 1200 1458 1J1! 1054
1208 ilP 1954 1259 ~ 1744 1309 :f.!l. 1601 1359 R 1391 1409 1* 1199 1458 Iii 1054
1210 iWl 1952 1260 }il 1743 1310 {~~ 1600 1360 ~ 1389 1410 {'R 1195 1460 ~ 1052
1211 ij! 1950 1261 ~ 1741 1310 tt 1600 1361 *'J 1387 1410 ~ 1195 1461 rM 1046
1212 ~ 1946 1262 ~ 1737 1312 tt 1596 1362 ~ 1380 1410 Ili1 1195 1462 ~ 1045
1213 11 1943 1263 §'\ 1736 1313 m 1584 1363 rll 1377 1413 !IEl 1189 1463 '/[ 1041
1214 ~ 1941 1264 ~ 1734 1314 )jJ 1578 1363 '>l'~ 1377 1413 ~ 1189 1463 u 1041
1215 ;fiil 1940 1264 ~ 1734 1315 f'J, 1577 1365 ~ 1373 1415 ~ 1187 1463 )(IJ 1041
.!} til iii 11

1216 ~ 1936 1266 ~ 1732 1316 1569 1366 1370 1416 1185 1466 1040
1217 if! 1933 1267 1731 1317 ~ 1568 1367 M 1360 1417 1.! 1183 1467 r~ 1035
1218 ~ 1926 1268 ~ 1730 1318 r~ 1566 1367 foil 1360 1418 11 1181 1468 jrli 1031


1219 1£ 1923 1269 ~ 1729 1319 ~ 1562 1369 '$ 1354 1419 JlI!i 1179 1469 ~ 1030
1220 ~ 1922 1270 1726 1320 ~ 1560 1369 ~ 1354 1420 iff 1174 1470 ~ 1028
1221 $ 1919 1271 /ffJJ 1706 1321 'Jt 1559 1371 ijj;JJ 1351 1421 #
j~'" 1172 1471 IE 1023
1221 fR 1919 1271 Jtill 1706 1322 ;tJI[ 1547 1372 rm- 1346 1422 rrr. 1166 1472 ~ 1019
1223 ~ 1915 1273 ~ 1704 1323 fj- 1541 1373 ~ 1345 1423 -5 1160 1473 fE 1017
1224 ~ 1905 1274 it 1688 1324 Y: 1538 1374 fA 1338 1424 JfJ 1159 1474 1't 1016
1225 nt 1904 /275 qJ)II; 1687 1325 1J~ 1531 1375 *5J 1332 1425 ~ 1151 1474 L![ 1016
it

1226 ~ 1897 1276 1682 1326 IE 1528 1376 ~ 1329 1425 ~J 1151 1476 ~ 1014
1227 M 1883 1277 tJ§ 1680 1327 1527 1377 ~ 1321 1427 Iii 1142 1476 Hi 1014
1228
1229
1230
M
~
m
1881
1880
1872
1277
1279
1280
*
.!ilIJ
~
1680
1677
1676
1328
1329
1330
~
9'.t
~
1525
1524
1521
1378
1379
1380
~
~
~
1320
1309
1306
1428
1429
1429
tt
IIJ{
1ft
1140
1139
1139
1476
1479
1480
1.iE
III
iW;
1014
1013
1008
Mt IlJj.


1231 1871 1281 ill 1675 1331 fII.! 1517 1381 '11ri 1300 1431 ~ 1136 1481 1000
1232 ~ 1870 1282 r~ 1668 1332 1516 1382 ~ 1298 1432 ~Jl 1129 1482 ~ 997
1233 ""
a 1867 1283 Mi 1661 1333 'fff 1511 1383 m 1297 1433 fllIf 1127 1483 ~ 995
1234 iB 1864 1284 ~ 1658 1334 1\ 1507 1384 ~ 1292 1434 tr ll20 1484 FJi 987
1235 fk 1860 1285 tfi 1656 1335 ~ 1490 1385 III 1289 1434 it 1120 1485 3t 978
1235 5t. 1860 1286 ;Wi 1654 1336 ~ 1489 1386 ;jfjJ 1286 1436 ~ 1119 1486 ~~ 973
1237 ~ 1859 1287
XL
"!!. 1653 1337 g 1487 1387 Il& 1285 1436 I'ifJ 1119 1486 )ffi 973
1238 5If 1857 /288 m 1649 1338 ~ 1484 /388 ~ 1284 1438 ~ 1111 1488 '~ 972
1239 ~ 1849 1289 jW 1641 1339 1:£ 1483 1389 mi 1267 1438 11 1111 1489 ~'.i 970
1240
1241
~
m;
1842
1841
1290
1290
*@ 1640
1640
1340
1341 '*
~
1482
1481
/390
139/
fJi
m
1264
1257
1440
1441
#
1Yj~
1110
1107
1490
1491
U~
3rji
968
964
/242 ~ 1829 1290 T 1640 1342 Jj~ 1478 /392 ~ 1256 /442 ~ 1102 1492 fJ& 963
1243 ~ 1825 1293 il 1639 /343 ~ 1476 1392 'tt 1256 1443 3ii 1094 1492 II!!l 963
1244 ltt 1819 /294 ~ 1637 1344 ~ 1475 /394 'Ilr 1255 1444 ifr 1086 1494 It! 959
1245 ~ 1804 1295 1'Jj] 1631 1345 ~ 1471 1395 ~ 1254 1445 .m 1085 1495 ~ 958
1246 It 1802 1296 ~ 1626 1346 )iJi 1456 1396 1* 1253 1446 11 1083 1496 ~ 957
1247 In[ 1799 1296 ~ 1626 1347 Wi 1449 1397 ~ 1246 1446 Ilfi!J 1083 /497 ;it 954
1248 1m 1788 1298 t# 1625 1348 ~ 1447 1398 ~ 1231 1448 MJj 1078 1498 ~ 953
1249 ~ 1785 1299 frl 1623 1349 if! 1437 1399 $X 1228 1449 1* 1077 1498 m 953
1250 M 1779 1300 kW. 1622 1350 im 1435 1400 1l 1223 1450 ~ 1076 1500 ~ 948
CF 97.1912 CF 97.5529 CF 97.8808 CF 98.1645 CF 98.4101 CF 98.6258
494 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

APPENDIX A (Continued)
R C r R C F R C F R C F R C F R C F
1501 Ilt 943 1551 rfj 818 1601 H 729 1650 ~ 649 1701 ~ 559 1750 m 484
1502 ~ 942 1552 lli 815 1602 ;!I 728 1652 m 648 1702 m 558 1750 f\1t 484
1503 f~ 940 1553 :R 814 1603 11 727 1653 ~ 645 1703
'tM* 555 1753 n~ 483
1504
*
1505 ft*
939
937
1554
1555
~
~
813
811
1604 i1t
1605 ~
725 1654 i'.fl
722 1655 fJ<
639
636
1704
1704 'I:
554 1753 t~
554 1755 f(.,'J
483
480


1506 ~ 936 1556 !Iff 809 1605 1€1 722 1656 :$ 634 1706 I!/ll 553 1756 ~ 475
1507 tff 935 1556 ~ 809 1607 721 1656 w: 634 1707 *~ 552 1757 It 474
1508 ff 932 1558 ~ 807 1608 :$ 719 1656 lb 634 1708 ~ 545 1757 ~~ 474
1508 P~ 932 1559 *1l 796 1609 I~ 716 1659 Z 632 1709 ~t1i 541 1759 :£ 472
1510 llJii 929 1560 .& 793 1610 !9! 715 1659 fliIi 632 1710 $ 540 1760 ~ 469
1511 fib 919 1561 '& 792 1611 £iii 714 1659 lt=li 632 1711 ~ 539 1761 8 466

.
1512 ~ 917 1562 ~ 787 1612 ~ 709 1662 jjiiJ 628 1711 tJji 539 1762 ~ 463
1513 jig 915 1563 ~ 786 1613 ~ 708 1662 w.: 628 1711 $ 539 1762 l'k 463
1514 *4 913 1563 ~ 786 1614 if 706 1664 if 625 1714 535 1764 7ifl 462
T m 704 1665 j l


1515 ~ 912 1565 784 1615 624 1714 ~ 535 1765 00- 461
1516 tr 903 1566 ~ 782 1616 702 1666 1t 622 1716 ~ 534 1766 ~~ 458
/517
1518 ~ *
1519 {eb
902
895
887
1567
/567
1567 ~
JJ
i't
781
781
781
1617 i.l
1618 If.\"
1618 Id:
700 1667 ~
699 1667 ~
699 1669 m
618
618
615
1717
1717
1719
~,f
~
~
533 1767 ;i
533 1768 7'J
528 1768 ~
456
454
454
1520 M 883 1567 l?j 781 1618 it 699 1669 ~ 615 1719 If 528 1770 u± 453
1521 ~ 881 1571 m 779 1621 JftIl 698 1671 .m 614 1719 1j: 528 1771 ~ 451
1522 !'! 876 1571 lft 779 1622 :n 695 1671 Wt' 614 1722 ~ 527 1771 ~ 451
1523 ~ 875 1573 *El 778 1623 ~ 691 1673 ~ 612 1723 f1 523 1773 l!I 449
mr

1524 ill! 874 1574 ItF 777 1623 ~ 691 /673 ~ 612 1724 520 1774 ~ 444
1525 873 1575 m 773 1625 m 689 /675 ~ 6Il 1724 -g9( 520 1774 lifl 444
1525 ~ 873 1576 mi 770 1626 '~ 687 1676 IJJL 608 1726 1/1:1 519 1774 ~ 444
m
1527 :mt
1528 ~
868
865
1577
1578
~
nl:!
767
766
1627 ~
1628 FK.
685 1677
682 1678 ~
606
605
1727
1727 *
f.ft
518 1777 ~
518 1778 ~
443
441
1529 i:ll:
1530 nl'i
1530 ~
864
862
862
1579
1579
1581
*-!c
*
""=
F'
764
764
762
1629 ~
1630 ~
163/ 1lll
680 1679 {/II
679 1680 #e!
676 1681 ~
589
587
586
1727
1730
1730
.-
$

;hi:
518 1779 ~
514 1780 ~
514 1781 m
440
439
436
1532 ~ 861 1582 *'li 761 1632 ~ 675 1682 t}j 584 1730 ~ 514 1782 P¥ 434
1533 ~ 860 1582 ~ 761 1633 fs 674 1683 ~ 583 1733 IlE 513 1783 ~ 432
1534 1E
1535 ~ffi
1535 ~
858
855
855
1582
1585
1586
r11l
lffil
~
761
759
758
1634 -'it
1635 ~f.
1635 pq
673 1684 ~
671 1685 $)(
671 1686 -m
582
581
580
1734
1735
1736
*m-t 512 1783 ~
509 1783 'if
508 1783 an.
432
432
432
1537 ;W 854 1587
* 754 /637 :liT 668 1686 ~ 580 1737 ~ 503 1787 IE 431

**
1538 ~ 852 1587 ~ 754 1638 J1j 667 1686 $G 580 1738 mt 502 1787 431
1539 rr 850 1589 J* 753 1639 ~ 664 1689 *.§. 578 1739 'R' 496 1787 431
1540 1ti 847 1589 W1: 753 1640 fJlJ 662 1690 ~ 574 1740 ~ 494 1790 ~ 430
1540 lW1 847 1591 fdf 751 1641 8lt 660 1690 Ii§" 574 1741 IE 491 1791 f.j 429
1542 jffl 846 1592 #: 750 1642 7f 659 1692
656 1693 t&'* 573 1742 Jjifi 490 1791 ~ 429


1543 71< 843 1593 lIPJ 748 1643 r,E 572 1742 #i 490 1793 iii 425
1544 iii 842 1594 ?!t 747 1643 656 1694 it 567 1744 III 489 1793 IJlIi 425
-m

1545 III 838 1594 111 747 1645 !t. 655 1694 :It 567 1744 489 1795 Ii!J 424
1546 1$ 832 1596 741 1646 It 652 1694 :B 567 1744 00 489 1796 ~Q 423
1547 fi!i 825 1597 tJjIj 739 1646 Ii'{ 652 1697 ~ 566 1747 n 487 1797 ~ 419
1548 '!l 821 1598 ~II 735 1646 iII 652 1698 IiA 565 1747 487 1798 ~ 418
,.
~
1549 ~ 820 1599 lf1,\J 734 1649 it 650 1699 S 563 1747 ijl 487 1799 71. 416
1550 ifJL 819 1600 fI 732 1650 tJi. 649 1700 "1 560 1750 484 1800 Wi 415
CF 98.8138 CF 98.9790 CF 99.1262 CF 99.2550 CF 99.3665 CF 99.4618
JAPANESE CHARACTER FREQUENCY LIST 495

APPENDIX A (Continued)
R C F R C F R C F R C F R C F R C F
1800 f,jj 415 1851 ~ 363 1901 ~ 302 1951 !'KJ 254 2000 ~ 206 2049 f~ 169

*
/802 411 1852 ~ 360 1902 ?,f 300 1951 j£ 254 2002 ~ 204 2052 ~ 168
/803 1Jm 410 1853 tE 359 1903 :!jr 299 1951 ~ 254 2002 ~ 204 2053 illi 167
1804 ,fl¥j 408 1854 Ii 358 1904 ~Ill 295 1951 1t 254 2004 ~ 202 2054 .~ 166
1804 frIiX 408 1855 ,j:g 354 /904 ~ 295 1955 z, 252 2004 JM 202 2055 m 165
1806 mE 405 1856 ~ 351 1904 #l! 295 1956 ifir 251 2006 ifl 201 2056 ~ 164
1806 ~ 405 1857 ~ 350 1907 M 293 1957 iii 250 2(J(J6 ~ 201 2(J57 m 163
1"~

1808 iJi{ 404 1857 $ 350 1907 293 1957 ~ 250 2008 ID& 199 2057 ~ft 163
1808 ~ 404 1857 *7) 350 1907 293 /959 Iii 249 2008 199 2059 ~ 162

* t& 198 162


1808 404 1860 ifJ; 348 1907 ifIIJ 293 1960 248 2010 ~ 2059 f~
1808 ~~ 404 1861 tt 346 19J1 *It 292 1960 l!4 248 2011 ~ 197 2059 f!£ 162
1812 ~ 402 1861 ffl! 346 1912 5* 288 1962 ~I'J 247 2011 iii 197 2062 15k
fJJ
161
/812 i3 402 1861 f$ 346 1912 ~ 288 1963 ~ 246 2013 #l 195 2063 160
/8/4 f7e 400 1864 ~ 344 /914 ~ 287 1964 Mi 245 2014 II~ 194 2064 !f 159
:Mi

1815 ~ 399 1864 ~ 344 1914 287 1965 f1I 243 2015 ~ 193 2064 $ 159
1816 i1J. 398 /866 '$: 340 1916 f;lk 286 1966 M< 241 2016 M 191 2066 157
1817 ¥* 397 1867 ~ 334 /9/7 ~ 285 1967 B 239 2016 ~ 191 2066 jij!j 157
8 ff,j m
••
1818 ~ 396 1867 ffi 334 /918 ~ 284 1968 237 2016 191 2068 156
396 1869 R 333 /9/9 283 1969
.1b
236 2016 if[, 191 2068 1"1 156


1818 ~ R
18/8 396 1869 Ef$ 333 19/9 283 1969 ~Ij 236 2016 1i 191 2070 frl!i 155
1821 ~~ 395 1871 ~ 332 1921 ~~ 282 /971 J§ 233 2016 ~ 191 2070 iI 155
1821 ~ 395 1872 X 329 1922 ~ 278 1971 $Jt 233 2022 fji 190 2072 ~ 154
/823 ~ 389 1873 it 327 1922 m 278 1971 ~ 233 2023 PX 189 2073 ~ 153
m :&l

/824 388 1874 f~ 326 1924 Ilij 277 1974 # 230 2023 JI\ 189 2073 153
1825 ~ 385 1874 326 1925 nfOJ 274 1975 ~ 228 2025 1'5 188 2073 if! 153
1826 fiJi 384 1876 ill 323 1925 Ijl 274 1976 ~ 225 2U26 J![ 187 2073 :~ 153
JIt *lE 150


1827 ~ 383 1876 f'f 323 1925 R 274 1977 ~ 223 2026 187 2077
1828 i'Jll 382 1878 ~ 322 /928 272 1978 ~ 221 2028 t* 185 2078 itJ 149
1829 Jlft 380 1878 '!!i 322 1929 It! 271 1978 tv 221 2028 JO.: 185 2U79 ~ 148
1830 ~ 379 1880 ~ 321 1929 JL 271 1978 1'& 221 2028 M 185 2079 11 148
m
183/
/831
}L
f.m
377
377
1880
1882
'II
!f:t
321
320
/93/
/93/
j(:
RJJ
269
269
/981
1982
~
»f.
219
217
2028
2032 it:
185
183
2081
2081 *
il
146
146

* '1

183/ ~ 377 1883 319 193/ 269 1982 ~ 217 2032 ~ 183 208/ ~ 146
1834 ~ 376 1883 ~ 319 1934 lOft 268 1984 216 2032 f!f 183 2081 m 146

**
1835 tt 374 1885 i~ 316 1934 i'l 268 1984 f8 216 2032 183 2085 ~ 142
4- :tit 141


1836 ;i 373 1886 1f. 315 1934 ~ 268 1984 216 2036 182 2086


1836 B 373 1886 tu 315 1937 267 1987 III 214 2036 ~ 182 2087 ~ 140
f's. 140

•**
/838 ~ 372 1888 t~ 314 1937 ~ 267 1987 214 2038 181 2087
1!?39 fl 371 1889 ~ 312 1939 m 266 1989 1# 213 2039 180 2089 JlJR 139
/839 II 371 /890 7iIJ 311 1940 ~ 265 1990 ~ 212 2040 iilli 179 2089 5Mi 139
371 /890 ~ 311 1940 ~ 265 1990 1!It 212 2041 i!iJJ 176 2091 )i 138
/839 ~


JiiI


1842 ill 370 /892 00 310 1942 ~ 264 1990 JlI 212 2041 ~ 176 2091 138
1842
1844
1l!
~
370
369
1892
1894
Jlt
IPl
310
309
1943
/943 *'
u~
263
263
1993
1994 tHI
210
209
2043
2044 it
.ftf
175
173
2091
2094
f!l
at
fIfi
138
137
136


1845 J! 368 1895 ill« 308 /945 lfl 262 1995 ~ 208 2045 172 2095
/846 6 367 1895 fR 308 1945 262 1995 *- 208 2045 it 172 2095 tt 136

*
/847 *Ji: 366 1897 307 1947 Itt 258 1995 00 208 2047 ~ 171 2095 tri'i 136
/848
1848
~
~
365
365
1898
1899
11.1
~
305
303
1948
1949
~
W.
256
255
1998
1998
JIjj
~
207
207
2048
2049
'*if!, 170
169
2095
2099
~
~
136
135
/850 ;DL
a 364 1899 is 303 /949 ~ 255 2000 m 206 2049 ~ 169 2099 ~ 135
CF 99.5445 CF 99.6146 CF 99.6736 CF 99.7226 CF 99.7627 CF 99.7953
496 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

APPENDIX A (Continued)
R C F R C F R C F R C F R C F R C F
2099 ~ 135 2150 m 112 2200 i#. 95 2249 ~ 79 2298 m 67 2351 ~ 59
2099 1L 135 2152 ffi. 111 2200 flUj 95 2249 ~ 79 2298 '1~ 67 2351 JlJit 59
2103 fIX 134 2152 R: 111 2203 ff 94 2253 ~ 78 2298 w: 67 2351 :tt 59
2103 ~ 134 2152 jIJl 111 2203 ~ 94 2253 m 78 2298 ~ 67 2351 itJ 59
2105 }J:t 133 2152 fc 111 2205 ~ 93 2253 7f 78 2305 li; 66 2351 m 59

*
2105 133 2156 ~ 110 2205 {IE 93 2256 • 77 2305 ijI 66 2351 BrlJ 59
2105 *f1f 133 2156 JIti' 110 2207 iii 92 2256 te 77 2305 {g 66 2351 i1t 59
2105 fill 133 2156
*
2159 j1(
110 2207 III 92 2256 :M! 77 2305 ~ 66 2351 rlJj
2351 M
59

•m
2105 ~ 133 109 2209 ~ 91 2256 ~ 77 2305 1{ 66 59
2110 132 2160 ~ 108 2209 ~ 91 2260 ;ti~ 76 2310 JiI\ 65 2351 ~Dl 59
2110 132 2160 ft 108 2209 ~ 91 2260 ~ 76 2310 ~ 65 2351 ~ 59
2112 ;9t 131 2162 ffii 107 2212 ~ 90 2260 if 76 2310 ~ 65 2351 M 59
2112 :It 131 2162 ~ 107 2213 1f 89 2260 nI 76 2310 ~ 65 2351 ¥- 59
21/-1 !Ill 130 2164 !i!! 106 2213 ~ 89 2260 ill!! 76 2310 1if 65 2364 'If 58
21/-1 H 130 2165 J#! 105 2213 ~ 89 2265 ~ 75 2310 ~ 65 2364 J!l 58
2116 1m 129 2165 ~ 105 2213 ~t 89 2265 • 75 2310 .t1J 65 2364 ti£! 58
2116 j! 129 2167 11$ 104 2213 ~ 89 2265 ~ 75 2317 Ii 64 2364 1i 58
2118 PI'l 128 2167 it 104 2213
*
2219 jlJf
89 2265 jt!
2265 :Ii
75 2317 {bo
1!f
64 2364 M 58

*
2118 ~ 128 2167 ~ 104 88 75 2319 63 2369 ~ 57
2219 JE


2120 'I&- 127 2167 iIi: 104 88 2270 74 2319 ?(i 63 2369 itE 57
2121 !llf 126 2167 M 104 2219 88 2271 1!- 73 2319 1:>1 63 2369 ~13 57
2167 if 2219 it 2271 fm II! 2369 tit


2122 ~ 124 104 88 73 2319 63 57
2123 iffi 122 2167 m 104 2223 ~ 87 2271 • 73 2319 63 1373 1fI\ 56
2123 I!l!I 122 2174 $ 103 2223 I!!l 87 2274 ~ 72 2319 ~ 63 2373 it 56
2125 !lII1:i 120 2174 n 103 2223 ~ 87 2274 if 72 2319 URP 63 2373 ~8 56
2125 iI 120 2174 1f 103 2226 *t 86 2274 fIA 72 2326 !,it. 62 2373 ~ 56
2127 fit 119 2174 if 103 2226 $1 86 2277 Jiir 71 2326 m 62 2373 U{t 56
2127 ~ 119 2174 !tt 103 2226 1m 86 2277 m 71 2326 tJ 62 2373 t$ 56
2226 iii m

2129 ~ 118 2179 ~ 102 86 2277 ~ 71 2326 ~ 62 2379 55
2130 ~ 117 2179 ~ 102 2226 ~ 86 2277 ~ 71 2326 fJ> 62 2379 55
2179 n± m 11

2379 ;lit


2130 fJ 117 102 2231 85 2281 ~ 70 2326 62 55
2130 % 117 2182 m 101 2231 85 2281 jJ" 70 2326 4iJf 62 2379 55
2130 M§ 117 2182 it 101 2233 m 84 2281 Ii 70 2326 I:}'i 62 2379 ruk 55
2134 it 116 2182 lit 101 2234 ~ 83 2281 ~ 70 2326 ~ 62 2379 a& 55
2281 fiij 1M.

2134 ~ 116 2185 ~ 100 2235 ~ 82 70 2326 62 2385 ~ 54
2134 116 2186 !IE 99 2235 1m 82 2281 §l!j 70 2336 III 61 2385 #Ii 54
2134 '&£ 116 2186 ~ 99 2235 m 82 2287 III 69 2336 i'lJ 61 2385 flL 54
2134 "'"
'E 116 2186 ~ 99 2235 ~ 82 2287 rt 69 2336 W: 61 2385 ~ 54
2139 ~ 115 2186 1& 99 2235 1ft 82 2289 ~ 68 2336 Wlf 61 2385 f# 54
2139 11.ti 115 2186 ~ 99 2235 P)l 82 2289 ~ 68 1336 15 61 2385 11 54
2139 V 1I5 2191 ~ 98 2235 ~ 82 2289 :m 68 2336 ({] 61 2385 ~ 54
2139 ~ 115 2191 ttli 98 2235 iii 82 2289 ~ 68 2336 ~ 61 2385 ~ 54
m

2143 ft 114 2191 ~ 98 2243 ~ 81 2289 11f 68 2336 61 2385 ~ 54
2143 JfiJ 114 2194 ~ 97 2243 81 2289 m 68 2336 ~ 61 2385 ~~ 54
m:
•. .
2143 ff. 114 2195 96 2243 Jfla 81 2289 U 68 2336 ftlt 61 2385 ;jl 54
2143 IIMJi 114 2195 ~ 96 2243 (£ 81 2289 68 2336 'It 61 2396 MIl 53


2147 8 113 2195 Iff 96 2247 IW 80 2289 68 2347 ~ 60 2396 53
2147 ~~ 113 2195 ~ 96 2247 80 2298 ~ 67 2347 ~ 60 2396 Pm 53
2298 m


2147 ~ 113 2195 ~ 96 2249 ~ 79 67 2347 {Jt 60 2396 @ 53
2150 ft 112 2200 It 95 2249 il¥ 79 2298 ;fib 67 2347 ~ 60 2400 52
CF 99.8220 CF 99.8434 CF 99.8626 CF 99.8776 CF 99.8926 CF 99.9039
JAPANESE CHARACTER FREQUENCY LIST 497

APPENDIX A (Continued)
R C F R e F R C F R C F R C F R C F
2400 B 52 2449 ~ 46 2496 J.Il 40 2543 !Ill 35 2599 ~ 31 2646 ~ 28
2402 ~ 51 2449 !l5e 46 2496 it 40 2543 ~ 35 2599 m 31 2646 ~ 28
2402 J& 51 2449 1ri! 46 2503 ~ 39 2543 :I!l 35 2599 It 31 2646 FoX 28
If?{ !\!\! .m let

2402 51 2449 ~ 46 2503 39 2543 ~ 35 2599 31 2646 28
2402 ~ 51 2449 ~ 46 2503 Ii 39 2543 jlj 35 2599 !!1. 31 2646 28
2402 ~ 51 2449 if:! 46 2503 ~ 39 2543 ~ 35 2599 tit 31 2646 14' 28


2402 ~ 51 2449 ~ 46 2503 jili)i 39 2543 ~ 35 2599 ifi 31 2646 ~J 28
2402 ffi. 51 2449 J]p 46 2503 ~ 39 2543 :f!lt 35 2599 31 2646 1.t 28
2402 ~ 51 2459 ~ 45 2503 ~ 39 2559 ~ 34 2599 it 31 2646 ~ 28
2402 fJi 51 2459 i'Jf 45 2503 MIJ 39 2559 nit 34 2599 ~ 31 2646 ~ 28
g WJ
*" 34 2599 IlfB
2411 50 2459 ~ 45 2503 39 2559 31 2646 ~ 28
2411 fi 50 2459 at 45 25/2 il!I 38 2559 ~ 34 2599 Il 31 2646 {lL 28
2411 ~ 50 2459 A 45 25/2 ~ 38 2559 ~ 34 2599 mJ 31 2646 QOO 28
24fl @ 50 2464 'Pf 44 2512 JlIl 38 2559 :It 34 2599 !I'J 31 2664 ~ 27
24fl
2411 *
J]lf
50
50
2464
2464
fIJ
-;Jf
44
44
2512
25/2 m
~ 38
38
2559
2559
~
lil
34 2599 Pfi
34 2616 it:
31
30
2664
2664
Il!J:
;1'2
27
27


Wi

2411 {# 50 2464 44 2512 ~ 38 2559 ~ 34 2616 ~ 30 2664 ~ 27
2411 ~ 50 2468
* 43 2512 38 2568 ~
/Rl
33 2616
33 2616 1m
30 2664 ~
fi
27


24JJ ~ 50 2468 ~ 43 2512 ~ 38 2568 30 2664 27
24fl ~ 50 2468 iif 43 2512 38 2568 ~ 33 2616 im 30 2664 00 27
24JJ ~ 50 2468 ~ 43 2512 *BIt 38 2568 iIJJi 33 2616 M 30 2664 ~ 27
2422 ~ 49 2468 ~ 43 25/2 ~ 38 2568 ~J 33 2616 JtE 30 2664 1m 27
-m
.,
2422 ~ 49 2468 ~ 43 25/2 38 2568 tal 33 2616 i;Ili 30 2664 '$ 27
2422 1# 49 2468 ~ 43 2512 IIi* 38 2568 ~ 33 2616 3": 30 2664 ~ 27
2422 g 49 2475 42 25/2 ~ 38 2568 III 33 26/6 !Itt 30 2664 i? 27
2422 5E
m
49 2475 ~ 42 2512


(4f 38 2568
*
m
33 26/6 mt 30 2664 ~ 27

,. ••
2422 49 2475 M 42 2527 37 2568 33 2616 :g£ 30 2664 ~ 27
2422 1M; 49 2475 ia 42 2527 PI 37 2568 33 2616 ~ 30 2664 ~ 27


M

2422 ~ 49 2475 42 2527 37 2568 33 2629 ~ 29 2664 ~ 27
2422 Fr 49 2475 42 2527 B: 37 2568 ~ 33 2629 29 2664 !Rt 27
2422 i1lJ 49 2475 IIf 42 2527 it( 37 2581 $ 32 2629 iAl 29 2681 M 26

.
W

2432 48 2475 ~ 42 2527 ~ 37 2581 ~ 32 2629 ~ 29 2681 ~ 26
2432 'I1J 48 2475 m 42 2533 i1ill 36 2581 ~ 32 2629 $ 29 2681 26
2432 II 48 2475 42 2533 ,fit! 36 2581 M 32 2629 II 29 2681 1C. 26
2432 #!!! 48 2475 7.X 42 2533 lit 36 2581 ~ 32 2629 !M 29 2681 ~ 26
2432 m 48 2486 ~ 41 2533 51! 36 258/ ffl'I 32 2629 ~ 29 2681 m 26
2432 P1! 48 2486 :it 41 2533 i'ili 36 258/ ~ 32 2629 ~ 29 2681 ~ 26
2432 l'IJ 48 2486 IBJ 41 2533 4: 36 2581 tt 32 2629 ~ 29 268/ ~ 26
2432 11: 48 2486 iltf 41 2533 IT 36 258/ lit 32 2629 iI 29 268/ :~ 26


2440 ~ 47 2486 ~ 41 2533 {iJi 36 258/ P'f 32 2629 i~ 29 2681 W 26

"
2440 ~ 47 2486 41 2533 ~ 36 2581 ~ 32 2629 29 2681 tl 26
2440 !ill 47 2486 ~ 41 2533 !i 36 2581 ~ 32 2629 W 29 2681 ~ 26
2440 1$ 47 2486 Pl! 41 2543 ~ 35 2581 i6!t 32 2629 Jft; 29 268/ fa 26
2440 ~ 47 2486 trJi 41 2543 Ii 35 2581 WI. 32 2629 $ 29 2681 M 26
m

2440 mi 47 2486 it 41 2543 ~ 35 258/ ~ 32 2629 ~ 29 2681 26
2440 #i 47 2496 tlj 40 2543 it 35 258/ ~ 32 2646 ~ 28 268/ 26
2440 ~ 47 2496 ~ 40 2543 m 35 2581 !G 32 2646 # 28 2681 ~ 26

'* :Ill:
2440 11# 47 2496 ~ 40 2543 35 258/ 32 2646 ~ 28 2681 .I;€ 26


2449 :¥Ill 46 2496 TiIJj 40 2543 w.Jt 35 2599 ~ 31 2646 1# 28 2681 ~ 26
2449 ~f 46 2496 Em 40 2543 Yi 35 2599 31 2646 f! 28 268/ M 26
CF 99.9139 CF 99.9239 CF 99.9331 CF 99.9381 CF 99.9431 CF 99.9481
498 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

APPENDIX A (Continued)
R C F R C F R C F R C F R C F R C F
268/ IR 26 2725 ~Ji 24 2785 ~ 21 2836 :¥f 19 2886 if! 17 293/ m IS

•'*
2702 Il!k 25 2752 J5 23 2785 ~ 21 2836 It 19 2886 ~ 17 2931 Ii~ 15


2702 It 25 2752 ~ 23 2785 21 2836 p~ 19 2886 ?&. 17 293/ jg IS
2702 ~ 25 2752 ~ 23 2785 21 2836 *HI 19 2886 ~ 17 2931 IS
2702 ~ 25 2752 fiji 23 2785 ~ 21 2836 1l 19 2886 ~ 17 2931 ft IS
2702 &Ii 25 2752 Ilf 23 2806 II 20 2836 1Il~ 19 2886 m 17 2931 JEt 15
2702 ~r 25 2752 ,~ 23 2806 Jjf 20 2836 Sf 19 2907 ft 16 2931 ~i'f 15
2702 t:I 25 2752 III 23 2806 Kl 20 2836 1# 19 2907 IlR 16 293/ frJI 15
2702 flI 25 2752 ~ 23 2806 ~ 20 2836 rtJ 19 2907 3i.} 16 293/ ~ 15
m

2702 ~ 25 2752 23 2806 ~ 20 2836 • 19 2907 fa: 16 2931 ~ 15
m

2702 i1lt 25 2752 Q 23 2806 20 2861 ~ 18 2907 16 2931 10 15
2702 ~ 25 2752 23 2806 ~ 20 2861 ¥J). 18 2907 *L 16 2931 mJ 15
a:
2702
2702 ~
Il% 25
25
2752
2752
:!1i
~
23
23
2806
2806
lEI
V1f
20
20
2861
2861 :Ii!i
18
18
2907
2907 *
m
16
16
2931
2931
!I*
!IW
IS
15
2702 ~ 25 2752 ~ 23 2806 II#i 20 286/ ** 18 2907 ~ 16 2931 '!Ii 15
2702 *n 25 2752 .IE 23 2806 ~ 20 2861 ~ 18 2907 :RlJ 16 2966 j$ 14
2702 ,w 25 2752 m 23 2806 ~ 20 286/ ~ 18 2907 WJl 16 2966 11 14
2702 ~ 25 2752 ~ 23 2806 ~ 20 286/ JJJZ 18 2907 7§. 16 2966 1m 14
2702 ~ 25 2752 &if 23 2806 fi 20 2861 tI 18 2907 ~ 16 2966 JI3't 14
2702 Jl1{ 25 2752 ~ 23 2806 ii!i 20 2861 FX 18 2907 '& 16 2966 R 14


2702 ~ 25 2752 #. 23 2806 ffi 20 2861 it 18 2907 M: 16 2966 ~ 14
2702 :fa 25 2772 iii 22 2806 20 286/ If{; 18 2907 tilt 16 2966 1ft 14
1@: !l t-J

2702 ~ 25 2772 22 2806 T1J:. 20 2861 18 2907 16 2966 ~ 14
m:
,.
2702 25 2772 1* 22 2806 ffl 20 2861 18 2907 ~ 16 2966 J:l 14
2725 !l'I 24 2772 ~ 22 2806 ~p 20 2861 fJ 18 2907 !mi 16 2966 ~ 14
2725 ~ 24 2772 ~ 22 2806 1t 20 286/ %5 18 2907 16 2966 ~ 14
2725 ~ 24 2772 & 22 2806 ~ 20 2861 fit! 18 2907 il& 16 2966 ~ 14
2725 ~ 24 2772
of±
IT,," 22 2806 Wi 20 2861 ~~ 18 2907 i!!I. 16 2966 ~ 14
2725 £j~ 24 2772 jjfl 22 2806 i$ 20 2861 ~ 18 2907 ~ 16 2966 lth 14
2725 JJJD 24 2772 ~~ 22 2806 ~ 20 2861 ~ 18 2907 fit 16 2966 T 14
2725 Wi 24 2772 ~ 22 2806
*~ 20 2861 ~ 18 2931 ~ 15 2966 m 14
2725 ft{ 24 2772 7Mt 22 2806 20 2861 :Hi 18 2931 ~l 15 2966 vJ; 14
2725 :II\! 24 2772 ~ 22 2806 ;!!!;;
7C 20 2861 Jm 18 2931 00 15 2966 ~ 14
2725 ~ 24 2772 ~ 22 2806 {frt 20 2861 ~ 18 2931 ~ 15 2966 :;~ 14
2725 Pm 24 2785 ,fl 21 2806 1£ 20 2861 ~ 18 2931 ii 15 2966 flIL 14
2725 iii 24 2785 wm 21 2836 it 19 2886 • 17 2931 ~ 15 2966 t1i 14
2725 #i! 24 2785 ~ 21 2836 ~ 19 2886 If* 17 2931 to 15 2966 m 14
2725 m 24 2785 ~ 21 2836 :n 19 2886 m 17 2931 ~ 15 2966 til 14
2725 ~ 24 2785 tJ 21 2836 :tS 19 2886 ~ 17 2931 if IS 2966 :II 14
iOO 24 2785 IIQl: 21 2836 ~ 19 2886 • 17 2931 Ii IS 2966 14
""
2725
ft EI
•e
2725 ~ 24 2785 ~ 21 2836 ~ 19 2886 17 293/ IS 2966 ~ 14
2725 24 2785 ~ 21 2836 T* 19 2886 II 17 2931 ~ IS 2966 ~ 14
2725 24 2785 ~ 21 2836 ~ 19 2886 :jp 17 2931 fIij IS 2966 .fti 14
2725 til 24 2785 1$ 21 2836 IJlI¥ 19 2886 ~ 17 2931 tJi IS 2966 W 14
2725
2725
2725
Il'F
J$
.z:
24
24
24
2785
2785
2785
fllJ
;l
l4l
21
21
21
2836
2836
2836
~
~
~
19
19
19
2886
2886
2886
Il\I
ff1
rx
17
17
17
293/
2931
2931
*fA
m
15
15
15
2966
2996
2996
~
J$i
ilH 13
14
13

f<f it m

2725 ilk 24 2785 5'1 21 2836 19 2886 Jl1J 17 2931 15 2996 13
2725 -tJ; 24 2785 IliH 21 2836 tIl 19 2886 .5l!l 17 2931 l,t 15 2996 13
2725 :i\l; 24 2785 m 21 2836 ~ 19 2886 1l! 17 2931 m IS 2996 III 13
cr 99.9531 CF 99.9581 CF 99.9631 CF 99.9681 CF 99.9731 CF 99.9781
JAPANESE CHARACTER FREQUENCY LIST 499

APPENDIX A (Continued) APPENDIXB


R C F Hiragana Letter Frequency List
2996 ;m 13 R C F R C F
2996 ~ 13 1 (J) 1918313 49 '1 48752
2996 !Ill 13 2 vi: 1108840 50 '" 47013
2996 JlI(f 13 3 t: 1067566 51 rJ 32312
2996 ill 13 4 v\ 1060284 52 tJ 31212
13 5 I;): 937811 53
~.
26965
2996 ~ '-

2996 *!Ji: 13 6 :a: 936356 54 :tJ. 23490


2996 fi 13 7 .!:: 927938 55 ~ 23280
2996 :JjiiJ 13 8 ~ 916652 56 <" 21549
2996 m 13 9 tJ' 860742 57 ~ 19865
2996 ~ 13 10 L- 848132 58 V- 19148
2996 ~ 13 11 -r: 764834 59 J: 14425
2996 :r-J 13 12 -c 758316 60 .-:5 13125
13 13 tt. 720156 61 1£ 12402
2996 '"
2996 I!DII 13 14 tJ' 537294 62 « 12108
2996 1Jtf 13 15 -o 467350 63 h 11606
2996 fj 13 16 n 450805 64 ~ 11522
2996 11{; 13 17 G 423294 65 e 10047
2996 IiIiI 13 18 b 396142 66 I1l 8486
2996 f7} 13 19 ? 352965 67 -If 6893
2996 ~ 13 20 T 340654 68 ~ 5124
2996 11 13 21 ~ 333999 69 1-;1: 4349
2996 ii\'f 13 22
~
~ 312227 70 ~ 2755
2996 n 13 23 ti. 280911 71 rJ 1608
2996 m 13 24 't 278599 72 It 1315
2996 !tj 13 25 ~ 258960 73 ~ 986
2996
2996
~
M
13
13
26
27
~
~
233505
223806
74
75
'"i:> 477
125
2996 ~ 13 28 < 221960 76 ;t 106
2996 ~ 13 29 .:b 204256 77 "t? 82
2996 ~ 13 30 It 199362 78 2- 75
2996 it'S 13 31 E 196555 79 d;) 48
2996 ~ 13 32 Iv 190068 80 ;to 21
2996 ~ 13 33 ;t 163664 81 v- 21
CF 99.9815 34 J:: 154206 82 n 3
35 -o 153999 83 :; 1
Note-R = Rank of frequency, C = Character ofkanji, F = Frequency
occurrence in23,408,236 character corpus. 36 ~ 146156
37 -t- 131611
38 b 123077
39 ij 99183
40 7-f.. 89264
41 tt 83444
42 0 73467
43 t!:" 72228
44 j;J 65870
45 t;. 56857
46 A. 56005
47 i' 53256
48 Ij' 49126
Note-R = Rank of frequency, C = Character of hiragana, F = Fre-
quency occurrence in20,711,361 character corpus.
500 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

APPENDIXC
Katakana Letter Frequency List
R C F R C F
1
2
3
:/
Iv
A
290948
189442
178214
49
50
51
*jJ
/'\
22462
22061
21839
4 !- 162802 52 '7 21784
5 7 127845 53 'J 20784
6 .{ 120807 54 -7 20633
7 '7 117203 55 'E 20070
8 !J 106744 56 J 19572
9 17 98209 57 7:' 19240
10 ~'/ 86894 58 I! 18692
J1
12
7J
~
82982
80626
59
60 *zr;
18204
17817
13 lj 75319 61 :3 17731
14
15
0
F
75301
74257
62
63
"'::1'" 14881
13931
16 '/ 61171 64 .y 12526
17 7 61115 65 ~ 10732
18 i- 60608 66 ::l 10318
19 j. 60230 67 ~ 10144
20 ::J 58724 68 7" 10121
21 -:(' 56123 69 -tt' 7689
22 7' 54159 70 t 7289
23
24
T
s;
53404
50758
71
72
'"
::L
7129
6653
25 T 48437 73 If 6481
26 /-\ 44970 74 :t 6245
27 ~' 44462 75 ~ 2897
28 If 40433 76 '/ 2640
29 :\'- 39608 77 ij 1145
30 r7 39323 78 ? 1050
31 -t)- 39202 79 T 149
32 ::::- 38711 80 'Y 127
33 -r 38047 81 =:1 122
34 J:. 36458 82 I 73
35 7' 35920 83 4- 40
36 /-\ 35416 84 ;h 14
37 -e 34883 85 'J 9
38 ;;f" 34718 86 -7" 2
39 -1 33747
40 T 32665
41 ::t. 32616
42 ~ 29262
43 -\' 28144
44 ~ 26651
45 lj' 26396
46 'Y 24541
47 ~ 23742
48 .r( 22755
Note-R = Rank of frequency, C ~ Character of katakana, F = Fre-
quency occurrence in3,608,288 character corpus.

(Manuscript received February 2, 1999;


revision accepted forpublication February 27, 2000.)

You might also like