Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 3

Word frequency data

Corpus of Contemporary American English

k/60k

list

ons

Notes:

1. Please do not post material from this frequency list on other websites without clearly identifying the source
(http://www.wordfrequency.info), and creating a link from that site to this. There is unique information in this list

you as the owner of this list (drvida8@yahoo.com). A Google query will run every night to search for copies of thi

Web. If it is found and there is no link to this site, you will be identified as the owner and you will be asked to eithe
link immediately or to remove the list from the web. Thanks for your understanding.
2. You can copy and paste this table into Excel and then re-sort by word (alphabetical) or part of speech.

3. The rank order is a function of both frequency and "dispersion", which measures how evenly the word is spread a
corpus (1 = most evenly distributed; 0 = appears irregularly in few texts).

emma

rmats

00)

ta

ase

cabulary

du

4. The free list contains just the bare list of the top 5,000 words in American English. It is also possible to download

that contain the top 20-30 collocates (nearby words) for each of these words -- which provides useful information on

meaning and usage -- as well as to see which words are most common in certain genres (e.g. spoken or academic). I

possible to download highly accurate lists for the top 10,000, the top 20,000 words, and the top 60,000 words in En
their top collocates as well.
5. Part of speech codes can be found at http://ucrel.lancs.ac.uk/claws7tags.html.

6. If you want a printed version of the list with collocates, genre information, etc, you might consider the book Freq
Dictionary of American English: word sketches, collocates, and thematic lists (Routledge, 2010).

Rank

Word

Part of speech

Frequency

the

22038615

be

12545825

and

10741073

of

10343885

Dis

You might also like