Most Common Words in English - Wikipedia
Most Common Words in English - Wikipedia
Studies that estimate and rank the most common words in English examine texts written in
English. Perhaps the most comprehensive such analysis is one that was conducted against the
Oxford English Corpus (OEC), a massive text corpus that is written in the English language.
In total, the texts in the Oxford English Corpus contain more than 2 billion words.[1] The OEC
includes a wide variety of writing samples, such as literary works, novels, academic journals,
newspapers, magazines, Hansard's Parliamentary Debates, blogs, chat logs, and emails.[2]
Another English corpus that has been used to study word frequency is the Brown Corpus, which
was compiled by researchers at Brown University in the 1960s. The researchers published their
analysis of the Brown Corpus in 1967. Their findings were similar, but not identical, to the
findings of the OEC analysis.
According to The Reading Teacher's Book of Lists, the first 25 words in the OEC make up about
one-third of all printed material in English, and the first 100 words make up about half of all
written English.[3] According to a study cited by Robert McCrum in The Story of English, all of the
first hundred of the most common words in English are of Old English origin,[4] except for
"people", ultimately from Latin "populus", and "because", in part from Latin "causa".
Some lists of common words distinguish between word forms, while others rank all forms of a
word as a single lexeme (the form of the word as it would appear in a dictionary). For example,
the lexeme be (as in to be) comprises all its conjugations (is, was, am, are, were, etc.), and
contractions of those conjugations.[5] These top 100 lemmas listed below account for 50% of all
the words in the Oxford English Corpus.[1]
A list of 100 words that occur most frequently in written English is given below, based on an
analysis of the Oxford English Corpus (a collection of texts in the English language, comprising
over 2 billion words).[1] A part of speech is provided for most of the words, but part-of-speech
categories vary between analyses, and not all possibilities are listed. For example, "I" may be a
pronoun or a Roman numeral; "to" may be a preposition or an infinitive marker; "time" may be a
noun or a verb. Also, a single spelling can represent more than one root word. For example,
"singer" may be a form of either "sing" or "singe". Different corpora may treat such difference
differently.
The number of distinct senses that are listed in Wiktionary is shown in the polysemy column. For
example, "out" can refer to an escape, a removal from play in baseball, or any of 36 other
concepts. On average, each word in the list has 15.38 senses. The sense count does not include
the use of terms in phrasal verbs such as "put out" (as in "inconvenienced") and other multiword
expressions such as the interjection "get out!", where the word "out" does not have an individual
meaning.[6] As an example, "out" occurs in at least 560 phrasal verbs[7] and appears in nearly
1700 multiword expressions.[8]
The table also includes frequencies from other corpora. As well as usage differences,
lemmatisation may differ from corpus to corpus – for example splitting the prepositional use of
"to" from the use as a particle. Also the Corpus of Contemporary American English (COCA) list
includes dispersion as well as frequency to calculate rank.
OEC
Word Parts of speech COCA rank[9] Dolch level Polysemy
rank
be Verb 2 2 Primer 21
to Preposition 3 7, 9 Pre-primer 17
of Preposition 4 4 Grade 1 12
a Article 6 5 Pre-primer 20
I Pronoun 10 11 Pre-primer 7
it Pronoun 11 10 Pre-primer 18
he Pronoun 16 15 Primer 7
at Preposition 20 22 Primer 14
Preposition, adverb,
but 22 23, 1715 Primer 17
coordinator
we Pronoun 27 24 Pre-primer 6
or Coordinator 31 32 Grade 2 11
if Preposition 44 40 Grade 3 9
me Pronoun 50 61 Pre-primer 10
Dolch list of 95
time Noun 55 52 14
nouns
people Noun 61 62 9
year Noun 63 54 7
see Verb 69 67 25
also Adverb 80 87 2
Dolch list of 95
back Noun, adverb 81 108, 323, 1877 36
nouns
Dolch list of 95
way Noun, adverb 90 84, 4090 16
nouns
Dolch list of 95
day Noun 98 90 9
nouns
Parts of speech
The following is a very similar list, also from the OEC, subdivided by part of speech.[1] The list
labeled "Others" includes pronouns, possessives, articles, modal verbs, adverbs, and
conjunctions.
3 year do first in a
Languages portal
Basic English
Letter frequencies
Zipf's law, a theory stating that the frequency of any word is inversely proportional to its rank in
a frequency table
Word lists
Dolch Word List, a list of frequently used English words
References
4. Bill Bryson, The Mother Tongue: English and How It Got That Way, Harper Perennial, 2001, page 58
5. Benjamin Zimmer. June 22, 2006. Time after time after time... (http://itre.cis.upenn.edu/~myl/languag
elog/archives/003274.html) . Language Log. Retrieved June 22, 2006.
6. Benjamin, Martin (2019). "Polysemy in top 100 Oxford English Corpus words within Wiktionary" (http://
kamu.si/polysemy_top_100) . Teach You Backwards. Retrieved December 28, 2019.
External links