This site contains links to several free online corpora that we have created or are currently creating:

Display problems above?


Corpus of Contemporary American English (COCA) 400 million words 1990 - 2009
Corpus of Historical American English (COHA; Summer 2009 beta) 400 million words 1810s - 2000s
BYU-BNC: British National Corpus 100 million words 1980s - 1993
TIME Corpus of American English 100 million words 1920s - 2000s
BYU-OED: Oxford English Dictionary 37 million words 1000s - 2000s
Corpus del Español 100 million words 1200s - 1900s
Corpus do Português 45 million words 1300s - 1900s

These corpora allow for a very wide range of queries, including word, phrase, substring, part of speech, lemma, synonyms, customized wordlists, and collocates. Any of these features can be compared across sections of the corpus -- time periods and/or genres -- to look at variation.

It is also possible to collaborate with other users of these corpora. You can search the profiles of other users, and see notes, projects, and publications that they have created that are based on the corpus data (as well as creating your own). And you can volunteer to help (as little as sixty seconds), including creating tutorials and mentoring others.

We hope that these corpora are useful for you in your research, teaching, and learning.