corpora, size, queries = better resources, more insight

 Upgrade   Contributors 

 Academic site license 

Size, speed, queries
Insight into variation

Updates (May 2016)
History / updates
FAQ / questions

Register / create profile
Log in / password
Reset password

Related resources
   Full-text data
   Word frequency
   Academic vocabulary

Contact us

The corpora at this site were created by Mark Davies, Professor of Linguistics at Brigham Young University. These are probably the most widely-used corpora currently available.

The corpora have many different uses, including:

  • finding out how native speakers actually speak and write

  • finding the frequency of words, phrases, and collocates

  • looking at language variation and change; e.g. historical, dialects, and genres

  • gaining insight into culture; for example what is said about different concepts over time and in different countries

  • designing authentic language teaching materials and resources.

In addition to the ten corpora (and the Google Books (Advanced) interface), there are also many corpus-based resources. These allow you to: