corpus.byu.edu

corpora, size, queries = better resources, more insight



The corpora at this site were created by Mark Davies, Professor of Linguistics at Brigham Young University. They are used by more than 170,000 people each month (more than 330,000 visits); COCA alone has 65,000 distinct users each month. This makes them perhaps the most widely used corpora currently available.

The corpora have many different uses, including:

  • finding out how native speakers actually speak and write

  • finding the frequency of words, phrases, and collocates

  • looking at language variation and change, e.g. across historical periods, dialects, and genres

  • gaining insight into culture, for example what is said about different concepts over time and in different countries

  • designing authentic language teaching materials and resources.

In addition to the ten corpora (and the Google Books (Advanced) interface), there are also many corpus-based resources. These allow you to: