corpus.byu.edu

corpora, size, queries = better resources, more insight



The corpora at this site were created by Mark Davies, Professor of Linguistics at Brigham Young University. They are used by more than 200,000 people each month (more than 370,000 visits); COCA alone, for example, is used by 65,000 distinct people each month. This makes them perhaps the most widely used corpora currently available.

The corpora have many different uses, including:

  • finding out how native speakers actually speak and write

  • finding the frequency of words, phrases, and collocates

  • looking at language variation and change, e.g. across historical periods, dialects, and genres

  • gaining insight into culture, for example what is said about different concepts over time and in different countries

  • designing authentic language teaching materials and resources.

In addition to the ten corpora (and the Google Books (Advanced) interface), there are also many corpus-based resources. These allow you to: