corpus.byu.edu

corpora, size, queries = better resources, more insight


 Contribute   Contributors 

 Academic site license 

Overview
Corpora
Size, speed, queries
Insight into variation

History / updates
FAQ / questions
Researchers
Log in / password
Profile / register

Related resources
   Full-text data
   Word frequency
   Collocates
   N-grams
   WordAndPhrase
   Academic vocabulary

Problems
Contact us


The corpora at this site were created by Mark Davies, Professor of Linguistics at Brigham Young University. These are probably the most widely-used corpora currently available.

The corpora have many different uses, including:

  • finding out how native speakers actually speak and write

  • finding the frequency of words, phrases, and collocates

  • looking at language variation and change; e.g. historical, dialects, and genres

  • gaining insight into culture; for example what is said about different concepts over time and in different countries

  • designing authentic language teaching materials and resources.

In addition to the ten corpora (and the Google Books (Advanced) interface), there are also many corpus-based resources. These allow you to: