corpus.byu.edu

corpora, size, queries = better resources, more insight


 Upgrade   Contributors 

 Academic site license 

Overview
Corpora
Size, speed, queries
Insight into variation

Updates (May 2016)
History / updates
FAQ / questions
Researchers

Register / create profile
Log in / password
Reset password

Related resources
   Full-text data
   Word frequency
   Collocates
   N-grams
   WordAndPhrase
   Academic vocabulary

Problems
Contact us


The BYU corpora are free, but there are two ways to obtain increased access to the corpus data: purchasing full-text data, and obtaining an academic / site license. These are two very different options, and universities or other organizations typically choose just one of the two.

 

Academic / site license

Full-text data

Access

Online access to the corpora, such as COCA, COHA, GloWbE, or BYU-BNC.

Download the data to your own computer(s). Several corpora currently available: COCA, COHA, and GloWbE, Wikipedia, Spanish, NOW, NOW updates

Typical users

Students, and teachers/professors who are fine with the web interface, and who do not need to manipulate the underlying corpus data.

Those who want to process the corpus data for their own purposes (typically those with programming skills, to manipulate lots of data), and those who do not want to be constrained by the web interface.

Effort involved

Essentially none; just use the web interface.

(Potentially) quite a bit: downloading the files, (possibly) formatting them for quick retrieval, mastering the use of text retrieval software, etc.

Format / queries

Queries available via the web interface.

Three formats (simple text, word/lemma/PoS, and database); many different uses.

Price

$200 - $600 for a one year site license (more...)

$245 - $795, depending on the number of corpora and the number of users  (more...)

Duration

Typically one year, although there is the possibility of discounted two-year and five-year licenses.

One-time purchase; the license never expires.