Corpora Overview PDF Videos Resources Help / FAQ My account

English-Corpora.org

QUERIES

All of the corpora have exactly the same architecture and interface, which allows users to carry out the following types of searches. One of the important advantages of our corpus architecture is that with one simply query and one click, users can analyze variation by comparing different sections of a corpus; e.g. genres in COCA or the BNC, dialects in GloWbE or NOW, or across time periods (COHA, TIME, recent changes in COCA or NOW, and Google Books (Advanced)).

Visualization. You can see (examples with end up V-ing):	Limiting and comparing sections
1) a chart with the overall frequency of all matching strings 2) the individual strings (overall - all sections) 3) individual strings (in each section of the corpus: genre, dialect, or time period)	1. You can also limit the search to just particular sections of the corpus (e.g. hard NOUN in Fiction) 2. More importantly, you can compare between two sections of the corpus (e.g. hard NOUN in FIC vs ACAD) -- either by genre, dialect, or time period.

Note: click on any link on this page to see the corpus data, and then click on the "BACK" image (see left) at the top of the page to come back to this page.

Type of search	COCA-General	COCA-Genres	GloWbE-Dialects	COHA-Historical
Specific word or phrase	I guess	validity	lah!	of no little
Substring	*al_j	*al_j (MAG/ACAD)	*ism (core/SAsia)	*ism (earlier/later)
Lemma (forms of a word)	CONJ PRON BE like , ( and she was like , )	ADJ CHAIN (FIC/ACAD)	BE different to	HAVE quite V-ed
Part of speech	ADJ eyes	ADJ body (MAG/ACAD)	went ADJ	a most ADJ NOUN
Synonyms	=strong	=strong (FIC/ACAD)	=beautiful WOMAN	=beautiful =girl
User-defined lists	@colors @clothes	FEEL @emotions (FIC/ACAD)	@colors @clothes	felt @emotions
Sortable concordance lines	fathom	argue (ACAD)	diametrically	swell (1930s)
Collocates (nearby words)	BREAK_v	chair (FIC/ACAD)	scheme (US/GB)	gay (earlier/later)
-- Use Mutual Information score	BREAK_v
-- Compare two words	utter / sheer