CORPUS.BYU.EDU

seven online corpora | 45 - 425 million words each



The following are some publications that are based on the corpora. Entries submitted in the last three months are marked in green, with the most recent entries first. Only a few of the authors have submitted entries, so probably only about 20-30% of the publications are listed here. But it's a start, and it should give you an idea about how the corpora are being used.
 
In Database Type Date Corpora Author Title Source
2/9/2012 Chapter 2012 TIME  Juhani RUDANKO The Transitive into -ing Construction in Early Twentieth-Century American English, with Evidence from the TIME Corpus The article examines the transitive "into -ing" pattern in the 1920s, and the data come almost exclusively from the TIME Corpus. English Corpus Linguistics: Looking Back, Moving Forward, ed. by Sebastian Hoffmann, Paul Rayson and Geoffrey Leech. Amsterdam: Rodopi, pp. 179-190
2/7/2012 Journal 2010 CDP  Gimena DEL RIO RIANDE Intertextualidad, intermelodicidad y metatextualidad en una cantiga de escarnio del rey Don Denis I used the Corpus do Português for quotations. Estudios Románicos 20, 2011, pp. 201-214.
2/6/2012 Other 2011 COCA BNC  Yusuke Minami “Mary is pretty to look at vs. Mary looks pretty: Property Cognition through Visual Instances supporting my claim are cited in the paper, most of which are those of the construction "Subject be Adjective to look at". (Unfortunately, contents of the proceedings are not given in English on the Web.) Papers from the 11th National Conference of JCLA
2/6/2012 Journal 2011 COCA  Sheyko Anastasia Propositional and semantic realization of value characteristics of "time" in the English linguaculture In the article we used entries from the COCA including the word "time". Lingua Mobilis, # 5 (31), 2011, pp. 98-108
2/5/2012 Chapter 2008 BNC  Szczesniak, Konrad; Marcus Callies Europameisterschaft zu Erdribbeln Looking for sentences exemplifying the x's way construction in order to establish which English verbs occur in it compared to counterparts in German and Polish. "The Linguistics of Football", Narr Verlag, Tübingen
2/2/2012 Journal 2010 COCA  Michael Ramscar, Teenie Matlock and Melody Dye Running down the clock: the role of expectation in our understanding of time and motion We used the Corpus of Contemporary American English to analyze the following distribution of number words between one and a hundred, and the verbs "comes," "goes" and "runs". We were interested to look at the extent to which these words were future- or past-biased and how this related to data collected from a set of fictive motion experiments. Ramscar, M., Matlock, T., & Dye, M. (2010) Running down the clock: the role of expectation in our understanding of time and motion. Language and Cognitive Processes, 25(5), 589-615.
2/2/2012 Journal 2011 COCA CDE  Michael Ramscar, Melody Dye, Hanna Popick, Fiona O'Donnell McCarthy The enigma of number: Why children find the meanings of even small number words hard to learn and how we can help them do better The Corpus of Contemporary American English and Corpus Del Español were used to assess the relative frequency with which the numbers 1–7 are used to describe sets in spoken English and Spanish, respectively. Ramscar M, Dye M, Popick HM, O’Donnell-McCarthy F (2011) The enigma of number: Why children find the meanings of even small number words hard to learn and how we can help them do better. PLoS ONE 6(7): e22501. doi:10.1371/journal.pone.0022501
2/1/2012 MA/PhD 2011 CDE  Roberto Aranovich Optional agreement and grammatical functions: a corpus study of dative clitic doubling in Spanish This dissertation discussed the phenomenon of dative clitic doubling in Spanish. The data were gathered from Corpus del Español (Davies, 2002-). Significance tests were conducted in order to determine the relevance of different variables as predictors of dative clitic doubling. University of Pittsburgh
1/30/2012 Book 2012 COCA BNC  Alexander Tokar Introduction to English Morphology In Chapter 1, I explain how COCA and BYU-BNC can be used to establish whether the form under analysis qualifies as a word (i.e. whether it is a free or a bound form). In Chapter 4, I explain how a student of English Morphology can determine whether a particular derivational affix has recently given rise to new lexemes in English. In addition, most of the analyses in this book are illustrated with examples drawn from COCA. Peter Lang: Frankfurt am Main
10  1/29/2012 MA/PhD 2011 CDE  Antonio PIMENTEL Cambio morfológico en español. Aproximación al sufijo -ble Researched Spanish words from the 13th to the 19th century. Universitat de Girona
11  1/27/2012 Chapter 2011 COCA  Miller, Philip The choice between verbal anaphors in discourse. This paper investigates factors affecting the choice between various VP anaphora constructions in English, namely VP-ellipsis (VPE), "do so", and "do it/this/that", on the basis of an extensive corpus analysis based on the COCA. Several hundred examples of each type were collected in different registers and were analyzed with respect to various syntactic, semantic and pragmatic criteria. Hendrickx, I., Lalitha Devi, S., Branco, A., and Mitkov R. (eds.). Anaphora Processing and Applications. 8th Discourse Anaphora and Anaphor Resolution Colloquium, DAARC 2011, p.82-95.
12  1/27/2012 Journal 2011 COCA BNC  Ramon Marti Solano Phraseological units in English: variation through lexical insertion The results of the retrieval of all types of transformations, idiom variants and exploitations in my corpus have been systematically contrasted with potential occurrences in the BNC and the COCA in order to account for institutionalisation, lexicalisation, ad hoc transformations, contextualisation and levels of creativity. RILA (Rassegna Italiana di Linguistica Applicata), 1-2, 389-404
13  1/25/2012 Journal 2010 BNC  Taverniers, Miriam & Gudrun Rawoens Three types of oriented adjuncts in English and Swedish All English examples are from the BNC. Moderna Språk 104(1): 1–14
14  1/25/2012 Journal 2012 COCA  Marc BRYSBAERT Adding Part-of-Speech information to the SUBTLEX-US word frequencies The SUBTLEX-US corpus has been parsed with the CLAWS tagger, so that researchers have information about the possible word classes (Parts of Speech; PoS) of the entries. Because the current definition of lemma frequency does not seem to provide word recognition researchers with useful information (as illustrated by a comparison of the lemma frequencies and the word form frequencies from the Corpus of Contemporary American English), we do not provide this variable. Behavior Research Methods
15  1/24/2012 Journal 2009 BNC  Wynne, Martin Focus on Corpus Design The architecture is referred to in a section on the design of corpora and online services to access them. Anglistik, 20:1
16  1/22/2012 Journal 2011 COCA BNC  Lanteigne, B. and Crompton, P. Analyzing Use of "Thanks to You": Insights for Language Teaching and Assessment in Second and Foreign Language Contexts The study analyzes use of "thanks to you" in the Corpus of Contemporary American English and the British National Corpus to gain insights for English language instruction /assessment in the American context, as well as English-as-a-lingua-franca contexts. Research in Language, 9(2), 29-50
17  1/21/2012 Journal 2011 CDE  Paul O'Neill Los verbos defectivos en la lengua española: estudio sincrónico y diacrónico descriptivo basado en datos de corpus. My study was based around statistical tests from corpora data. I used the corpus del español for both the diachronic and synchronic data. Boletín de la Real Academia Española, 300:255-287
18  1/21/2012 Journal 2008 COCA BNC  Chinger Zapata Harry Potter y el Príncipe Mestizo. Diferencias léxicas entre la edición americana y británica The study uses the Corpus of Contemporary American English, the British National Corpus and the American National Corpus to verify and contrast the use of the lexicographic repertoire of both the British and the American edition of Rowling's sixth book Harry Potter and the Half-Blood Prince. Revista Perfiles, Vol 29, N° 7. pp. 7-33
19  1/20/2012 Journal 2012 COCA TIME  Patrick DUFFLEY Collocation, Interpretation and Explanation: The Case of "just any" Data from COCA and TIME were used to complete the dataset of collocations of the quantifier "any". LINGUA 122: 24-40
20  1/18/2012 Chapter 2012 CDE  Montserrat BATLLORI Diacronía de los verbos psicológicos: una propuesta de entrada léxica I extracted data concerning the verb ACEPTAR in order to analyze the evolution of this verb from the Latin to Modern Spanish. I used data from different sources (one of them was the BYU corpus). In: Historia del léxico: perspectivas de investigación. Iberoamericana/Vervuert.
21  1/8/2012 Journal 2011 COCA BNC TIME  Dilin LIU Making grammar instruction more empowering: An exploratory case study of the use of corpora in the learning/teaching of grammar BNC, COCA, and the Time corpus were used by students to study contemporary English lexicogrammatical usage. Research in the Teaching of English, 45.4, 353-377
22  1/8/2012 Journal 2012 COCA BNC  Dilin Liu The most frequently-used multi-word constructions in academic Using the academic writing sub-corpora of the Corpus of Contemporary American English and the British National Corpus as data and building on previous research, this study strives to identify the most frequently-used multi-word constructions (MWCs) of various types (e.g., idioms, lexical bundles, and phrasal/prepositional verbs) in general academic writing across the academic divisions of the corpora and to examine their usage patterns. English for Specific Purposes, 31.1, 25-35
23  1/6/2012 Chapter 2010 COCA BNC  Agata Lewandowska Korpus językowy - recepta na własne materiały dydaktyczne nauczyciela języka obcego (The language corpus - a recipe for a language teacher's own teaching materials) The paper is an overview of corpus-based teaching materials, prepared to address students' needs and problems. Example tasks and activities discussed in the paper are based mostly on BYU BNC and COCA. Materiały edukacyjne w nauczaniu języków obcych (Educational materials in foreign language teaching), Krystyna Droździał-Szelest (ed.), pp. 109-124.
24  1/2/2012 Journal 2011 COCA  Donald E. Hardy, Crystal Broch Colombini A genre, collocational, and constructional analysis of RISK We extend existing scholarship by analyzing the noun RISK over the last two decades of contemporary American discourse with recourse to the Corpus of Contemporary American English. Using frequency analysis, concordance contextual analysis, collocational analysis, and a variation on distinctive-collexeme analysis, we revisit the assertion that RISK is semantically negative, and then investigate the semantic potential for good risk, a phenomenon associated with economic and adventuring sectors. International Journal of Corpus Linguistics 16.4 (2011): 462-85
25  1/2/2012 Journal 2011 COCA BNC  Dilin Liu "The most-frequently used English phrasal verbs in American and British English: A multi-corpus examination." This study uses the Corpus of Contemporary American English and the British National Corpus as data and Biber, Johansson, Leech, Conrad, and Finegan's (1999) and Gardner and Davies' (2007) informative studies as a starting point and reference. The study offers a cross-English variety and cross-register examination of the use of English phrasal verbs (PVs), one of the most difficult aspects of English for learners of English as a foreign language or English as a second language. TESOL Quarterly. 45.4, 661-688.
26  1/2/2012 Journal 2011 COCA COHA TIME  Geoffrey LEECH The modals ARE declining: Reply to Neil Millar's "Modal verbs in TIME: Frequency changes 1923-2006" I show that, contrary to what is suggested in Millar's article in International Journal of Corpus Linguistics 14.2 (2009), the modal auxiliaries as a class have been declining significantly in recent American and British English. International Journal of Corpus Linguistics. 16.4, 547-564
27  12/29/2011 Chapter 2011 COCA  Gabrijela Buljan, Alma Vančura Waltzing and snaking through physical space: English denominal verb conversions on the lexical-constructional interface The COCA corpus was used to extract all verb lemmas occurring in the Intransitive Motion Construction [V into NP] to inform our hypothesis about the semantic affinities between certain lexical conceptual classes of verbs and the intransitive motion construction. Peter Lang, Berlin
28  12/27/2011 Journal 2011 COCA BNC  Christelle Lacassain-Lagoin Periphrastic verbal phrases in perception utterances: Have a Taste, Give a Look and Take a Listen. I searched the corpora for perception periphrastic phrases. My aim was to show that some perception verbs are not used in such periphrastic phrases while others are rather frequent. Blandine Pennec & Olivier Simonin (dirs.) Les locutions de l’anglais : Emplois et stratégies rhétoriques – Fixed phrases in English: Use and rhetorical strategies, Cahiers de l’Université de Perpignan n°40, Presses Universitaires de Perpignan, 2011, pp. 149-179
29  12/26/2011 Journal 2012 COCA  Ilana Salem L1-L2 sentence translation in classroom grammar tests I used COCA to find out the typicality of lexico-grammatical relationships through their frequency in COCA texts. Scheduled for ELTJ 66/2; at present available through advance access: http://eltj.oxfordjournals.org/content/early/recent
30  12/22/2011 Book 2011 COCA  Keith Folse Clear Grammar 1 In several chapters, data from the BYU corpora were used to determine which lexical items are used most frequently for a given grammatical point. For example, the top 10 count nouns used after HOW MANY are: people, times, years, hours, children, kids, women, days, votes, men. (p. 252) University of Michigan Press (Ann Arbor, MI)
31  12/19/2011 Book 2008 CDE  Johannes KABATEK Johannes Kabatek (Hrsg.), Sintaxis histórica del español y cambio lingüístico: Nuevas perspectivas desde las Tradiciones Discursivas Collective work with several papers using several corpora. CORDE and Corpus del Español are used in different papers. Madrid: Iberoamericana
32  12/19/2011 Other 2011 BNC  Alexander Osherenko EmoText: Opinion Mining in Long Texts Opinion mining in long texts using the BNC frequency list and comparison with other lists, e.g. words from Whissell's Dictionary of Affect Language. Socioware Development
33  12/18/2011 Journal 2011 COCA  Simone Sarmento; Livia Mottin Chick Lit, Chick Magnet, Biker Chick, Hippie Chick: Metaphorical Uses of Chick in Contemporary American English This paper aims at unraveling how the lexeme “chick” can be metaphorically used to refer to female universe. We review some literature on metaphors and how the domain of animals can be used to explain human behavior. The data was collected in the Corpus of Contemporary American English. In each subcorpus of COCA, the first 100 random concordance lines were analyzed in context to check if “chick” was used in its metaphorical or in its literal meaning. Brazilian English Language Teaching Journal, vol.2-n.1-p 97-112
34  12/16/2011 MA/PhD 2008 CDP  Camilla Canella Moraes Luzorio/Tania Shepherd Grammaticalization and Complex Prepositions in Portuguese: corpus-based study The dissertation investigated a number of complex prepositions in Portuguese which are undergoing a process of Grammaticalization. To this end the Corpus do Português was used extensively in a diachronic manner. Universidade do Estado do Rio de Janeiro
35  12/15/2011 Journal 2011 BNC  Van de Velde, Freek Left-peripheral expansion of the English NP In line with the usage-based approach taken in this article, BNC was used as a source for authentic corpus examples. English Language and Linguistics 15.2: 387–415
36  12/13/2011 Chapter 2011 COCA  Thomas Egan Perception and conception: The ‘see x to be y’ construction from a cognitive perspective The study investigated a comparatively rare construction, the ‘see x to be y’ construction. COCA was used as a source of tokens of the construction. Converging Evidence: Methodological and theoretical issues for linguistic research, edited by Doris Schönefeld, 57-80
37  12/12/2011 MA/PhD 2011 BNC  Alfred Jones A Corpus-Linguistic Approach to Foreign/Second Language Learning: An Experimental Study of A New Pedagogic Model for Integrating Linguistic Knowledge with Corpus Technology Data from BYU-BNC were used for attesting grammatical phenomena and for learning purposes (data-driven learning). The University of Hong Kong
38  12/7/2011 Book 2012 COCA BNC  Hannisdal, Bente and Jenset, Gard Analyzing English Grammar: Exercises for Advanced Students Exercise/workbook. Sentences from COCA and BYU-BNC were used as material for grammar exercises aimed at Norwegian undergraduate university students of English. Approximately 25-30% of the exercises are related to COCA/BNC. The book refers to the corpora. Fagbokforlaget (Bergen; Norway)
39  12/4/2011 Book 2011 COCA BNC TIME  Dmitry Yermolovich Russian Practical Dictionary (Russian/English-English/Russian) As my dictionary shows typical collocations for English equivalents of Russian words, such collocations were identified or verified using the BYU corpora. Hippocrene Books, N.Y.
40  12/2/2011 Journal 2011 BNC  Hélène Margerie Grammaticalising constructions: TO DEATH as a peripheral degree modifier I used the data from BNC to investigate the frequency rate and the collocational profile of TO DEATH as degree modifier in present-day English Folia Linguistica Historica 32, pp.115-148
41  11/30/2011 Journal 2011 CDE CDP  Andreea Calude, Mark Pagel How do we use language? Shared patterns in the frequency of word-use across seventeen World languages We used the corpora to obtain frequencies of the words encoding basic vocabulary items (the 200-item Swadesh List). Philosophical Transactions of the Royal Society B, 2011, 366:938-948
42  11/30/2011 Journal 2011 COCA  A. Nagornaya Ways of conceptualizing interoceptive sensations and emotions in contemporary English: a comparative analysis The data from the corpus were used to illustrate similarities and differences in conceptualizing interoceptive sensations and emotive states in present-day English. European Social Science Journal, No 8, 2011, pp. 110-119
43  11/30/2011 Journal 2011 COCA  A. Nagornaya Verbalization of interoceptive sensations: syntactic level The data from the corpus provided examples of what syntactic structures are employed for verbalizing interoceptive sensations in contemporary English. Theory and Practice of Social Development, No. 5, 2011, pp. 291-295
44  11/27/2011 Journal 2010 CDE  Enrique PATO Algo más sobre la historia del posesivo lur In this article we review the history of the possessive pronoun lur (plural lures) and examine the principal characteristics of illorum in the Romance languages. In reviewing this pattern of use, based on notarial documents and literary texts, we verify that the percentages obtained are very similar in both corpora used (CODEA and Corpus del Español). Archivo de Filología Aragonesa, 66, 13-32.
45  11/26/2011 Book 2009 BNC  Sallam, A. How Authentic is the Language Presented in EFL Textbooks?: A case study of Multi-word Units in National Secondary School EFL textbooks in Egypt The book evaluates the authenticity of the multi-word units presented in EFL textbooks used in Egyptian secondary schools by comparing them to the data obtained from the BNC. BYU BNC was of great help in providing the required data for the comparison. Saarbrücken, Germany: Lambert Academic Publishing
46  11/26/2011 Journal 2011 COCA  Sandy Habib Angels can cross cultural boundaries Most of the English-based data was drawn from COCA Rask, International Journal of Language and Communication. Vol 34, pp. 49-75
47  11/24/2011 Journal 2011 CDE  Javier RIVAS “Como no me trago el humo...:” A Corpus-Based Approach to Aspectual Se Part of the data for this study was taken from the oral section (texts from España Oral) of Corpus del Español. Studies in Hispanic and Lusophone Linguistics 4/2: 379-416
48  11/24/2011 Other 2011 COCA BNC  Rudanko Juhani Unexpected and Innovative Uses of Constructions: Is There a Need to Supplement a Dictionary with Constructional Information? The BNC and COCA were used as sources of unexpected and innovative tokens of the transitive "into -ing" pattern in recent English. Proceedings of the Fifth International Conference on Historical Lexicography and Lexicology ICHLL5 (Collection of Conference Papers), Oxford Research Archive
49  11/23/2011 Journal 2011 CDP  Iva SVOBODOVA Artigo definido e nulo com antropónimos e topónimos This is an analysis of the use of the article with anthroponyms and toponyms in different linguistic registers, with different geographical names (cities, countries). Svobodová, Iva. Études Romanes de Brno, Masarykova univerzita, 32/2011/1, pp. 157-169 (13 pp.). ISSN 1803-7399. 2011.
50  11/23/2011 Journal 2011 COCA  Federico Zanettin Translation and Corpus Design I checked the COCA texts database to see what the proportion is of texts in translation. Synaps, 26/2011, 14-23
51  11/23/2011 Journal 2011 COCA  Ela Krejčová "That's an interesting question, indeed, not only for you and I": A (non-systematic) fluctuation of personal pronoun forms I used the data from the BYU corpora to demonstrate the idiosyncrasies in the use of the first person singular pronoun in coordinate constructions. Brno Studies in English, Vol. 37, Number 1, 63-78
52  11/22/2011 Journal 2011 COCA COHA BNC TIME  Shinya Hirasawa On the Joint Sense of the English Preposition I argued that, in the constructions exemplified by "John grabbed Susan by the arm" and "John tied the dog to a pole by its tail [by a long chain]," the English preposition "by" introduces a means to connect two entities and make a conceptually unified whole. I used the BYU corpora to find relevant examples. Tokyo University Linguistic Papers
53  11/22/2011 Journal 2011 COCA  Masahiro Akiyama Infinitives, NegP and a Theory of Functional Projection In order to examine the selectional properties of verbs like "bother", "hasten", "pretend", "vote", "vow", "want" etc., I searched for examples where the infinitival complements of these verbs contain perfect "have", progressive "be" or the negative marker "not". Proceedings of the 13th Seoul International Conference on Generative Grammar, 29-44
54  11/21/2011 Journal 2011 CDP  Benjamin FAGARD Conjonctions et grammaticalisation : le cas des langues romanes. I used the Corpus do Português in order to check the frequency and use of a large array of conjunctions in Portuguese, in contrast to other Romance languages. Mémoires de la Société de Linguistique de Paris 19 : L’évolution grammaticale à travers des langues romanes (François, J. & Prévost, S., eds)
55  11/21/2011 Journal 2011 CDP  Benjamin FAGARD La construction ’en effet’ dans les langues romanes médiévales et la question de l’emprunt. I used the Corpus do Português to study the existence, frequency and evolution of "em ef(f)eito" in Medieval Portuguese. Osla 3/1, Discourse markers in Romance languages, 69-94.
56  11/21/2011 Book 2010 CDP  Benjamin FAGARD Espace et grammaticalisation – L’évolution sémantique des prépositions dans les langues romanes. I used the Corpus do Português marginally in this book, to check the existence of given constructions and uses of adpositions in Portuguese, in a diachronic perspective. Editions Universitaires Européennes, Sarrebruck.
57  11/15/2011 Journal 2009 CDP  Valerio BÁEZ SAN JOSÉ, M. Loma-Osorio Fontecha y G. Fernández Rodríguez-Escalona Diccionario sistemático de los determinadores temporales en español y sus correlatos ingleses Based on working papers in which the system of the temporal determination of the utterance was described, the authors present a dictionary on temporal determiners together with their English correlates. Each entry consists of a lemma with its own definition via 1) the determinative value of each determiner within the performative structure of the utterance, 2) a clarifying example from “this analised corpus”, and 3) its semantic-pragmatic features. Diccionario sistemático de los determinadores temporales en español y sus correlatos ingleses, Estudios de Lingüística del Español (ELiEs), Volumen 29, ISSN:1139-8736, páginas 1265.
58  11/11/2011 Journal 2011 COCA BNC  Qodrat Hassani A Corpus-Based Evaluation Approach to Translation Improvement I used COCA as my frame of reference to see how a corpus-based approach to evaluating translations could affect the overall quality of the translations done by professional translators in the long run. Meta: Translators' Journal, Volume 56, No. 2, 351-373
59  11/11/2011 Journal 2011 COCA COHA  D. A. Kwary & K. W. A. Arum Lincoln's vs. Obama's Presidencies: A Diachronic Corpus Based Analysis of the Adjectival Collocates of [man] and [woman] in the American English We use COHA and COCA to find the adjectival collocates of [man] and [woman] during Lincoln's and Obama's presidencies. ReVEL, v. 9, n. 17.
60  11/7/2011 Journal 2011 BNC  Stanojević, Mateusz-Milan; Kryžan-Stanojević, Barbara; Parizoska, Jelena A contrastive view of adjectives in Croatian, Polish and English: subjectification as a local phenomenon The data from the BNC were used as illustration of theoretical notions, and were contrasted with the data from a Polish and Croatian Corpus. Cognitive Studies, 11, 31-52
61  11/2/2011 Journal 2011 COCA  Van de Ven, M., Tucker, B.V, Ernestus, M. Semantic context effects in the comprehension of reduced pronunciation variants Used COCA to calculate word frequencies. MEMORY & COGNITION Volume 39, Number 7, 1301-1316
62  11/2/2011 Journal 2011 COCA  Tremblay, Antoine, Tucker, Benjamin The effects of N-gram probabilistic measures on the recognition and production of four-word sequences COCA was used for extracting frequency counts of four word sequences. The Mental Lexicon. 6(2): 302-324.
63  10/31/2011 Journal 2010 TIME  Eva BERLAGE The Lexicalisation of Predicative Complements in English The article deals with the lexicalisation of predicative complement constructions in English and is based on large-scale analyses of electronic corpora. The Time Corpus of American English has been used to retrieve evidence for the semantic and syntactic changes affecting these constructions between 1923 and 2006. The paper uses qualitative and quantitative evidence and compares data from different time periods. Transactions of the Philological Society 108:1: 53-67
64  10/26/2011 Journal 2011 COCA  Neuman, Y., Turney, P., Cohen, Y., & Assaf, D. How Language Enables Abstraction: A Study in Computational Cultural Psychology To gain more insight into the relations in which "darkness" and other signs are embedded, we use the Corpus of Contemporary American English (COCA). Integr Psych Behav DOI 10.1007/s12124-011-9165-8
65  10/24/2011 Book 2009 BNC  Jesús Fernández-Domínguez Productivity in English word-formation. An approach to N+N compounding This book focuses on the notion of "morphological productivity" and, by use of corpus-based techniques, introduces two measures, Indicator of Profitability (p) and Trend of Profitability (P), which can be applied across word-formation processes and are able to compute their productivity based on semantic categories. Bern: Peter Lang
66  10/19/2011 Journal 2011 COCA  François MANIEZ Structures syntaxiques et schémas phraséologiques de l'anglais médical contemporain : tentative de description d'un style spécialisé We attempt to show that some syntactic patterns are typical of medical language, as regards the formation of complex terms as well as phraseology. The study of the ACADEMIC subset of the Corpus of Contemporary American English (Davies 2009) allows us to statistically confirm a number of distinctions between medical language and general language, and to sketch a description of the most salient differences between medical research articles and those written in other areas of academic expertise. Bulletin de la Société de stylistique anglaise, numéro spécial « Le style spécialisé », ESA n°2, BSSA n° 35
67  10/18/2011 Chapter 2012 COHA  Reijirou SHIBASAKI ‘Please tilt me-ward by return of post: On the vicissitude of a marginal pronominal construction in the history of English’ COHA was used together with other corpora to investigate the pronominal construction "me-ward" and its variants along with all other grammatical person constructions, e.g. him-ward. This study exhibits the decline of the construction over time, with a rise in the counterpart construction "toward me". Irén Hegedűs, József Andor, & Alexandra Fodor (eds.), Papers from the 16th ICEHL (held in Pécs, Hungary, August 2010), Amsterdam: John Benjamins.
68  10/18/2011 Chapter 2009 COCA CDE  Olga Lashevskaja, et al Frequency Dictionary of Contemporary Russian (based on RNC data) References to other corpus-based frequency dictionaries in Introduction Azbukovnik: Moscow
69  10/17/2011 Book 2011 CDP  Maria do Carmo VIEGAS (Org.) Minas é Plural The book "Minas é Plural" is the result of the work of the research group of the Faculdade de Letras at UFMG, coordinated by Profa. In this publication, emphasis is placed on the study of vowels, since the realization of vowels is always one of the most important factors in the division of Brazilian dialects. Most of the studies used the VARFON-Minas corpus. The last two chapters used the Corpus do Português by Davies & Ferreira (2006-). Faculdade de Letras da UFMG - Belo Horizonte
70  10/17/2011 Chapter 2011 CDP  Pamella PEREIRA; Maria do Carmo VIEGAS A expressão não obstante: gramaticalização no português This study analyzes the expression "não obstante" throughout the history of Portuguese, in light of the grammaticalization framework of Hopper & Traugott (1993). To this end, it used the database of Davies and Ferreira (2006-), called "Corpus do Português" and available at http://www.corpusdoportugues.org. The data were collected and analyzed according to syntactic, semantic, and frequency criteria. Minas é Plural - Faculdade de Letras da UFMG - Belo Horizonte
71  10/17/2011 Journal 2011 COHA  Martin HILPERT Dynamic visualizations of language change: Motion charts on the basis of bivariate and multivariate data from diachronic corpora In this paper, I use COHA data to make so-called motion charts, which present a new way to visualize diachronic language change. International Journal of Corpus Linguistics 16/4, 435-461
72  10/15/2011 Journal 2009 COCA  John Newman English posture verbs: An experientially grounded approach COCA is used as the source of data to explore the use of posture verbs (sit, stand, lie) with inanimate subjects in English. Annual Review of Cognitive Linguistics 7: 30-58
73  10/15/2011 Book 2010 COCA TIME  Randi Reppen Using corpora in the language classroom COCA and Time were used as examples for classroom activities, and screen shots from these are also included. Cambridge: Cambridge University Press
74  10/11/2011 MA/PhD 2011 COCA COHA  JI WON LEE A Corpus-based Study of _much_ as a Negative Polarity Item My paper investigates the diachronic development of much as an NPI. This comes with three related questions: 1) Is the frequency of use of NEG much related to the non-use of paucity little (I don’t have much money vs. I have little money)? 2) How is the emergence of the quantifier (a) lot(s) in the early 19th century related to much becoming an NPI? 3) Do different functions of much affect its NPI tendency? (e.g. adverbial vs. pronominal use of much) University at Buffalo
75  10/9/2011 MA/PhD 2010 CDE  Carla CORREIA A Razón de Amor e a poesia galego-portuguesa I carried out a study of the vocabulary of the Razón de Amor in comparison with Galician-Portuguese lyric poetry, as well as with some medieval Castilian texts. The Corpus del Español contributed to this! Universidade do Porto
76  10/9/2011 Chapter 2010 COCA  David HOOVER Some Approaches to Corpus Stylistics Evidence from COCA is used in an argument about meaning in a Robert Frost poem. More generally I argue that COCA provides substantial evidence for literary arguments in areas where evidence has never before been available. Stylistics: Past, Present and Future. Shanghai Foreign Language Education Press, 2010: 40-63.
77  10/9/2011 Journal 2007 TIME  David HOOVER The End of the Irrelevant Text: Electronic Texts, Linguistics, and Literary Theory Data from the Time corpus is used in an argument against Chomsky's innateness hypothesis. More broadly it is part of an argument that the nature of Chomskyan linguistics is partly responsible for the neglect of the text in recent critical practice. Digital Humanities Quarterly 1(2) 2007
78  10/7/2011 MA/PhD 2011 COCA  Sanna Franssila The Marketplace of Ideas. A Corpus Study of Buy and Sell Metaphors in American Political Discourse. The following publications in the COCA were applied in this study: Harper’s Magazine, The Mother Jones and The Washington Monthly for the liberal news magazines, The American Spectator, National Review, U.S. News & World Report and USA Today Magazine for conservative. Fox News represented conservative cable TV news and MSNBC liberal cable TV news. Rush Limbaugh data represented conservative radio news and National Public Radio center-left radio news. The University of Helsinki
79  10/4/2011 Journal 2011 COCA  John Newman Corpora and cognitive linguistics. COCA was used to retrieve frequency information on the occurrence of nouns in two constructions: to experience N, and experience of N. Results were used as the basis for a collostructional analysis comparing preferred collexemes in the two constructions. Brazilian Journal of Applied Linguistics (2011) 11.2: 521-559. Special issue on Corpus studies: future directions, edited by S. Th. Gries.
80  10/3/2011 Journal 2009 COCA BNC  Kyeongmin Kim and Jong-Bok Kim A Functor Treatment of such: A Corpus-based Approach Based on a corpus examination, Jong-Bok Kim and I attempt to provide a uniform treatment in which such is taken to be a 'functor' selecting its grammatical dependents. Korean Journal of Linguistics 34.4: 853-870
81  9/27/2011 Book 2012 CDE  Paul Gévaudan Der spanische Komplementsatz als Ausdrucksform der Modalität This study analyses Spanish complement clauses semantically. It verifies the hypothesis that complement clauses are the prototypical expression for modality (onomasiological perspective) and that modality is the prototypical meaning of complement clauses (semasiological perspective). The study is largely based on material of the CDE. University of Tübingen
82  9/21/2011 MA/PhD 2011 COCA  Ji-Yeon Chang The use of general and specialized corpora as reference tools for academic and technical English writing: A case study of Korean graduate students of engineering Along with a hand-made specialized corpus, COCA was used as a language reference tool to assist NNES (non-native English speaking) graduate students of engineering in writing in English. This study was designed to investigate how NNES students use, perceive, and evaluate corpora as language reference tools. Seoul National University
83  9/19/2011 Journal 2004 BNC  Edoardo LOMBARDI VALLAURI Grammaticalization of syntactic incompleteness: free conditionals in Italian and other languages. Interviews on the spoken language of several informants were prepared according to results that were foreseen by previously checking general trends of the studied phenomenon (free conditionals) on the British National Corpus. In “SKY Journal of Linguistics” (The Linguistic Association of Finland), 17, 2004, 189-215.
84  9/19/2011 Journal 2010 COCA COHA BNC TIME  Edoardo LOMBARDI VALLAURI Free conditionals in discourse: the forming of a construction. The relevant corpora were used to control data previously collected from informants, and to check statistical tendencies on the linguistic phenomenon mentioned in the title (free conditionals). In Linguisticae Investigationes 33:1 (2010), 50–85.
85  9/17/2011 MA/PhD 2009 CDE  Rosio MOLINA Lexicografía del noroeste novohispano: entre la tradición y la innovación The recording of certain entries across different centuries was checked in order to verify the innovations of the vocabularies. Ludwig-Maximilians-Universität München, Germany
86  9/13/2011 Journal 2011 CDE  Matthew T. Carlson & Chip Gerfen Productivity is the key: morphophonology and the riddle of alternating diphthongs in Spanish The corpus was used to measure productivity of a set of suffixes as well as stem frequencies. Derivations with target suffixes were also analyzed for allomorphy, to show that suffix productivity and whole-form frequency predict stem allomorph. Language 87:3, pp 510-538
87  9/9/2011 Other 2010 COCA CDE  Valerio Báez San José & Marciana Loma-Osorio Fontecha & Guillermo Fernández Rodríguez-Escalona Diccionario sistemático de los determinadores temporales en español y sus correlatos ingleses Having described the system of temporal determination of the utterance, the authors present a dictionary of temporal determiners with their English correlates. Each entry consists of a lemma, which captures its value within the system as a whole, and its corresponding definition in terms of 1) the determinative value of each determiner in the performative structure of the utterance, 2) a clarifying example, and 3) its semantic-pragmatic labels. Estudios de Lingüística del Español (ELiEs), Volume 29, ISSN: 1139-8736, pages 1-1265.
88  9/6/2011 Chapter 2009 COCA BNC  Mark Kaunisto The Rivalry between English Adjectives Ending in -ive and -ory The most frequently occurring rival adjective pairs ending in -ive/-ory were identified in both COCA and the BNC, and the relative frequencies were contrasted with one another. A semantic analysis of the rival words was conducted. Selected Proceedings of the 2008 Symposium on New Approaches in English Historical Lexis (HEL-LEX 2), edited by R. W. McConchie, Alpo Honkapohja, and Jukka Tyrkkö; pp. 74-87
89  9/5/2011 Journal 2010 COCA CDE  Elizabeth PENA Cognates facilitate word recognition in young Spanish-English bilinguals’ test performance. Item frequencies in English and Spanish for cognates and non-cognates on the TOLD-P:3 were compared. Early Childhood Services Journal. 4, 55-67.
90  9/5/2011 MA/PhD 2009 COCA BNC  Silvie CINKOVA Words that Matter My thesis focuses on Swedish, but some phenomena were shown on English, too. Apart from that, I used COCA a lot to make my English academic writing more idiomatic. UFAL Prague, Charles University
91  8/29/2011 Other 2011 COCA COHA BNC TIME CDP  Neil HONE Communicative English at Work Program 2.0 Level C I am using the data to get a rough idea of usage and frequency of certain word combinations that will be presented to learners in a published work in the near future. The publication will be a Government of Canada publication and is intended for Canadian public servants. Government of Canada Publications, Ottawa
92  8/23/2011 Book 2007 COCA  Zooming in: An integrated English course I use the corpora to design vocabulary exercises for the textbook. Qin, X.B., Jiang, J.Y., Yang, Y., Deng, X., Lee, K.C., & Zhou, P.D. (2007). Zooming in: An integrated English course. Shanghai: Shanghai Foreign Language Education Press.
93  8/15/2011 Other 2011 COCA BNC  I. Naumov (aka Ken Lee), J. Fyodorova New Millennium English 10 Vocab for the 10th grade students in Russia We checked 226 words and phrases from the active vocabulary list at COCA. We ranked them in order of their occurrence per million words in 1995-1999. We also checked 169 words from the same list at WordCount. Its data comes from the British National Corpus. http://www.titul.ru/central/index.php?id=94&mod=cat&catg=3&prod=259
94  8/11/2011 Journal 2011 COCA  Mark KOPROWSKI A review of 4000 Essential English Words This review critically examines Paul Nation’s self-study vocabulary textbook series designed for high-beginner to advanced students. The six-level series aims to present learners with the most useful 4,000 American English words. However, despite a colorful and attractive layout, engaging vocabulary exercises, and a number of original stories that naturally contextualize the target vocabulary, a corpus-based analysis (using the Corpus of Contemporary American English) reveals that the specified vocabulary words objectively fail to consistently meet the standard of essentiality. Koprowski, M. (2010). Review of 4000 Essential English Words, by Paul Nation. TESL-EJ. 14 (1) Journal on-line.
95  8/11/2011 Journal 2011 COCA BNC  lucie GOURNAY « Comment « formaliser » actually : reformulation, formalisme, ou autre ? » The goal is to find a reliable way to account for the polysemy of Actually. I argue that in the literature so far the data has been either overlooked or underused. New types of searches on the BYU-BNC, thanks to the various tools available, make it easy to come up with impressive data that can be taken into account for a new hypothesis on how Actually works. Travaux du Cerlico, 24, 45-61.
96  8/10/2011 Journal 2009 CDE  Huang Bailong A Study on the Bilingual News Translation of the China Post and Taiwan News Abstract Journalistic English delineated by its saliently characteristic writing style has been a widely implemented course in the ESP field, and so is news translation with its pivot in world communication, especially in the context of globalization, which conduces both language learning and cultural understandings.
97  8/10/2011 MA/PhD 2009 CDP  Caroline SANTOS A tradução da fala do personagem Hagrid para o português brasileiro e português europeu no livro Harry Potter e a Pedra Filosofal: um estudo baseado em corpus This study investigates the intersection between Translation Studies and Children's Literature; more specifically, it investigates Harry Potter and the Philosopher's Stone and its translations into Brazilian Portuguese and European Portuguese, both entitled Harry Potter e a Pedra Filosofal. The focus of interest lies in how the translators dealt with Hagrid's dialect in order to compare what they have done with what Krings (1996) calls reduction strategy and Klingberg (1986) considers a standardization. The initial hypothesis was that both translators would reduce the dialectal marks using, in their translations, transpositions of the spoken variety to the written mode through dialectal marks. A bilingual parallel corpus was used to verify this hypothesis, which constitutes a subcorpus of PEPCo - Portuguese-English Parallel Corpus (FERNANDES; BARTHOLAMEI, 2009). After data analysis it was noticed that the Portuguese translator's preferred or recurring linguistic patterns give more emphasis to oral marks than the Brazilian translator's style, but neither of them opts to translate the source-text dialect into a target-text dialect. Universidade Federal de Santa Catarina
98  8/8/2011 Journal 2010 COCA TIME  Lyubimova Svitlana Distributional Analysis Based on Linguistic Corpora (Concept FLAPPER) The article is devoted to distributional analysis of the data from American corpora with the view of completing semantic characteristics of the concept FLAPPER. The results of analysis are arranged in charts and summed up in distributional formulas. Scientific Records of Kirovograd State University, Issue 89 (2), p.38-44
99  8/3/2011 Journal 2008 BNC  wukyung ko A corpus-based study on the weak obligation modals The study investigates the differences in meaning and usage of the three weak obligation modals, should, ought to and had better, through analyzing the 100-million word British National Corpus data. Korean Journal of English Language and Linguistics, 8(2). 207-230.
100  7/30/2011 Journal 2011 COCA CDE  Michael Ramscar, Melody Dye, Hanna Poppick & Fiona O'Donnell McCarthy The Enigma of Number: Why Children Find the Meanings of Even Small Number Words Hard to Learn and How We Can Help Them Do Better Data from COCA & Corpus del Español were used to model frequency at which various number sets are usually discriminated in English and Spanish. These estimates then served as the input to simulations of the development of number discrimination and subitization. PLoS ONE 6(7): e22501. doi:10.1371/journal.pone.0022501
101  7/29/2011 Journal 2010 CDE  Claude Duée SE FAIRE / HACERSE + ATTRIBUT: UNE ÉTUDE CONTRASTIVE DE DEUX SEMI-COPULES PRONOMINALES This article addresses the construction se faire / hacerse + adjective. We based our work on an analysis of a translation corpus drawn mainly from Frantext (literary only) for French, and from Davies (a miscellany of texts from the Iberian Peninsula and South America) and ADESSE (written corpus) for Spanish. Romance Philology, vol. 64, Spring 2010
102  7/27/2011 Journal 2011 BNC  Rafael Alejo Making sense of phrasal verbs: A cognitive linguistic account of L2 learning The BNC corpus was used to compare the data of L2 learners with native speakers. AILA REVIEW
103  7/21/2011 Journal 2011 COCA  ilana salem L1-L2 sentence translation in classroom grammar tests Sentences, or rather particular lexical items and grammatical structures, translated from Hebrew to English were compared with COCA data. Translated grammatical structures or words/word combinations which were not found or were rare in the corpus, were considered pieces of unauthentic language. ELT Journal Advance Access, June 23, 2011
104  7/20/2011 Book 2006 COCA BNC TIME  Anna Dziemianko User-friendliness of verb syntax in pedagogical dictionaries of English. I used the corpora to design the experimental study. They served as sources of sentences (with key words in specific syntactic structures); the sentences were then used in the test. Tübingen: Max Niemeyer Verlag
105  7/18/2011 MA/PhD 2009 COCA  Sarah Love The role of connotation and ideology in economic translation: a case study Defended on August 31, 2009. The COCA corpus is used to show how the range of words that collocate with economic discourse in American English is very different from collocations with those same words in Argentine Spanish. These differences reveal underlying differences in connotation, which translators should consider when approaching a text. University of Geneva, School of Translation and Interpretation
106  7/12/2011 Chapter 2010 COCA COHA  Olga Boriskina An Algorithm for Analysis of Distribution of Abstract Nouns in Cryptotypes COCA-based quantitative analysis of the metaphorical patterns of abstract nouns is aimed at identification of cryptotypes of English nouns. The study of nominal cryptotypes enables us to test hypotheses of discourse behavior of abstract nouns and account for the potential occasional metaphors in English-speaking culture. Statistics of class membership is also COCA-driven. Proceedings of the 2010 International Conference on Artificial Intelligence (ICAI 2010). V. II. Hamid R. Arabnia et al (eds.). WORLDCOMP`10. July 12-15, 2010. Las Vegas, USA. CSREA Press. pp. 907-913.
107  7/12/2011 Journal 2011 COCA COHA BNC  Naoki FURUHI Teaching Basic Collocations for Japanese Learners of English Through Popular Movies This paper shows how the collocations useful for Japanese learners of English are used in famous and popular movies. ATEM Journal Teaching English Through Movies (Naoki FURUHI) 16/, 33-44 2011
108  7/11/2011 Journal 2007 COCA BNC  jiezhi yang Corpus Collocation Retrieval and Approaches to English Synonym Discrimination There is a consensus among linguistic experts on the notion that we shall know a word by the company it keeps. Not only can collocations allow us to get familiar with words, but they can also help us discriminate synonyms. People came to know the importance of collocations in the acquisition of words long ago, but for lack of proper tools like modern corpora and concordancers they had been unable to systematically analyse and study the collocations of words. Collocation retrieval extracts combinations of either habitually or grammatically related words from a corpus. With the achievement of collocation retrieval, quantitative measurements of collocational strength and presentation of salient collocations, Sketch Engine and some other concordancing programs provide us with unprecedented perspectives to examine the typical collocations of words. Their achievement represents a trend in the development of a new generation of corpora. Corpora with collocation retrieval are becoming practical tools in foreign language teaching and study. The problems encountered in traditional approaches of discriminating synonyms will be readily solved. CAFLE COMPUTER-ASSISTED FOREIGN LANGUAGE EDUCATION
109  7/2/2011 Journal 2011 COCA BNC  HAN CUN XIN Chunk-based exercise design My thesis recommends making full use of COCA and other online corpora in designing chunk-based exercises. JOURNAL OF CHANGCHUN UNIVERSITY, Volume 21, P108-110
110  6/14/2011 Other 2011 COCA  Archangela Yenny Satriawan Bringing Corpora into the Language Classroom I took concordance lines from COCA to illustrate my proposal. "Exploring Issues in English Studies" (Sugiharto, S., Ed.), "Pelangi Pendidikan" series, pp. 32-40
111  6/13/2011 Journal 2010 COCA  GREGORY FURMANIAK A frame-based approach to modality The case of obligation Examples taken from COCA were used for illustrative purposes. Belgian Journal of Linguistics, 24, 17–35
112  6/11/2011 Chapter 2009 CDE  Johan Pedersen Marginal brug af mådesverber i spanske bevægelsesudtryk. Typologisk status. In this study, I have exclusively used CE for a systematic study of directed motion expressions in Spanish. Ny forskning i grammatik, Merete Birkelund et al, Syddansk universitet, pp. 167-194
113  6/11/2011 Book 2011 CDE  Johan Pedersen The way-construction and cross-linguistic variation in syntax. Implications for typological theory. I have searched in CE for data on expressions of directed motion in Spanish. In J. Hudson et al. (eds.), The construal of spatial meaning: windows into conceptual space, Oxford: Oxford University Press 2011.
114  6/9/2011 Book 2011 COCA  Anne O'Keeffe, Brian Clancy, Svenja Adolphs Introducing Pragmatics in Use We briefly used the COCA corpus in chapter 1 - specifically, we looked at the 10 most frequent collocates for 'Democrat' and 'Republican'. Routledge
115  6/8/2011 Journal 2011 BNC  Kim Ebensgaard Jensen Construal operations of the English progressive construction This particular study is corpus-supported, rather than corpus-based, in the sense that examples from the BNC are used to illustrate the construal processes involved in the English progressive construction. Naturally occurring usage-events from the BNC are used, rather than introspectively derived examples, in order to maintain objectivity and authenticity of the data. International Journal of Cognitive Linguistics, 1/2, article 2.
116  6/2/2011 Journal 2008 CDE  Gloria Vázquez Diccionario de perífrasis verbales español-rumano Sentences from the Corpus del Español were used to supplement searches made in other corpora of the same language regarding the use of periphrases. Topor, M., G. Vázquez, A. Fernández (2008). "Diccionario de perífrasis verbales español-rumano", Ianua. Revista Philologica Romanica, 8, p. 101-111
117  5/23/2011 Journal 2006 BNC  Malgorzata Fabiszak and Przemysław Kaszubski Studying metaphor with the BNC This paper is an exercise in corpus-driven metaphor research. In the main part, we demonstrate the use of collocation evidence for the construction of Fillmorian semantic frames for four expressions in the ‘War’ domain – FIGHT WAR, DECLARE WAR, BATTLEFIELD and BATTLEGROUND. We combine manual categorisation of collocates identified in concordance lines extracted from the British National Corpus (BNC) with automatically derived collocation statistics (z-score). We then show how corpus citations demonstrate the scalar nature of metaphor. Finally, we use the L-Word Query function in the BNC Sara concordancer to identify and categorise metaphorical compounds containing the word war. General comments regarding the usefulness of corpus-based empirical methods for metaphor research are offered throughout the paper and in the conclusions. Poznań Studies in Contemporary Linguistics 41: 111-129
118  5/18/2011 Journal 2009 COCA BNC  José L. Oncins-Martínez Towards a Corpus-Based Analysis of Anglicisms in Spanish: A Case Study This paper aims to show some of the advantages of and the need for using corpora for exploring and assessing Anglicisms in contemporary Spanish. In order to do this a case study is presented: the adverb dramáticamente, as it is taking on the new sense ‘espectacularmente’ under the influence of English dramatically. The presence of this adverb with this new sense in contemporary Spanish is explored and supported with the data found in CORDE and CREA, the two corpora of the Royal Spanish Academy. Since the paper also seeks to show in what ways the Spanish lexicon is being influenced by English, evidence from two major English corpora, the BNC and the COCA, will be used. International Journal of English Studies, Special Issue, 2009, 115-32
119  5/18/2011 Book 2010 BNC CDE  Julia Lavid, Jorge Arus, Juan Rafael Zamorano Systemic Functional Grammar of Spanish: A Contrastive Study with English The corpus was used to extract examples to illustrate the description of English in the contrastive part of our book. London: Continuum
120  5/17/2011 Chapter 2011 COCA COHA BNC  Susanne Wagner Concessives and Contrastives in Student Writing: L1, L2 and Genre Differences This study investigates the use of concessive markers in academic writing. Frequency differences between academic sub-genres can be established on the basis of large reference corpora such as the British National Corpus (BNC) and the Corpus of Contemporary American English (COCA). These are put in perspective through comparisons with specialised corpora of academic writing (Corpus of British Academic Written English, BAWE and the Michigan Corpus of Upper Student Papers, MICUSP). A learner component is introduced by analysing four markers, although, however, nevertheless and while, in student writing. The student data stem from two sources: German natives enrolled in English Studies programmes at Chemnitz University of Technology and Czech natives studying at Brno Masaryk University. Statistical analyses reveal striking significant differences both between native and non-native writing as well as between German and Czech production. In: Schmied, Josef (ed.), Academic Writing in Europe: Empirical Perspectives. Göttingen: Cuvillier, 23–49.
121  5/14/2011 Journal 2012 COCA COHA BNC TIME  chris smith the dysphemic euphemism implied in unVables Out of 150 unVable adjectives attested in the dictionary, 82% (123) are attested in the singular (as adjectives) in BYU-BNC, and only a handful (14, i.e. under 10%) are attested in the inflected form. There are however huge differences in the frequency of occurrence, ranging from 1 occurrence for unadoptable to 983 occurrences for unreasonable. The Smith (2005) corpus provides a few occurrences where none were found in BYU-BNC (unspeakables, unconceivables) as does Google (unwhisperables, unthinkables, unutterables). The following table summarizes the findings... Lexis, E-journal in English Lexicology
122  5/13/2011 Chapter 2010 COCA BNC  Bert Cappelle Doubler-upper nouns: A challenge for usage-based models of language? Results of competing morphological patterns were compared. In particular, hapaxes among the results were used in measures of productivity. Cognitive perspectives on word formation, A. Onysko & Sascha Michel (eds), New York: Mouton de Gruyter, pp. 335-374
123  5/13/2011 Other 2011 COCA  Erik Wennstrom untitled crossword puzzle A crossword puzzle certainly doesn't count as a research publication, but it is a publication, and I did use COCA extensively to create the puzzle. Good crossword puzzle entries are often words and phrases that occur frequently in natural language but that don't necessarily show up in word lists (or even dictionaries). I frequently consulted COCA to find such words and phrases. In this particular crossword, there were several entries that I probably would never have found without the corpus, including "no-go", "in beta", "angel hair", and "dust-up". The corpus is clearly not designed for this sort of work (typically, I use it to help find natural language examples for teaching logic), but it's a testament to its versatility that it was so useful here as well. New York Times, March 30, 2011
124  5/12/2011 Other 2011 COCA  Archangela Yenny Satriawan Bringing Corpora into the Language Classroom Selected concordance lines from COCA were used to illustrate the idea of bringing corpora into the language classroom. (Article in a book) Pelangi Pendidikan, Setiono Sugiharto (Ed.), pp. (in press)
125  5/12/2011 Journal 2010 COCA  Pranav Anand and Adrian Brasoveanu Modal Concord as Modal Modification Modal concord is the phenomenon by which certain modal adverbs seem to become semantically vacuous in the presence of modal auxiliaries if the adverb and the auxiliary match in quantificational force (e.g., universal vs existential) and modal flavor (e.g., epistemic vs deontic). We propose that neither concord adverbs nor modal auxiliaries are vacuous: concord adverbs are modal-auxiliary modifiers and the compositional process ensures that both auxiliary and modifying adverb comment on the same proposition (contributed by the same radical), with respect to the same modal base. The data mostly consists of sentences from the Corpus of Contemporary American English. Sinn und Bedeutung 14
126  5/5/2011 Book 2007 TIME  Myoung Hyoun Song Corpus-based Analysis of the English Tough-Construction This corpus-based study reveals facts about the proportion and structure of the English Tough Construction. Firstly, the overall ratio of TC pattern to Extraposition pattern is 3.5:6.5 when the tough adjectives simply occur with a subordinate to-infinitive clause. And then the ratio of TC pattern to Extraposition pattern is 9.1 to 90.9 when they occur with for-NP sequence. Secondly, the Tough Construction is analyzed into 'tough-related adjective + an optional for-NP complement + to infinitival complement'. As for the status of the infinitival subclause, it is argued to be a complement to the adjective by means of criteria for complementhood. And the status of the for-NP sequence also turns out to be a complement to the tough adjective on the basis of syntactic and semantic facts. Lastly, there arises an ambiguity in meaning of TC when the tough adjective is modified by the adverb 'too', which can be clarified by considering the context in which the sentence occurs. VDM Verlag, Germany
127  4/28/2011 Journal 2010 BNC  Miriam Taverniers & Gudrun Rawoens Three types of oriented adjuncts in English and Swedish In this paper an account is given of three well-defined subtypes of oriented adjuncts in two Germanic languages, namely English and Swedish. The English data are extracted from the BNC. Moderna Språk, 104(1), online
128  4/28/2011 Book 2011 BNC  Alexander Osherenko Opinion mining and lexical affect sensing. Computer-aided analysis of opinions and emotions in texts. This dissertation investigates opinion mining and lexical affect sensing. It discusses a statistical approach to affect categorization of different texts (grammatically correct and incorrect movie reviews, dialogues, product reviews) and uses the BNC frequency list and the lemmatized BNC frequency list as feature sources. Südwestdeutscher Verlag für Hochschulschriften
129  4/21/2011 Chapter 2010 CDE  Esther Brown Variable development of intersubjectivity in Spanish To calculate collocational frequencies of 'no se' in Madrid. A Mosaic of Corpus Linguistics. Selected Approaches. Ed. by Aquilino Sánchez & Moisés Almela, 61-78. Frankfurt/Berlin: Peter Lang Frankfurt.
130  4/21/2011 Journal 2006 CDE  Esther Brown The velarization of labial, coda stops in Spanish: a frequency account BYU corpora were used to calculate frequencies of phonotactic patterns. Revista de Lingüística Teórica y Aplicada, 44.2, 47-58.
131  4/19/2011 Journal 2010 COCA BNC  Pavel Dronov Idioms in communication and meta-communication (in Russian, with an abstract in English) This article deals with the problem of idioms in meta-communication. Apart from the widespread hedges, meta-communicational idiom modifications are frequently used in speech. A modification of this kind is produced by means of inserting an adjectival modifier into an idiom. The adjective in this case indicates the idiomatic nature of the whole phrase. The author states that while these modifications are common in English or German, they rarely occur in Russian. The author uses data obtained from English, German, and Russian corpora (COCA+BNC, DEREKO+Wortschatz, and Ruscorpora, respectively) to observe these modifications. Lingvistika i metodika prepodavanija inostrannyx jazykov = Linguistics and Methods of Language Teaching (e-zine)
132  4/18/2011 MA/PhD 2010 COCA BNC  Pavel Dronov Insertion of an adjectival modifier into the structure of an idiom: on the semantic conditions of lexico-syntactic idiom modifications. Based on Russian, English and German idioms. Synopsis of a candidate thesis (in Russian) The data on adnominal idiom modifications in English were obtained mainly from the BYU corpora. At the time, there was no BNC on this page, hence the author used 2 corpora for his studies. Viktor V. Vinogradov's Institute of the Russian Language of the Russian Academy of Sciences
133  4/18/2011 Book 2011 COCA BNC TIME  Juhani Rudanko Changes in Complementation in British and American English: Corpus-Based Studies on Non-Finite Complements in Recent English The author examines changes and innovative trends in the grammar of non-finite complements in the book, taking both British and American English into account. He draws on large electronic corpora, including the TIME Corpus, the Corpus of Contemporary American English, and the BNC. Palgrave Macmillan; Houndmills, Basingstoke, Great Britain
134  4/15/2011 Book 2011 COCA  Jim Papple O Canada Workbook Many of the vocabulary activities from the workbook came from collocate studies of COCA. Oxford University Press Canada
135  4/15/2011 Journal 2010 BNC  Ewa Gieroń-Czepczor Red and czerwony as cognitive categories The study is based on the BNC concordance for 'red' which was analysed manually in order to determine the prototypical conceptual core of 'red' in real-language data, as well as all possible extended (metonymic and metaphorical) senses. The data were contrasted with those for the Polish equivalent of red (czerwony), also analysed against available corpus data for the Polish language. SKASE JOURNAL OF Theoretical Linguistics, vol. 7 (2010), no.3
136  3/21/2011 MA/PhD 2008 BNC  Marie Nordlund From Physical to Mental Acquisition: A Corpus-Based Study of Verbs My dissertation is completely based on corpus data. I investigated the frequency of meaning extensions into the mental and abstract domains of six English verbs with an otherwise concrete meaning. Luleå University of Technology
137  3/21/2011 Journal 2010 COCA  richard konopka Metaphors of economic risk: use of investment and gamble in contemporary American contexts Used COCA to corroborate data collected for economic copora purpose. Studies in International Cultures 5(2):21-47 (國際文化研究)
138  3/18/2011 MA/PhD 2010 CDP  Igor Costa A Construção Superlativa de Expressão Corporal: uma abordagem construcionista In order to understand what is the real syntatic, semantic and pragmatic context of the investigated constructions, the "Corpus do Português" is used as a source of "real" occurences of this constructions. Universidade Federal de Juiz de Fora
139  3/15/2011 Chapter 2009 BNC  Mateusz-Milan Stanojevic, Jelena Parizoska, Lea Banovic Schematic idioms and cultural models This paper combines the constructional approach to idioms (Fillmore et al. 1988) with an investigation of idiom motivation based on ICMs/cultural models (Lakoff 1987). We will compare the syntactic and semantic make-up of English constructions with the combination of components “in ... eyes” and Croatian constructions with the combination of components “u ... očima”. The aim is to show that these combinations are in fact schematic idioms with a relatively stable structure, depending on the ICM/cultural model motivating them. In order to show this, we analyzed a sample of these constructions from the BNC and the Croatian National Corpus. The analysis has confirmed that there are a limited number of cultural models motivating most expressions in both languages and that each model is associated with a particular schematic structure. Newcastle: Cambridge Scholars Publishing
140  3/13/2011 Chapter 2010 CDE  Daniel Villa Y nos vamos patrás: Back to an analysis of a supposed 'calque' I used the Corpus to demonstrate that the construction 'verb + para atrás' is found in the earliest documentation of the Spanish language. Iberoamerica/Vervuert: Madrid/Frankfurt, eds. Susana V. Rivera-Mills and Daniel J. Villa, pp. 239-251
141  3/10/2011 Chapter 2010 BNC  Francisco Cortés-Rodríguez Constructions and constraints in English middle structures: A Lexical Constructional analysis BNC data were used as support for the analysis offered on English middle structures. 10th Conference of the European Society for the Study of English, University of Torino (Italy)
142  3/8/2011 Journal 2010 COCA  Ji-Yeon Chang Postsecondary EFL students' evaluations of corpora with regard to English writing Recently, as a corpus, which has been mainly used for linguistic analysis, is emerging as an innovative tool in language learning, a growing body of research has paid attention to the efficacy of corpus use from educational perspectives. Given that corpus use for pedagogical purposes entails autonomous and independent learning of language learners, it is important to uncover how corpus users evaluate it based on their own experience. While most of the previous studies were conducted in the ESL settings, the present study sets out to examine postsecondary Korean students’ opinions of general and specialized corpora with regard to their own English study and writing in the EFL setting. The questionnaires which twenty-eight students of two corpus workshops submitted were analyzed in terms of their evaluations of corpora as well as their prior writing experience, reference use, and attitudes towards English study and writing. The results show that the respondents, being dissatisfied with their English skills, were eager to increase their overall English skills and English writing skills. During English writing, they usually used two to three language references including bilingual dictionaries followed by monolingual dictionaries and Internet search engines. The results also prove that the respondents regarded corpora as helpful language references and that they would use them and recommend them to other students. The respondents, however, expressed disparate attitudes towards specialized corpora. In general, corpora were considered more appropriate to the intermediate and advanced levels of students in terms of language proficiency and academic experience. Using corpora for pedagogical purposes merits further research to validate its efficacy and investigate the possibility of its application in Korea. Chang, J.-Y. (2010). Postsecondary EFL students' evaluations of corpora with regard to English writing. The SNU Journal of Education Research, 19, 57-85.
143  3/1/2011 Book 2010 BNC  Sue Argent Access EAP: Foundations Key vocabulary used in the book was checked using the BNC to test the authors' intuitions. Garnet Education, Reading
144  2/27/2011 Book 2010 COCA BNC  Denilso Lima Gramática de Uso da Língua Inglesa My books are on collocation and English language usage, so the BNC and COCA were used to find real examples of some word combinations as well as of grammar in use. All my books and articles are based on corpora. Elsevier
145  2/24/2011 Chapter 2011 CDE  Malte Rosemeyer On the interplay between transitivity, factivity and information structure: Spanish nominal and verbal infinitives The frequency data from the Corpus del Español was used in a collostructional analysis in order to demonstrate the different preferences of two infinitive constructions in Spanish regarding the verb appearing as infinitive. García García, Marco; Bellosta von Colbe (to appear)
146  2/22/2011 Journal 2010 CDE  Carlos Benavides El componente semántico Trayectoria y los verbos de movimiento en español Through a corpus study the paper explores whether the lexical entries of two types of motion verbs contain the Path feature, a key element associated with motion verbs. Southwest Journal of Linguistics (SWJL) 28.2: 67-82.
147  2/22/2011 Journal 2006 BNC  Hiroshi Ohashi The Development of an English Intensifier Phrase: A Corpus-Based Study To examine the historical development of an intensifier phrase, "all you want/like." English Linguistics, 23:2, 403-432
148  2/21/2011 MA/PhD 2003 BNC  Di Kilpert Language and value: The place of evaluation in linguistic theory Chapter 3 of my PhD thesis uses the BNC extensively to look at how the terms 'prescriptive' and 'descriptive' are used by academic linguists and by ordinary speakers of English. Rhodes University, Grahamstown, South Africa
149  2/21/2011 Journal 2008 CDE  Hella Olbertz Dar + gerund in Ecuadorian Highland Spanish: contact induced grammaticalization? In the paper I dealt with a construction specific to Ecuadorian Spanish. I used the CdE in order to contrast Ecuadorian Spanish with general Spanish. The CdE was particularly helpful because of its highly flexible query syntax. Spanish in Context
150  2/21/2011 MA/PhD 2010 COCA BNC  Lise Hamelin Tentative de représentation de quelques prépositions de l'anglais moderne : le cas de to, Thèse de Doctorat Statistical evidence. Université de Caen Basse-Normandie
151  2/20/2011 MA/PhD 2010 COCA CDE  Jennifer Sia The metaphorical competence of anger for advanced Spanish-English L2 users Examining expressions of anger produced by advanced Spanish-English L2 users and English L1 users, the corpora (COCA and CdE) were used to determine if the expressions were more common in English, more common in Spanish or were unique to a multicompetent L2 user. In addition, both corpora were used to discuss the variation between languages (word form, grammatical, etc.). Birkbeck College, University of London
152  2/18/2011 Chapter 2012 COCA BNC  Pawel Szudarski Lexical syllabus I used BYU to check frequencies of words and their lexico-grammatical patterns used by Dave Willis in the Lexical syllabus. The Encyclopedia of Applied Linguistics, ed. by C. Chapelle. Published by Wiley-Blackwell
153  2/14/2011 Book 2011 COCA  Magnus Ljung Swearing. A Cross-Cultural Linguistic Study COCA data on English swearing was compared to corresponding data in the BNC. Palgrave Macmillan
154  2/12/2011 Journal 2011 COCA BNC CDE  Laura Alba-Juez & Elena Martínez-Caro “No wonder you’re in trouble!” The evaluative function as part of the hidden pragmatic meaning of certain expressions in English and Spanish We used the corpus to search the concordances and extended contexts of the expressions "No wonder"/ "I wouldn't be surprised if/that" in English, and "No me extraña"/ "No me sorprendería/ extrañaría que" in Spanish. To appear in the next issue of The Rocky Mountain Review, MLA
155  2/12/2011 Chapter 2009 COCA BNC  Laura Alba-Juez ‘Little words’ in small talk: Some considerations on the use of the pragmatic markers man in English and macho/tío in Peninsular Spanish I searched the concordances and contexts for the word "man" used as a pragmatic marker. R. P. Leow, H. Cámpos & D. Lardiere (eds.). Little Words. Their History, Phonology, Syntax, Semantics, Pragmatics and Acquisition. Washington D. C.: Georgetown University Press. 171-81
156  2/12/2011 Journal 2009 COCA  Klaus-Uwe Panther From syntactic coordination to conceptual modification: The case of the nice and Adj construction The data were used to support the claim that 'nice and Adj' is an emergent construction in the sense of Goldberg (1995). It has a non-compositional, inferentially derivable meaning and expresses a positive evaluation of a propositional content. Data was taken from COCA. Constructions and Frames 1: 56–86 [coauthor: Linda L. Thornburg]
157  2/12/2011 Book 2010 COCA  Seth Lindstromberg English Prepositions Explained COCA data were used especially to establish whether particular usages were frequent enough to examine and to make sure that no frequent usages were omitted from consideration. Some examples were also taken from COCA (with citation). Amsterdam/Philadelphia: John Benjamins
158  2/11/2011 Chapter 2010 COCA BNC  Raf Salkie On going Almost all the examples in the paper are from the BNC and COCA. Mark Davies's excellent web interface is acknowledged on page 170. In B. Capelle & N. Wada (eds.), Distinctions in English grammar: offered to Renaat Declerck (Tokyo, Kaitakusha), 169-190. ISBN 978-4-7589-2154-1.
159  2/11/2011 Journal 2010 CDP  Katiane Carvalho EVIDÊNCIAS DE USOS GRAMATICALIZADOS E NÃO GRAMATICALIZADOS DE DIZ QUE EM ORTO DO ESPOSO This text compares the contexts of use of the item diz que across distinct linguistic stages, moving back from the present to Old Portuguese. To that end, it adopts a perspective that provides for interaction between synchrony and diachrony, considered central to the study of the grammaticalization process, understood as regularization or conventionalization. Cadernos do CNLF, Vol. XIV, Nº 4, t. 3
160  2/10/2011 Journal 2004 BNC  Jennifer Hay & Ingo Plag What constrains possible suffix combinations? On the interaction of grammatical and processing restrictions in derivational morphology There is a long-standing debate about the principles and mechanisms that constrain the combinatorial properties of affixes, in particular of English suffixes. One group of scholars argues for the existence of lexical strata with strong restrictions holding between the different strata. This view is disputed by scholars who claim that it is selectional restrictions of individual suffixes that are responsible for the combinatorial properties of suffixes. Most recently, Hay (2000, 2002) has proposed a psycholinguistic model of morphological complexity, according to which an affix which can be easily parsed out in processing should not occur inside an affix which cannot. This model has been called “complexity based ordering”. The general claim is that affixes can be approximately ordered along a hierarchy of complexity, with more separable affixes at one end, and less separable affixes at the other end. More separable affixes can attach outside less separable affixes, but not vice-versa. The goal of this paper is to test the predictions of complexity based ordering through an investigation of 15 English suffixes and their potential 210 two suffix combinations. Using large data-bases such as the British National Corpus, the CELEX lexical data-base, the OED and the internet, we investigate whether the attested and non-attested combinations are best explained by complexity based ordering or by the individual selectional properties of these suffixes. We show that in most cases selectional restrictions and parsing restrictions coincide. Where selectional restrictions underdetermine possible combinations, complexity based ordering makes correct predictions. Only well parsable combinations are possible combinations, and this range of possible combinations is then further curtailed by selectional restrictions. In sum, we argue that both selectional restrictions and parsing constraints are instrumental in determining possible and impossible suffix combinations. Natural Language and Linguistic Theory 22, 565-596.
161  2/10/2011 Journal 2009 BNC  Ingo Plag & Harald Baayen Suffix ordering and morphological processing There is a long-standing debate about the principles constraining the combinatorial properties of suffixes. Hay 2002 and Hay & Plag 2004 proposed a model in which suffixes can be ordered along a hierarchy of processing complexity. We show on the basis of data from large corpora (including the BNC) that this model generalizes to a larger set of suffixes, and we provide independent evidence supporting the claim that a higher rank in the ordering correlates with increased productivity. Behavioral data from lexical decision and word naming show, however, that this model has been one-sided in its exclusive focus on the importance of constituent-driven processing, and that it requires supplementation by a second and equally important focus on the role of memory. Finally, using concepts from graph theory, we show that the space of existing suffix combinations can be conceptualized as a directed graph, which, with surprisingly few exceptions, is acyclic. This acyclicity is hypothesized to be functional for lexical processing. Language 85.1, 109-152
162  2/10/2011 Journal 2006 BNC  Ingo Plag The variability of compound stress in English: structural, semantic and analogical factors This article presents an experimental study in which three competing hypotheses concerning NN compound stress assignment are tested. The stress patterns of novel and existing compounds, as obtained in a reading experiment with native speakers of American English, were acoustically measured and analyzed. The results show that there is indeed a considerable amount of variation in stress assignment, and that all three hypothesized factors, i.e. structure, semantics, and analogy, are relevant, though to different degrees. On a theoretical level, the findings strongly suggest that a categorical approach cannot be upheld and that probability and analogy need to be incorporated into an adequate account of stress assignment in noun–noun constructions. Frequency data from the BNC are used to test the hypothesis that more frequent compounds tend to be more prone to left stress. English Language and Linguistics 10.1, 143-172
163  2/6/2011 Book 2004 BNC  Pam Peters Cambridge Guide to English Usage I used the BNC to support all quantitative statements about British English for this guide, though it was not available through the BYU website then. When updating the frequency statements for the second edition, I would be glad to access the BNC through this website. Cambridge University Press
164  2/3/2011 Journal 2010 COCA COHA  Stephen Mouritsen The Dictionary Is Not a Fortress: Definitional Fallacies and a Corpus-Based Approach to Plain Meaning Judges often rely on their intuition in order to determine the "Plain Meaning" of a term in a statute, a contract, or a constitution. But in difficult cases, meaning is rarely "plain," and the legal system would therefore benefit from a method for examining lexical ambiguity that is more reliable than intuition. In this paper, I test the United States Supreme Court's conclusion in Muscarello v. United States that the "ordinary meaning" of "carries a firearm" in 18 U.S.C. § 924(c)(1) has reference to carrying a firearm in a vehicle and not just on one's person. I relied on the COCA to examine contemporary usage of the verb "to carry" when it collocates with "firearm" and its synonyms. I used the COHA to determine the use of the phrase "carries a firearm" at the time of the law's enactment, i.e., 1968. 2010 BYU Law Review 1915.
165  1/28/2011 Journal 2010 BNC  Graham Ranger You see! Abstract in English: The segment you see is employed as a parenthetical comment clause in initial, median and final position, where it may be said both to indicate the argumentative status of the clause it targets, as an explanation or justification for a previous representation, and to function as a politeness marker, in associating the cospeaker with propositions endorsed by the speaker, thereby defusing potential discord before it manifests itself. It is, in this second function, a marker of negative politeness (Brown and Levinson 1978). Quirk et al (1985) also mention an alternative use of stressed you see to express triumph. In this paper I will propose an enunciative characterisation of you see which will, it is hoped, enable us to derive different, sometimes contradictory, interpretations of you see from the metalinguistic operations of which you see is the trace, and from observable differences in contextual configurations, including the identification of the related propositions or the position you see occupies relative to these propositions. From our study it emerges that you see is not in itself a marker of politeness, but that the operations which you see marks lend themselves to strategies including impoliteness (a Face Threatening Act – FTA) as well as negative and positive politeness, in the terms of Brown and Levinson. In some contexts it also appears to allow an uncertain speaker to elicit manifestations of positive politeness (backchannels) from the co-speaker. We are led to consider how the theory of enunciative operations might contribute to research in the fields of politeness and discourse structure. The BYU corpora were used for frequency tables by text and genre-type and in order to locate specific examples for commentary within the article. 
Keywords: discourse marker, parenthetical, comment clause, enunciation, politeness, fixed phrase, discursive strategies, grammaticalisation, pragmaticalisation Lexis - Online Journal in English Lexicology
166  1/25/2011 MA/PhD 2011 COCA  Jelena Markovic The effects of explicit instruction of formulaic sequences on L2 writers I referred to COCA to create a study instrument that measures learners' abilities to produce formulaic sequences in a controlled situation (i.e., C-test). University of Utah
167  1/22/2011 Journal 2011 COCA  Mai Kuha Degrees of anthropocentrism in accounts of wildlife-vehicle collisions I looked at the collocations of "hit" and "strike" to show that "bird strike" is not a neutral way of referring to a collision involving a bird and a plane. Society & Animals 19(1). 1-21.
168  1/10/2011 Journal 2010 COCA  Margaret Otake English Loanwords from Japanese: A Survey of the Perceptions of American English Speakers As Japanese cultural patterns and icons have become more familiar to English-speaking North Americans, several new vocabulary items have entered the English language from Japanese sources. A selection of some of the more common and newer terms from Japanese was made based on print sources and conversational use by American English speakers. The words chosen were: geisha, tsunami, kamikaze, honcho, hibachi, karaoke, anime, manga, otaku, edamame, and sudoku. A survey of speakers of American English of a wide range of ages and backgrounds was conducted to determine the sense of the meanings of these words from the informants’ point of view, the ways in which the words are used in grammatical structures, and the degree of awareness of the language origin of the words. A comparison of the response rates by the participants is made to the historical timeline of the words in dictionaries and their frequency of use in a major American English corpus, the COCA. A report on the emerging consensus of meaning, grammatical usage, and collocation by communities of speakers is presented. Each item is also analyzed for changes that have occurred in meaning as compared to the original and present meaning in Japanese. Tokyo Seitoku University Bulletin, No.17, pp.17-32
169  1/7/2011 Book 2009 BNC TIME  Wendy Anderson and John Corbett Exploring English with Online Corpora Data from the BYU corpora (BNC and TIME) are used alongside data from the SCOTS corpus and other freely available online corpora. The book takes students through the analysis of various areas of linguistics using online corpora. Palgrave Macmillan, Basingstoke, UK
170  1/3/2011 MA/PhD 2008 CDE  Ariel Novodvorski A representação de atores sociais nos discursos sobre o ensino de espanhol no Brasil em corpus jornalístico The corpus is available at http://www.corpusdelespanol.org. It comprises one hundred million words and covers the period from the 13th to the 20th century, distributed as follows: twenty million from the 13th to 15th centuries; forty million from the 16th to 18th centuries; twenty million from the 19th century; and twenty million from the 20th century. For the twenty million words of the 20th century there is a balanced body of 25% spoken Spanish, 25% fiction, 25% newspaper texts, and 25% academic texts, other texts, and non-fiction. The earlier periods do not have the same variety of registers, but the author of the corpus notes that he sought a balance between fiction and non-fiction. The "Corpus del Español" was created six years ago and, in October 2007, its latest version was made available to the public. This information was taken from the interview with Dr. Mark Davies, author of the corpus, published in La página del Idioma Español and available at http://www.elcastellano.org/ns/edicion/2007/diciembre/corpus.html. Accessed 10 March 2008. The term "aburrido", according to the Señas dictionary of the Universidad Alcalá de Henares (Spain), means tedioso ('tedious') in Portuguese when used with the verb ser, whereas with the verb estar it means entediado ('bored'). The DRAE gives the following definition of aburrimiento: “Cansancio, fastidio (chateação), tedio, originados generalmente por disgustos o molestias (incômodos), o por no contar con algo que distraiga y divierta” (our translation, our emphasis). It was also found, in the Corpus del Español (100 million words), that the use of aburrido has a negative connotation. This corpus is available at: http://www.corpusdelespanol.org/. Accessed 29 March 2008. Universidade Federal de Minas Gerais (UFMG) - Brasil
171  1/3/2011 Book 2009 COCA BNC TIME  Hans Lindquist Corpus Linguistics and the Description of English Review: "Lindquist's 'Corpus Linguistics and the Description of English' is an excellent book for anyone wishing to become acquainted with corpus linguistics and its wide range of applications. As it can be read either from cover to cover or perused selectively, it is suitable for many types of readers." The data is based primarily on the corpora from corpus.byu.edu, such as COCA, TIME, and BYU-BNC. Edinburgh Univ Press
172  12/30/2010 Book 2008 BNC  Shin Ishikawa Eigo Kopasu to Gengo Kyouiku [English Corpus and Language Education] The book briefly introduces BYU BNC and shows how it can be used for language studies. Tokyo: Taishukan
173  12/27/2010 Book 2010 COCA BNC TIME  Gena R. Bennett Using Corpora in the Language Learning Classroom: Corpus Linguistics for Teachers Contains many corpus-designed activities that rely on several tools (MICASE, COCA, TextSTAT, for example) to teach a variety of language skills. Ann Arbor: Univ. of Michigan
174  12/27/2010 Book 2010 COCA TIME  Joan Bybee Language, Use, and Cognition The data in many of the chapters is based on COCA and TIME. Cambridge: Cambridge Univ. Press
175  12/22/2010 Journal 2010 COCA  Ling Fang, Xingfu Wang Self-Study and Classroom Use of the Corpus of Contemporary American English Abstract: The paper teaches learners how to use the Corpus of Contemporary American English (COCA) effectively in their study to improve language ability and broaden their horizon on American English. The main parts in this research set good examples in searching Mutual Information of strings, semantic searching and genres analyses about words, and the free use of English proverbs is also analyzed. The authors conclude that learners can benefit much from using COCA in searching key information in their contexts and would be more native-like in speaking and writing English. Foreign Languages in China. 2010(6):79-84
176  12/19/2010 Journal 2008 COCA  Xingfu Wang, Mark Davies, Guohui Liu The Corpus of Contemporary American English — A Useful Tool for English Teaching and Research Abstract: The article is an introduction to the Corpus of Contemporary American English (COCA), created by Prof. Mark Davies. This corpus contains 360 million words of material published in America from 1990 to 2007, and it is the largest balanced English corpus. The Corpus of Contemporary American English is free for researchers and English learners to use online. Computer-Assisted Foreign Language Education in China. 2008(5):24-31
177  12/15/2010 Journal 2008 CDP  Tiago Torrent A HIPÓTESE DA DUPLA ORIGEM PARA A CONSTRUÇÃO DE DATIVO COM INFINITIVO: PRIMEIRAS INCURSÕES PELO PORTUGUÊS MEDIEVAL Data from the Corpus do Português were used as evidence in support of the Dual Origin Hypothesis for the Dative with Infinitive Construction, developed in the article. Revista da ABRALIN, v. 7, n. 2, p. 65-92, jul./dez. 2008
178  12/15/2010 MA/PhD 2009 CDP  Tiago Torrent A rede de construções em para (SN) infinitivo: uma abordagem centrada no uso para as relações de herança e mudança construcionais The data were used to trace the diachronic trajectory of the "para infinitivo" constructions studied. Occurrences of the constructional patterns under study were collected in order to support the Constructional Convergence hypothesis developed in the thesis. Universidade Federal do Rio de Janeiro
179  12/14/2010 Chapter 2011 COCA  Christian Mair Change and diversification in present-day English—web-based perspectives Uses COCA data, in addition to Web data, to examine several different constructions. Handbook on the History of English: Rethinking and Extending Approaches and Methods. Terttu Nevalainen and Elizabeth Closs Traugott, eds. Oxford University Press, 2011
180  12/13/2010 Book 2010 CDE  Bert Le Bruyn Indefinite Articles and Beyond This dissertation uses diachronic quantificational data obtained from the Corpus del Español to argue in favour of an article analysis of Spanish "unos". Utrecht University
181  12/11/2010 Chapter 2011 COCA COHA TIME  Anne Curzan Revisiting the Reduplicative Copula with Corpus-based Evidence COCA provides key data for the forms of the reduplicative copula (e.g., "the thing is is") in current spoken English. Time and COHA provide historical evidence for shifts in focusing phrases such as "the point is." In: Handbook on the History of English: Rethinking Approaches to the History of English, eds. Terttu Nevalainen and Elizabeth Closs Traugott. (Forthcoming, 2011)
182  12/11/2010 Journal 2009 COCA TIME  Clay Beckner and Joan Bybee A Usage-Based Account of Constituency and Reanalysis This study uses COCA data to illustrate that the complex preposition 'in spite of' functions as a unit in contemporary English, on the basis of semantics and trends in syntactic usage. Language Learning. 59: 27-46
183  12/11/2010 Journal 2009 COCA  Ilana Salem Corpus-based Evaluation of Test Items’ Linguistic Authenticity I used COCA frequency data to verify the frequency of word strings, and word-structure combinations, that appeared in teacher-written classroom tests. I consulted the KWIC feature to check the meaning and function of particular words in each of their occurrences. Humanizing language teaching. 11:6 (December 2009)
184  12/11/2010 Book 2011 COCA BNC  Thomas Payne Understanding English grammar: A linguistic introduction Data from the BNC and COCA are used extensively in this publication as illustrations of English grammar in use. Furthermore, statistical data are referred to in determining patterns of usage. For one small example, the percentage of voicing in root final consonants of the plurals of words like loaf, roof, half and oaf illustrates the variable nature of morphophonemic rules. Cambridge University Press
185  12/11/2010 Journal 2005 CDE  Mark Davies The advantage of using relational databases for large corpora: speed, advanced queries, and unlimited annotation This is based on a much earlier version of the corpus architecture that I've developed. International Journal of Corpus Linguistics
186  12/10/2010 Journal 2010 COCA COHA  Guenther Lampert and Martina Lampert Where does evidentiality reside? Notes on (alleged) limiting cases: quotatives and seem-verbs. This co-authored article (with Günther Lampert) reviews standard definitions of evidentiality which yield unwanted effects for alleged English evidentials: Some (like seem) do not specify an information source at all, but (if at all) only schematically refer to it; others (like reporting verbs/quotatives) are, in light of the envisaged cognitive-functional definition proposed for the database of evidential markers in European languages, best examples, yet are supposed to be (partially) excluded because of their non-emergent (or paradigmatic) meanings. The present meta-study on conceptualizing evidentiality, a far from trivial issue that must be settled before any database can be compiled, scrutinizes relevant parameters and applies them to two test cases, seem and the ‘mediated’ evidentials it is said, is said to, as they say, be like, and suggests conceptualizing evidentiality as a multi-dimensional contextual category. COCA and COHA provided authentic examples to support our arguments. STUF-Zeitschrift für Sprachwissenschaft, Typologie und Universalienforschung. (Nov 2010)
187  12/9/2010 Journal 2010 COCA TIME  James Vanden Bosch The Absolute Phrase in Contemporary English Although this study is mainly focused on absolute phrases I found in the ICE-GB, it also depends on the resources of COCA and TIME in order to establish the frequency of stereotyped and formulaic absolute phrases, work I could not have done without access to the information provided via COCA and TIME. Stylistyka XVIII (2009); pp. 323-335 (Opole, Poland)
188  12/9/2010 Journal 2010 COCA BNC  Ana Frankenberg Raising Teachers' Awareness of Corpora Corpus consciousness-raising exercises using the BYU-BNC and COCA to help newcomers to corpora develop basic corpus skills. Language Teaching
189  12/8/2010 Chapter 2010 CDP  Lewis Howe Detours along the Perfect Path (w/ Patrícia Amaral) For this paper, we used selected examples from the Corpus do Português, particularly from the older selection of texts. We were looking for instances of the "ter" + PastParticiple construction and how this construction developed over time. Romance Linguistics 2009: Selected papers from the 39th Linguistic Symposium on Romance Languages (LSRL), Tucson, Arizona, March 2009.
190  12/7/2010 Chapter 2010 COCA  Lewis Malamed Disagreement: How to Disagree Agreeably This publication, in a volume on Speech Acts, suggests the use of this corpus in order to learn how various hedging strategies are employed in disagreement. In: Martínez-Flor, Alicia and Esther Usó-Juan (eds.), Speech Act Performance: Theoretical, empirical and methodological issues. 2010. xiv, 277 pp. (pp. 199–216)
191  12/7/2010 Other 2004 CDE  Josep Alba-Salas Reseña del Corpus del español de Mark Davies This review-article examines a previous version of the Corpus del español (released in 2003). The review examines all major components of the corpus, including its graphic interface, search options, overall architecture, and textual sources. Besides assessing the strengths and limitations of the corpus, the article also discusses and illustrates its potential uses for linguistic and philological research, both synchronic and diachronic. La corónica: A Journal of Medieval Hispanic Languages, Literatures and Cultures, 33 (1): 207-229
192  12/7/2010 Journal 2007 CDE  Josep Alba-Salas On the life and death of a collocation: A corpus-based diachronic study of dar miedo/hacer miedo-type structures in Spanish This paper seeks to shed light on the diachronic evolution of collocations by examining structures formed in Modern Spanish with dar 'give' plus state nouns (e.g. dar miedo 'frighten', literally, 'give fear'). Using the Corpus del Español, I offer a quantitative and qualitative analysis of eighteen representative dar miedo-type collocations from the 1200s to the 1900s. The results show that although the basic properties of dar miedo-type structures have remained remarkably stable over the centuries, during the Middle Ages the verb hacer 'make' was used here almost as often as dar - a competition apparently inherited from Latin. While different nouns show different patterns of loss of hacer, echoing the lexical diffusion of certain grammatical changes, the 1500s saw a very sharp decline in 'make' cases across the board, leading to its complete disappearance from this context by the 1800s. The loss of hacer led to a radical simplification of the collocational properties of state nouns in Spanish vis-a-vis Latin and other Modern Romance varieties. This process resulted both from language-internal factors mostly related to the lexical semantics of dar and hacer and from three key sociolinguistic processes in 16th-century Spanish: koineization, change of norm and increased standardization. Diachronica
193  12/7/2010 Chapter 2009 CDE  Josep Alba-Salas Las estructuras tipo meter miedo en diacronía: Más detalles sobre la evolución histórica de las colocaciones causativas This study examines certain collocations formed (in modern colloquial Spanish) with the verb meter plus predicative nouns denoting psychological states, e.g. meter miedo, meter celos or meter envidia (a alguien). Drawing on data from the Corpus del español, the analysis traces the historical evolution of meter miedo-type collocations from the 13th century to the year 2000. Among other results, the analysis reveals that in earlier stages of the language meter competed with poner in these collocations, that this competition apparently entailed a process of lexical diffusion, and that modern Spanish has resolved this historical rivalry by specializing meter and poner for different collocational contexts. These results confirm certain trends of historical change documented by Alba-Salas (2007) in other collocational contexts. In Diacronía de las lenguas iberorrománicas: nuevas aportaciones desde la lingüística de corpus, ed. Andrés Enrique-Arias, 2009, ISBN 978-84-8489-489-1, pp. 343-364
194  12/6/2010 Chapter 2010 COCA BNC  Marcia Veirano Pinto O uso de things, thing, anything, something e everything em corpus de aprendiz This paper investigated the use of the words things, thing, anything, something and everything in learner corpora. The BNC and the COCA were used as reference corpora. Book title: Caminhos da Linguística de Corpus; Orgs: Tania Maria Granja Shepherd, Tony Berber Sardinha e Marcia Veirano Pinto; Publisher: Mercado de Letras. The book is due to be published in the first semester of 2011 in Brazil.
195  12/6/2010 Chapter 2009 BNC  Lachlan Mackenzie On a friendly and scholarly basis: Some remarks on an adverbializing construction This article considers the invariable frame "on a ... basis", examining the fillers with regard to category, complexity and meaning. The analysis uses the BYU-BNC to lay bare trends and to show that the filler is understood as the core of the adjunct, with the frame as its support. In S. Slembrouck, M. Taverniers & M. Van Herreweghe (eds), From will to well: Studies in linguistics offered to Anne-Marie Simon-Vandenbergen, Ghent: Academia Press, 313-324
196  12/5/2010 Journal 2010 COCA COHA  Martina Lampert and Guenther Lampert Where does evidentiality reside? Notes on (alleged) limiting cases: seem and be like This co-authored article (with Günther Lampert) reviews standard definitions of evidentiality which yield unwanted effects for alleged English evidentials: some (like seem) do not specify an information source at all, but (if at all) only schematically refer to it; others (like reporting verbs/quotatives) are, in light of the envisaged cognitive-functional definition proposed for the database of evidential markers in European languages, best examples, yet are supposed to be (partially) excluded because of their non-emergent (or paradigmatic) meanings. The present meta-study on conceptualizing evidentiality, a far from trivial issue that must be settled before any database can be compiled, scrutinizes relevant parameters and applies them to two test cases, seem and the ‘mediated’ evidentials it is said, is said to, as they say, be like, suggesting that evidentiality be conceptualized as a multi-dimensional contextual category. COCA and COHA provided authentic examples to support our arguments. STUF-Zeitschrift für Sprachwissenschaft, Typologie und Universalienforschung
197  12/4/2010 Journal 2009 COCA  François Maniez L’adjectif dénominal en langue de spécialité : étude du domaine de la médecine. Relational adjectives are a key component of specialized language in French as well as many other languages. The use of relational adjectives as a substitute for a prepositional phrase follows the principle of economy in language, but it is also a characteristic of specialized discourse: French physicians will use expressions like « cancer mammaire » or « infarctus myocardique » where the layman uses « cancer du sein » or « infarctus du myocarde ». In English as well, the choice between a modifying noun or adjective is sometimes difficult to make for non-natives who either have to write directly in English or translate from a Romance language such as French, where relational adjectives are frequently used. After studying morphological, syntactic and semantic aspects of relational adjectives, we attempt to identify some of the factors that seem to induce the use of nouns or relational adjectives as modifiers, based on examples drawn from two subsets of the Corpus of Contemporary American English, where particular attention is devoted to variation in the use of the pair heart/cardiac as noun modifiers. In John HUMBLEY, dir. La terminologie : orientations actuelles, Revue Française de Linguistique Appliquée, Volume XIV-2:117-130.
198  12/3/2010 Journal 2009 TIME  Martin Hilpert and Stefan Th. Gries Assessing frequency changes in multistage diachronic corpora: Applications for historical corpus linguistics and the study of language acquisition The use of corpora that are divided into temporally ordered stages is becoming increasingly wide-spread in historical corpus linguistics. This development is partly due to the fact that more and more resources of this kind are being developed. Since the assessment of frequency changes over multiple periods of time is a relatively recent practice, there are few agreed-upon standards of how such trends should be statistically interpreted. This article addresses the need for a basic analytical toolbox that is specifically tailored to the interpretation of frequency changes in multistage diachronic corpora. Using data from the TIME corpus, we present a number of suggestions for the analysis of data that analysts commonly face in historical studies, but also in the study of language acquisition. Literary and Linguistic Computing, 2009, 24, 4, Dec, 385-401
199  12/3/2010 Chapter 2011 COHA  Martin Hilpert Studying many a noun in COHA: Diachronic collostructional analysis meets the noun phrase This paper presents the approach of diachronic collostructional analysis (Hilpert 2006, 2008, to appear), a method for studying semantic change in grammatical constructions. It is used here on the basis of the Corpus of Historical American English (COHA, Davies 2010) in order to show its applicability to recent change. Whereas much work in collostructional analysis has addressed verb-headed constructions, such as modal auxiliaries, causatives, the ditransitive construction, or the passive, relatively little attention has been paid to the nominal domain, which is the focus of this chapter. In: Handbook on the History of English: Rethinking Approaches to the History of English, eds. Terttu Nevalainen and Elizabeth Closs Traugott. (Forthcoming, 2011)
200  12/3/2010 MA/PhD 2009 COCA  Hendra Tedjasuksmana English errors in lexical and grammatical collocations In investigating the English collocation errors made by Indonesian paper writers, I also discussed the sources of the errors, which explain why a particular verb, noun, adjective, etc. can or can't collocate with other particular words. This required knowing the characteristics of each verb, noun, etc., as well as how closely the two words collocate with each other. Atmajaya Catholic University, Jakarta
201  12/2/2010 Journal 2008 BNC  Dilin Liu Intransitive or object deleting? Classifying English verbs without an object This study consulted the BNC in the analysis of the typical verbs and contexts that involve object-deletion in English. Journal of English Linguistics, 36, 289-313
202  12/2/2010 Journal 2008 BNC  Dilin Liu Linking adverbials: An across-register corpus study and its implications The study used the BNC in a cross-register study of the use of linking adverbs in English. International Journal of Corpus Linguistics, 13, 491-518
203  12/2/2010 Journal 2009 BNC  Dilin Liu Using a corpus-based lexicogrammatical approach to grammar instruction in EFL and ESL contexts In this study, ESL/EFL students used the BNC to learn English vocabulary and grammar. The Modern Language Journal, 93, 61-78
204  12/2/2010 Journal 2010 COCA  Dilin Liu Is it a chief, main, major, primary, or principal concern? A corpus-based behavioral profile study of the near-synonyms Using the Corpus of Contemporary American English as the source data and employing a corpus-based behavioral profile (BP) approach, this study examines the internal semantic structure of a set of five near-synonyms (chief, main, major, primary, and principal). International Journal of Corpus Linguistics, 15, 56-87
205  12/2/2010 Journal 2010 COCA BNC  Dilin Liu Going beyond patterns: Involving cognitive analysis in the learning of collocations The study used cognitive analysis of corpus data (mostly from COCA) to show that collocations are generally motivated, i.e. not arbitrary as they have often been described. TESOL Quarterly. 44: 4-30
206  12/2/2010 Chapter 2010 TIME  Juhani Rudanko Tracking and Explaining Variation and Change in the Grammar of American English: a Case Study, with Evidence from the TIME Corpus The article substantiates variation and change affecting sentential complements of ''accustomed'' in one particular text type of American English during the first decade of the TIME Corpus and in the 1960s. Two explanatory principles are identified and discussed to account for the variation observed. Lenker, Ursula, Judith Huber, and Robert Mailhammer, eds., English Historical Linguistics 2008, Volume 1: the History of English Verbal and Nominal Constructions, Amsterdam: John Benjamins, pages 29-43
207  12/2/2010 Book 2005 CDE  Mark Davies A Frequency Dictionary of Spanish: Core Vocabulary for Learners Top 5000 lemmas in Spanish, arranged in order of frequency. Includes thirty thematically-based frequency lists (e.g. clothes, weather, emotions, adjectives with ser / estar, verbs with imperfect / preterit, etc.) Routledge
208  12/2/2010 Chapter 2011 COCA COHA  Mark Davies Examining Recent Changes in English: Some Methodological Issues 1) Compares COHA to smaller historical corpora like the BROWN family, ARCHER, etc. to look at three cases of shifts with verbal complementation in the 1800s-1900s. 2) Compares COCA to the Bank of English and other supposed "monitor corpora", to argue that because it has the same genre (and sub-genre) composition from year to year, only COCA provides reliable data for recent (syntactic) changes in English. In: Handbook on the History of English: Rethinking Approaches to the History of English, eds. Terttu Nevalainen and Elizabeth Closs Traugott. (Forthcoming, 2011)
209  12/2/2010 Chapter 2009 COCA  Juhani Rudanko Change and Variation in Complement Selection: a Case Study from Recent English, with Evidence from Large Corpora The article examines change and variation -- and the principles accounting for change and variation -- in the sentential complements of the matrix verb ''submit'' in recent English. COCA, the Corpus of Contemporary American English, is a key source of data in the article. Stefan Gries, Stefanie Wulff, Mark Davies eds., Corpus-linguistic Applications: Current Studies, New Directions, pp. 47-66
210  12/2/2010 Journal 2010 TIME  Juhani Rudanko Explaining Grammatical Variation and Change: a Case Study of Complementation in American English over Three Decades The study is based on data from the TIME Corpus. Journal of English Linguistics. 38: 4-24
211  12/2/2010 Chapter 2010 COCA  Ruth Carroll The Visual Language of the Recipe: A Brief Historical Survey only a brief mention (p.70): COCA "shows that when talking about recipes, we are almost four times as likely to use the verb ''use'' as we are to use the verb ''read''." Proceedings of the Oxford Symposium on Food and Cookery 2009. Ed. Richard Hosking, Totnes: Prospect Books, 62-72.
212  12/2/2010 Chapter 2011 COCA BNC  Marcus Callies Bare infinitival complements in Present-Day English presents evidence from the British National Corpus and the Corpus of Contemporary American English suggesting that bare infinitival complements in transitive uses of certain (classes of) verbs are increasing in Present-Day English; addresses the question whether this has to be interpreted as an effect of verbal economy rather than informality, and discusses various structural and psycholinguistic factors that have been attested to influence the variation between a marked and a bare infinitive (the Complexity Principle, horror aequi, semantic analogy and persistence/priming effects). Aarts, B. et al., eds. (to appear), Corpus Methodology and Current Change in the English Verb Phrase. Cambridge: CUP.
213  12/2/2010 Chapter 2011 TIME  Marcus Callies The grammaticalization and pragmaticalization of cleft constructions in Present-Day English examines the development of the variation between a marked and an unmarked infinitival complement clause in three types of cleft constructions in 20th century English; finds a reversal of preferences in both varieties in all three types of clefts; suggests a different development of the construction in written and spoken English (grammaticalization in written English vs. pragmaticalization in spoken English) Paul Rayson & Sebastian Hoffmann, eds. (to appear), Proceedings of ICAME 30, Lancaster. Amsterdam: Rodopi.
214  12/2/2010 Chapter 2009 COCA COHA BNC TIME  Magnus Levin and Hans Lindquist On the face of it. How recurrent phrases organize text. We used COCA, BNC and Time to investigate how the phrases on the face of it / on its face are used in contemporary BrE/AmE and how they have changed over time in AmE. Jucker, Andreas H., Daniel Schreier and Marianne Hundt (eds.) Corpora: Pragmatics and Discourse. Papers from the 29th International Conference on English Language Research on Computerized Corpora. Amsterdam: Rodopi. 169–188.
215  12/2/2010 Book 2011 COCA BNC  Robert Lew Online dictionaries of English A state-of-the-art overview of online dictionaries of English, also noting the convergence between the increasingly sophisticated database or corpus interfaces and more traditional dictionaries. Fuertes-Olivera, Pedro (ed.). 2011. e-Lexicography: The Internet, Digital Initiatives, and Lexicography. London: Continuum.
216  12/1/2010 Journal 2003 BNC  Xingbing Liu Factors contributing to corpus representativeness in corpus building The BNC is taken as a typical representative balanced corpus in discussing the representativeness of corpora. The paper holds that the BNC's sampling scheme can serve as a reference when building a large balanced general corpus. Journal of Shiyan Technical Institute, Vol. 16, No. 4
217  12/1/2010 Journal 2005 BNC  Xingbing Liu The Oral Features in the English Compositions of English Learners in China BNC Spoken and BNC World Spoken are used together with the Brown Corpus as reference corpora for comparison with the China Learner English Corpus, in order to investigate the oral tendency in the English compositions of English learners of different levels in China. Journal of Technology College Education, Vol. 24, No. 1
218  12/1/2010 Journal 2010 COCA  Michael Ramscar and Melody Dye Running down the clock: the role of expectation in our understanding of time and motion. We used COCA frequency data to build a model of lexical prediction. See Exp 2-4. In addition, to get a measure of how well our corpus-based model of expectations captured real usage patterns, we correlated the likelihoods (given our COCA model) with the observed frequency in Google. The fit between the predictive model and the observed data (r=.95, p<.000001) was encouraging. Ramscar, M., Matlock, T., & Dye, M. (2010) Running down the clock: the role of expectation in our understanding of time and motion. Language and Cognitive Processes, 25(5), 589-615.
219  12/1/2010 Book 2011 COCA  Xingbing Liu A Reading Course 3 The examples illustrating the usage of the new words in the word bank and the sentences in the blank filling exercises are solely taken from COCA concordances. Foreign Languages Press, In press
220  12/1/2010 Journal 2010 COCA  Melody Dye Learning language from the input : Why innate constraints can’t explain noun compounding. This paper tests the tenets of ''level ordering'' theory against the idea that children learn linguistic conventions from the input. Frequency data from COCA was used extensively in Exp 7 and subsequent analysis (pp 28-35, Tables 4-8). In brief, we were looking at whether the perceived ''naturalness'' of noun compounds could better be explained by grammatical type or plural sibilance. Ramscar, M. & Dye, M. (2011) Learning language from the input : Why innate constraints can’t explain noun compounding. Cognitive Psychology. 62(1), 1-40
221  12/1/2010 MA/PhD 2009 COCA  Cintia Widmann Morphological segmentation during silent reading For the homophones and controls in Experiment 1, the frequencies of occurrence as noun and as adjective were extracted from the corpus. In Experiment 2, "the raw frequency of co-occurrence of each modifier-homophone/control pair was extracted from the Corpus" (p. 34). The University of South Carolina
222  12/1/2010 Chapter 2010 COCA BNC TIME  Laurel Brinton From performative to concessive disjunct: I/you admit and admittedly Corpora data used to study the history of "admit" forms. Language Change and Variation from Old English to Late Modern English: A Festschrift for Minoji Akimoto. Merja Kytö, John Scahill, and Harumi Tanabe (eds.) Bern: Peter Lang. Pp. 279-302
223  12/1/2010 Book 2010 COCA  Laurel Brinton and Donna M. Brinton The Linguistic Structure of Modern English General introductory textbook (phonology, morphology, syntax, semantics, pragmatics). Corpus examples used extensively in the syntax section of the text. Amsterdam/Philadelphia: John Benjamins.
224  12/1/2010 Book 2009 BNC  Laurel Brinton The Comment Clause in English: Syntactic Origins and Pragmatic Development Corpus data used to study the frequency of some clausal disjuncts in Present-day English. Comparison is made to the frequency of the forms in Canadian English. Cambridge: Cambridge University Press
225  11/29/2010 Book 2010 BNC COCA  Yufang Qian Discursive constructions around terrorism in the People's Daily and the Sun before and after 9.11 I examined the usage of some English words such as nest, wasps, ants and root out in order to explore the semantic prosody of these words in news reports. Peter Lang
226  11/29/2010 Book 2010 COCA  Florian Haas Reciprocity in English: Historical Development and Synchronic Structure The COCA data were used to test hypotheses concerning restrictions on the use of different types of reciprocal constructions. Routledge
227  11/16/2010 Other 2011 COCA COHA TIME  Neal Goldfarb Brief of the Project On Government Oversight et al as Amici Curiae in FCC v. AT&T This is a friend-of-the-court brief in the U.S. Supreme Court. The case involves the Freedom of Information Act. Law-enforcement records are exempt from disclosure under FOIA to the extent that disclosure would result in an invasion of “personal privacy.” The issue is whether corporations are entitled to the benefit of this exemption, based on the fact that the statute defines “person” to include corporations. The court of appeals ruled that this stipulative definition changes the meaning of “personal” as used in FOIA, so that “personal privacy” can be understood to mean “corporate privacy”. The brief uses collocation data (primarily regarding the word “personal”) to argue against that conclusion. United States Supreme Court
228  11/12/2010 Journal 2009 COCA BNC  Jungsoo Kim English Floating Quantifier Constructions: A Non-movement Approach Taking advantage of the corpus data found in the BYU-BNC and COCA, I presented syntactic properties of the floating quantifier constructions in English. Language and Information
229  11/12/2010 MA/PhD 2010 COCA BNC  Jungsoo Kim Grammatical Properties of the Preposition of in Binominal Constructions I used authentic data found in the COCA and BYU-BNC to examine the function of the preposition of in English binominal constructions. Kyung Hee University
230  11/12/2010 Journal 2010 CDP  Christian Lehmann, José Pinto de Lima, Rute Soares Periphrastic voice with “see” in Portuguese The corpus was used mainly to illustrate the changes leading to the present-day grammaticalized use of "ver" ("see") in Portuguese. Acta Linguistica Hafniensia, 42: 1, 75-100
231  11/6/2010 Chapter 2009 CDE  Montserrat Batllori En Torno a la polaridad enfática en español y en catalán: un estudio diacrónico y comparativo (Joint with M.L. Hernanz) Empirical evidence from the Corpus del Espanol, used to support our hypothesis Diachronic linguistics, ed. Joan Rafel. 373-406
232  11/3/2010 Journal 2010 CDE  Ian Mackenzie Refining the V2 hypothesis for Old Spanish This paper looks at the claim that Old Spanish was a Verb Second language and explores how regular object fronting could have co-existed with VO order. Virtually all the empirical data brought to bear was retrieved using the Corpus del espanol. Bulletin of Hispanic Studies
233  11/1/2010 MA/PhD 2007 CDP  Andréa Lacotiz Diachronic study of the functions and the semantic values of suffixes -ança/-ença, -ância/-ência in the Portuguese The purpose of this research was to specify the etymological data found in the Dicionário Houaiss in order to investigate the changing semantic values of these suffixes along their diachronic trajectory, evaluate their transcategorial formative tendency since Classical Latin, and verify the relations that the derived nouns establish with their cognate verbs and adjectives. Based on a corpus of 250 common words formed with these suffixes, we investigate the etymology of the terms, consulting dictionaries of Classical and Medieval Latin, English, French, Spanish and Italian, and comparing the data found with those supplied by the Dicionário Houaiss. We describe the semantic values of the suffixes in the form of paraphrases, distinguishing the prototypical value from those that arose through loans or meaning extensions. We examine the cognate relations between these derived nouns and adjectives in -nte and verbs in present-day Portuguese. We conclude that these suffixes have changing semantic values, both prototypical and acquired along their diachronic trajectory; that they serve to create nouns, mainly abstract ones, although concrete nouns also occur; and that they tend to form derivatives from adjectival and verbal bases, although formations from other categories have also existed throughout their history, since Latin. Universidade de São Paulo
234  10/25/2010 Chapter 2011 CDE  Andrea Peskova La omisión y la expresión del pronombre sujeto vos en el español porteño The data from Corpus del espanol were used to present a use of the pronominal subject vos with the verb "saber" and were compared with my own data gathered in Buenos Aires in 2008. Di Tullio, Ángela / Kailuweit, Rolf (eds.): El español rioplatense. Frankfurt/Madrid: Vervuert.
235  10/23/2010 MA/PhD 2009 COCA BNC  Daniela Vranovská On Cognitive Synonymy: A Case Study (Soft and Mild) I compared all meanings of collocations with the adjectives soft and mild in attributive position. Masaryk University
236  10/20/2010 Journal 2009 BNC  Sandra Mollin “I entirely understand” is a Blairism: The methodology of identifying idiolectal collocations In this article I used the BNC as the control corpus to compare against my own Tony Blair Corpus in the domain of amplifier collocations. International Journal of Corpus Linguistics
237  10/18/2010 Journal 2010 BNC  Suhad Sonbul Direct teaching of vocabulary after reading: Is it worth the effort? This experimental study evaluated the effectiveness of direct teaching of new vocabulary items in reading passages. The study compared vocabulary learning under a reading only condition (incidental learning) to learning that is aided by direct communication of word meanings (explicit learning). Three levels of vocabulary knowledge (form recall, meaning recall, and meaning recognition) were assessed using three tests (completion, L1 translation, and multiple choice, respectively). Incidental learning plus explicit instruction was found to be more effective than incidental learning alone for all three levels. The results also showed that direct instruction is especially effective in facilitating the deepest level of knowledge, i.e. form recall. These findings demonstrate the value of the time and effort spent on direct teaching of lexical items in EFL reading classes. We used the BNC to control for the frequency of words used in this study. Sonbul, S., & Schmitt, N. (2010). Direct teaching of vocabulary after reading: Is it worth the effort? ELT J, 64(3), 253-260.
238  10/11/2010 Journal 2008 BNC  Florian Haas Symmetric verbs and constraints on passivization: An English-German comparison Different uses of the English verb ''meet'' have been investigated with respect to their occurrence in the active and passive diatheses. In order to do this, the first 2,000 instances of the lemma ''meet'' in the BNC were extracted with the help of the interface at corpus.byu.edu and classified according to their exact interpretation and their use in the active or passive voice. It turns out that there are striking differences between the relevant interpretations. A comparison to their German counterparts reveals that semantic and pragmatic motivations cannot sufficiently account for the distribution in English. I propose that verb senses and voice values are associated in such a way that semantic ambiguities are minimized. Zeitschrift für Anglistik und Amerikanistik (ZAA) 56.3: 255-268
239  10/2/2010 Journal 2010 COCA BNC TIME  Yoichi Arai Linguistic Studies Based on BYU-BNC. [written in Japanese] This paper [in Japanese] discusses the strengths of the search functions of BYU-BNC and presents linguistic analyses of "I goes", the infinitival/gerundive complements of "copula + accustomed to" (cf. Rudanko 2006), and so on. English Corpus Studies 17, 2010, pp.223- (JAECS [Japan Association for English Corpus Studies])
240  10/2/2010 Journal 2010 COCA BNC TIME  Yoichi Arai An Overview of Free Corpora/Concordancers on the Web and Online BNC Concordancers. [written in Japanese] An article [written in Japanese] on free online English corpora, centering on the BYU corpora, which was presented at a symposium at the 34th conference of JAECS [Japan Association for English Corpus Studies], held at Aoyama Gakuin University on 3 October 2009. English Corpus Studies 17, 2010, pp.177-188 (Japan Association for English Corpus Studies)
241  9/30/2010 MA/PhD 2011 BNC  Elizaveta Tarasova Nominal compounding in English and Russian. Cognitive perspective The research attempts to analyze N+N root compounds in English and similarly structured language units in Russian from the starting point of their conceptual structure. One of the main areas of interest for my research is how these conceptual structures lead to representations on the morphological level in both languages. I'm using the BNC for my data collection. PhD; Victoria University of Wellington
242  9/25/2010 MA/PhD 2009 COCA BNC  Carmen Hernando A Comparative Corpus-Aided Analysis of American and British English: A Case Study of Slang This MA thesis should be set against the background of linguistic Globalisation and its diffusion through the media. The mass media allow for linguistic shifts taking place in American English to spread to other World Englishes (above all on the levels of syntax and lexis) as a result of an increasing trend towards colloquialisation, americanisation and a more democratic model of (power) relationships. The present dissertation constitutes a corpus-based study that analyses the frequency of use of a series of grammatical structures and lexical items in American and British English, particularly focusing on American slang. The study takes as a basis chiefly audiovisual corpora in parallel to which real linguistic data (drawn from a survey on slang) are examined. Results provide evidence of the so called Americanisation of other World Englishes, among them, British English, and that this influence can be observed in both written and audiovisual corpora. Universidad de Extremadura
243  9/22/2010 MA/PhD 2009 COCA  Nancy Turley A corpus-based analysis of 'I' and 'me' variation in coordinate noun phrases Looks at case marking (e.g. John and I / me and John) from a (COCA) corpus-based perspective. PhD dissertation, Arizona State University
244  9/22/2010 Chapter 2011 TIME  Mark Davies Recent shifts with three nonfinite verbal complements in English: Data from the 100 million word TIME Corpus (1920s-2000s) Compares the TIME Corpus to smaller corpora like the BROWN family to look at three cases of recent shifts in verbal complements in English: [to-V vs V-ing], [V NP into V-ing], and [V (for) NP to V]. In: Current Change in the English Verb Phrase, eds. Bas Aarts, Joanne Close, Geoffrey Leech and Sean Wallis. Cambridge: Cambridge Univ. Press.
245  9/22/2010 Journal 2010 COCA  Mark Davies The Corpus of Contemporary American English as the First Reliable Monitor Corpus of English Compares COCA to the Bank of English and other supposed "monitor corpora", to argue that because it has the same genre (and sub-genre) composition from year to year, only COCA provides reliable data for recent (syntactic) changes in English. Literary and Linguistic Computing (Forthcoming, November or December 2010)
246  9/22/2010 Journal 2009 TIME  Neil Millar Modal verbs in TIME: Frequency changes 1923-2006 Between the 1960s and 1990s the frequency of modal verbs in the Brown family of corpora fell substantially, a decline which Leech (2003:96) suggests is indicative of a more 'general and long lasting trend'. Taking Leech's study as a starting point, this paper investigates twentieth century changes in modal verbs using the new and relatively unexplored TIME Magazine Corpus. Results show that while certain modal verbs have fallen in frequency, the overall pattern is one of growth. These changes may be accounted for by the increase of semi-modal verbs and shifts in usage. Results appear to lend support to the explanatory hypotheses of colloquialization, democratization and stylistic change. Investigation of discrepancies between results from the TIME Corpus and the Brown corpora yields important methodological implications. In particular, as a result of sampling variation, a diachronic comparison based on two data points may present an inaccurate picture of the overall trend. International Journal of Corpus Linguistics 14: 191-220.
247  9/22/2010 Chapter 2011 COCA TIME  Juhani Rudanko A Fresh Perspective on Non-Finite Complements, with Evidence from the TIME Corpus and COCA [to-V] vs [V-ing] with certain adjectives and verbs in COCA and TIME In: Handbook on the History of English: Rethinking Approaches to the History of English, eds. Terttu Nevalainen and Elizabeth Closs Traugott. (Forthcoming, 2011)
248  9/15/2010 Chapter 2009 BNC  Jean Hudson and Maria Wiktorsson Formulaic language and the relater category - the case of ''about'' Abstract: Relaters in English are words that are traditionally referred to as prepositions, particles, or adverbs, such as: in, for, with, at, about. They are highly decategorialized and polysemous. Furthermore, with few word forms which occur very frequently, they are problematic for linguists and learners alike. In our data (naturally occurring conversation in the BNC), we find many formulaic patterns around these words, providing new insights into the meanings created through and around them. In this paper we report on a study of the relater ''about''. We find that the greater proportion of the occurrences can be adequately described as part of substantive or schematic constructions, and that they to a large extent pattern with meanings with a negative or generally unfavourable orientation. In: Corrigan, Roberta, Edith A Moravcsik, Hamid Ouali and Kathleen M Wheatley (eds). Formulaic Language, Volume 1: Distribution and historical change. John Benjamins, Amsterdam. pp 77-95.
249  9/14/2010 Journal 2009 BNC  Lachlan Mackenzie English fail to as a periphrastic negative: an FDG account The verb "fail", preceding "to" + infinitive, has two senses. One presupposes active attempt; the other lacks this presupposition and approximates in meaning to simple negation. Examining all relevant occurrences in the British National Corpus with BYU-BNC, the paper considers the second sense of "fail to" in the framework of Functional Discourse Grammar. WP-FDG: Web Papers in Functional Discourse Grammar. Volume 82: The London Papers I. 1-28
250  9/11/2010 Book 2010 COCA BNC  Shoko Hamano Basic Japanese I used the corpora to compare British and American English usage to provide the best glosses for the examples in the book, which is intended for readers in both Britain and the US. London: Routledge
251  9/8/2010 Journal 2009 COCA  François Maniez « La mise en équivalence des adjectifs relationnels du domaine médical : étude du suffixe –ionnel. » We study the use of a certain type of relational adjective (those ending in –ionnel) in the field of medicine. Translating these adjectives into English poses various problems for French speakers, notably the choice between a near-literal translation using an adjective with an equivalent suffix (-ion + -al) and a prepositional complement involving the noun from which the adjective is derived. After examining how these adjectives are selected in two bilingual terminological resources, we study the use of the ten most frequent adjectives and of their English morphological equivalents in two comparable corpora of medical research articles: a French-language corpus of eight million words and the medical section of the Corpus of Contemporary American English (available online and totaling 4.5 million words). We attempt to identify the factors that contribute to the use of the noun or the adjective in the English expressions that are candidates for equivalence with the French expressions using these adjectives. Proceedings of the First International Workshop on Terminology and Lexical Semantics (TLS’09), ed. A. Alcina & M.C. L’Homme, Université de Montréal, 19 June 2009.
252  9/8/2010 Book 2010 COCA BNC TIME  John Corbett Intercultural Language Activities Several of the language learning activities described in this ELT handbook use the COCA, TIME and BNC corpora. Cambridge: Cambridge University Press
253  9/8/2010 Chapter 2010 BNC  Manfred Sailer and Eva Csipak Negative Polarity Items im Englischunterricht The paper is in German. It deals with the use of Negative Polarity Items (such as "any", "ever") in an EFL textbook series that covers 6 years of high school teaching. We compare the occurrence profiles of a number of NPIs in the textbook series with the distribution of the same items in the BNC. The result is that the textbook starts with prototypical uses and, at the end of the 6th year, achieves an almost natural distribution. In: A. Krafft & C. Spiegel (eds): Sprachliche Förderung und Weiterbildung - transdisziplinär. Forum Angewandte Linguistik 51. Frankfurt: Peter Lang.
254  9/5/2010 Chapter 2006 CDE  Mark Davies and Tim Face Vocabulary Coverage in Spanish Textbooks: How Representative is It? Shows that the vocabulary in six widely-used first and second year Spanish textbooks bears little resemblance to the frequency of vocabulary in the well-balanced Corpus del Español. In: Selected Proceedings from the Conference on the Acquisition of Spanish and Portuguese as First and Second Languages
255  6/3/2009 MA/PhD 2009 BNC  Michal Narecki Phrasal verb-to-noun conversion in English The thesis contains a section devoted to the morphosyntactic contexts of phrasal nouns. I examined 30 nouns and checked whether they were modified by articles or adjectives. This was true in most cases, which confirmed my initial assumptions. PhD; John Paul II Catholic University (Lublin)
256  4/25/2009 Chapter 2009 CDE CDP  Mark Davies Creating Useful Historical Corpora: A Comparison of CORDE, the Corpus del Español, and the Corpus do Português Shows how the Corpus del Español and the Corpus do Português allow for a wide range of research on historical Spanish and Portuguese that is difficult or impossible with limited corpus architectures like that of CORDE and CREA (from the Real Academia Española). In: Diacronía de las lenguas iberorromances: nuevas perspectivas desde la lingüística de corpus
257  4/25/2009 Book 2009 COCA  Mark Davies and Dee Gardner A Frequency Dictionary of American English: Word Sketches, Collocates, and Thematic Lists The first frequency dictionary of contemporary American English. For each of the 5000 lemmas, it also gives the top collocates (arranged by part of speech and frequency), which provide good insight into the meaning and use of the words. Routledge
258  4/25/2009 Book 2007 CDP  Mark Davies and Ana Preto-Bay A Frequency Dictionary of Portuguese: Core Vocabulary for Learners Top 5000 lemmas in Portuguese, arranged in order of frequency. Includes thirty thematically-based frequency lists (e.g. clothes, weather, emotions, adjectives with ser / estar / ficar, verbs with imperfect / preterit, etc.). Routledge
259  4/25/2009 Chapter 2008 CDP  Mark Davies and Ana Preto-Bay The corpus-based 'Frequency Dictionary of Portuguese': A new tool for learners and teachers Discusses the Corpus do Português, and how data from this corpus were used to create the first frequency dictionary of Portuguese that is based on a large, balanced corpus. In: Proceedings of TALC 8: Teaching and Language Corpora
260  1/30/2009 MA/PhD 2009 BNC  Ina Begerow 'The Privilege of Primacy'? - A Corpus-based Study of because in clause-initial position. I am writing my master's thesis with the above-mentioned title and I really appreciate being able to use your corpora, the BNC as well as the COCA! Thanks for making these data available for everybody. PhD; Johannes Gutenberg-Universitaet (Mainz)
261  1/29/2009 MA/PhD 2009 COCA  Yu Jeung Kim Effectiveness of on-line corpus research in L2 writing I have been writing a thesis to graduate. The topic I chose is L2 writing based on an on-line corpus. I really want to use the BYU corpus in my research. I always appreciate your excellent corpus. PhD; University of North Texas
262  6/10/2008 Journal 2008 COCA  Daniel Sanford Metaphor and Phonological Reduction in English Idiomatic Expressions Great tool! I used the corpus to assess the frequency of the idioms I was looking at. The paper is in press, should be out with the May 2008 issue. Cognitive Linguistics. 17: 585-603