MA of L2 learner English

Corpus Linguistics 2015, University of Lancaster, 21-24 July
Yu Yuan: “Exploring the variation in world Learner Englishes: A multidimensional analysis of L2 written corpora”
109 features included in the analysis
RQ: Can Biber’s model be extended? How do features co-occur in learner English?
Data: ICLE 1.0 (Granger, 2002), SWEECL 2.0 (Wen & Wang, 2008) …

1.6 billion word Hansard Corpus available

Through the corpora list & Prof. Mark Davies
We are pleased to announce the release of the 1.6 billion word Hansard Corpus. The corpus is part of the SAMUELS project and has been funded by the AHRC (UK). The Hansard Corpus contains 1.6 billion words from 7.6 million speeches in the British Parliament from …

New Directions in Corpus-based Translation Studies

Through the Corpora List
The “Language Science Press” has just published the following open access book in their series “Translation and Multilingual NLP”: “NEW DIRECTIONS IN CORPUS-BASED TRANSLATION STUDIES” by Claudio Fantinuoli & Federico Zanettin (eds.)
Please download your free copy from http://langsci-press.org/catalog/book/76
ABSTRACT
Corpus-based translation studies has become a major paradigm and research methodology …

Free ngram databases from COW14 web corpora

From the corpora list
We are pleased to announce the release of the first very large ngram databases derived from the giga-token COW14 web corpora. They are completely free (CC-BY) and can be downloaded without registration. We have applied no frequency thresholds whatsoever. In addition to the counted ngram lists, we offer raw versions such that everybody can create …

CFP Posters on late-breaking results June 15 deadline

Through the corpora list
CORPUS LINGUISTICS 2015
The CL2015 organising committee is pleased to issue a call for posters on late-breaking results on any of the topics in the conference’s scope. By “late-breaking” we mean research which was not at a sufficiently advanced stage for an abstract submission to be made in the main …

Adam Kilgarriff: a selection of papers and talks

Some readings to remember one of the most influential corpus linguists of the 20th and 21st centuries.
Using corpora for language research
https://www.sketchengine.co.uk/documentation/attachment/wiki/AK/Papers/SkE_for_lingResearch2013.ppt?format=raw
Googleology is bad science
http://www.kilgarriff.co.uk/Publications/2007-K-CL-Googleology.pdf
Grammar is to meaning as the law is to good behaviour. Corpus Linguistics and Linguistic Theory 3 (2): 195-198.
http://www.kilgarriff.co.uk/Publications/2007-K-CLLT-grammarlaw.doc