New Directions in Corpus-based Translation Studies

Through the Corpora List ::::::::::::::::::::::::::::::::::::: The “Language Science Press” has just published the following open access book in their series “Translation and Multilingual NLP”: “NEW DIRECTIONS IN CORPUS-BASED TRANSLATION STUDIES” by Claudio Fantinuoli & Federico Zanettin (eds.) Please download your free copy from http://langsci-press.org/catalog/book/76 ABSTRACT Corpus-based translation studies has become a major paradigm and research methodology … Read more

Free ngram databases from COW14 web corpora

From the corpora list :::::::::::::::::::::::::::::: We are pleased to announce the release of the first very large ngram databases derived from the giga-token COW14 web corpora. They are completely free (CC-BY) and can be downloaded without registration. We have applied no frequency thresholds whatsoever. In addition to the counted ngram lists, we offer raw versions such that everybody can create … Read more

CFP Posters on late-breaking results June 15 deadline

Through the corpora list ::::::::::::::::::::::::::::::::: CORPUS LINGUISTICS 2015 The CL2015 organising committee is pleased to issue a call for posters on late-breaking results on any of the topics in the conference’s scope. By “late-breaking” we mean research which was not at a sufficiently advanced stage for an abstract submission to be made in the main … Read more

Adam Kilgarriff: a selection of papers and talks

Some readings to remember one of the most indisputably influential corpus linguists in the 20 and 21st centuries. Using corpora for language research https://www.sketchengine.co.uk/documentation/attachment/wiki/AK/Papers/SkE_for_lingResearch2013.ppt?format=raw Googleology is bad science http://www.kilgarriff.co.uk/Publications/2007-K-CL-Googleology.pdf Grammar is to meaning as the law is to good behaviour. Corpus Linguistics and Linguistic Theory 3 (2): 195-198. http://www.kilgarriff.co.uk/Publications/2007-K-CLLT-grammarlaw.doc

Native & learner language in interviews

This talk discusses some of our findings in Pérez-Paredes, P., & Sánchez Tornel, M. (2015). A multidimensional analysis of learner language during story reconstruction in interviews. In M. Callies & S. Götz (Eds.), Learner Corpora in Language Testing and Assessment. Amsterdam: John Benjamins.   A contrastive analysis of native and non-native speaker interviews from Pascual … Read more

Where’s austerity in everyday speech?

According to Prof. McEnery, people in conversations avoid using words such as “austerity”, only once in a 5 million corpus. Listen to the interview below. Surprising? Don’t think so. “Expected” words do not crop up when data is examined. We have found that the UK legislation 2007-2011 on immigration does not include the lemma “immigrant” … Read more