CFP: Generative AI and data-driven learning in second language learning

Call for papers – Vol. 31, Issue 2 Guest editors: Javad Zare, Kosar University of Bojnord, Iran, and Alex Boulton, Université de Lorraine, France Language Learning & Technology has an active call for papers in a special issue on Generative AI and data-driven learning in second language learning: What the future holds, guest edited by Javad Zare … Read more

5 recent books for language teachers interested in corpus linguistics, DDL & language education

Crosthwaite, P. (Ed.). (2019). Data-driven learning for the next generation: Corpora and DDL for pre-tertiary learners. Routledge. (URL) Jablonkai, R. R., & Csomay, E. (Eds.). (2022). The Routledge Handbook of Corpora and English Language Teaching and Learning. Routledge.. (URL) Pérez-Paredes, P. (2020). Corpus Linguistics for Education. A Guide for Research. Routledge. (URL) Timmis, I. (2015). Corpus linguistics for … Read more

John Sinclair and language theory

The following is an extract form Hunston (2022, p. 256). Hunston, S. (2022). Corpora in applied linguistics. Cambridge University Press. Sinclair made a number of generalisations in the 1980s (Sinclair 1991, 2004; see also Francis 1993; Hoey 2005; Hunston 2002; Stubbs 2001) which might be summarised as follows: • In describing the meanings of a word, … Read more

Phil Durrant’s talk available on Youtube

Check out Dr Durrant’s talk “Researching writing development with a corpus” on our research group Youtube Channel https://www.youtube.com/channel/UCKjKIIQL6u1mXD2V9ZaT-_Q More info on the talk here. More info on Corpus linguistics and applied linguistics research 2021 site.

Corpus of North American Spoken English (CoNASE)

The Corpus of North American Spoken English (CoNASE), a 1.25-billion-word corpus of geolocated automatic speech-to-text transcripts, is now available in a beta version. URL http://cc.oulu.fi/~scoats/CoNASE.html for more information. The corpus was created from 301,847 ASR transcripts from 2,572 YouTube channels, corresponding to 154,041 hours of video. The size of the corpus is 1,252,066,371 word tokens. … Read more