Some interesting new research on corpora, technology and language learning

Curry, N. (2022). On Corpus-Based Contrastive Linguistics and Language Pedagogy: Reimagining Applications for Contemporary English Language Teaching. In: McCallum, L. (eds) English Language Teaching. English Language Teaching: Theory, Research and Pedagogy. Springer, Singapore. https://doi.org/10.1007/978-981-19-2152-0_14 Li, L. X. (2022). Meta-Analytical Approach to the Impact of Corpus-Driven Teaching on Foreign Language Acquisition. Mobile Information Systems, 2022. https://doi.org/10.1155/2022/5049312 Ng, C.H., … Read more

Recent DDL research & events: 5 tips

Really exciting times for DDL and corpus linguistics and education researchers. There’s some interesting new stuff that has just been published, including some interesting conference videos. Here’s my selection. (1) Boulton, A., & Vyatkina, N. (2021). Thirty years of data-driven learning: Taking stock and charting new directions over time. Language Learning & Technology, 25(3), 66-89. Abstract The … Read more

Language is never, ever, ever random

“Language is never, ever, ever random” (Kilgarriff, 2005), not in its usage, not in its acquisition, and not in its processing. (Nick C. Ellis, 2017, p. 41) Nick C. Ellis (2017). Cognition, Corpora, and Computing: Triangulating Research in Usage-Based Language Learning. Language Learning 67(S1), pp. 40–65

Corpus of North American Spoken English (CoNASE)

The Corpus of North American Spoken English (CoNASE), a 1.25-billion-word corpus of geolocated automatic speech-to-text transcripts, is now available in a beta version. URL http://cc.oulu.fi/~scoats/CoNASE.html for more information. The corpus was created from 301,847 ASR transcripts from 2,572 YouTube channels, corresponding to 154,041 hours of video. The size of the corpus is 1,252,066,371 word tokens. … Read more