Category: corpus linguistics
GraphColl building and exploring networks of linguistic collocations @TonyMcEnery
GraphColl is a tool for building and exploring networks of linguistic collocations. It was developed at CASS at Lancaster University in 2014-15.
The concept and design of GraphColl is covered in Brezina et. al. (2015), published at IJCL.
Corpus Linguistics #cl2015: notes and pics
Corpus Linguistics Conference 2015, University of Lancaster, UK
Thanks to @TonyMcEnery, @HardieResearch and everybody at @UCREL_Lancaster for organizing a wonderful conference.
Abstract book download:
A selection of talks and personal notes:
Learner corpus research plenary #cl2015
Multi-dimensional analysis of oral proficiency interviews #cl2015
Non-obvious meaning in CL and CADS #cl2015
Representation of benefit claimants in UK media #cl2015
Tono Linguistic feature extraction #cefr #cl2015
Language learning theories underpinning corpus-based pedagogy #cl2015
And some pics:
Robert Poole (left)
Ricardo Jiménez
Carlos Ordoñana (left)
Lynne Flowerdew
Carlos Ordoñana (left) and Yukio Tono (right)
Discussing the representation of immigrants in the context of the LADEX project.
Discussing the representation of immigrants in the context of the LADEX project.
Carlos Ordoñana (left) and Yukio Tono (right)
Yolanda Noguera and John Flowerdew
Yukio Tono (middle)
Yolanda Noguera and Michael Barlow
Cardiff language and Law: Symposium Nov 2015 & Corpus Approaches to Public & Professional Discourse
From the Forensic Linguistics e-mail list
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
Symposium on “Expertise in Language and Law” will take place on Friday 27th November 2015.
This Symposium is part of the Advanced Research Residency in Language and Law which takes place at Cardiff University from October to December this year. The full program
me of events is at: http://blogs.cardiff.ac.uk/call/.
The Symposium is followed, on Saturday 28th November, by a Conference entitled “Corpus Approaches to Public and Professional Discourse” (http://blogs.cardiff.ac.uk/corpus/).
Multi-dimensional analysis of oral proficiency interviews #cl2015
Shelley Staples; Jesse Egbert; Geoff LaFlair
A multi-dimensional comparison of oral proficiency interviews to conversation, academic and professional spoken registers
MELAB : Michigan Engish Language Battery 989 OPIs in 2013
OPI used for academic and profesional purposes
Only transcribed the first 5 minutes
55 linguistic features
TagCount
FA
6 factor solution
Dimensions interpreted functionally
Dimension scores
Differences across registers (ANOVAs and post hocs)
6 dimension
1. Explicit stance: private verbs, that deletion, lower rates of implicit stance that the Longman corpus
3. Speaker-centered informational vs listener centered involvement: pro1, subject-conj.causative, nn, amplifiers,
4. Extended informational discourse: word length, prep, jj atr, that rel, negative features: all pronouns
6. Implicit stance: higher rates of implicit stance that the Longman corpus
Non-obvious meaning in CL and CADS #cl2015
Plenary session: Alan Partington
Non-obvious meaning in CL and CADS: from ‘hindsight post-dictability’ to sweet serendipity
Chair: Amanda Potts
http://www3.lingue.unibo.it/blog/clb/
Introspection & intuition
Processes of inference from the linguistic trace left by speakers/writers
Shared meaning
Idiom principle
Complexity of common grammatical items
Colligation: every word primed to occur in or avoid certain grammatical positions and functions (Hoey, 2005: 13)
SiBol (Siena-Bologna) corpus of newspapers, judicial inquiries, press briefings. Link.
Rapid language change
Corpus methodology is useful in detecting absence, not only presence
Language looks rather different when you look at a lot of it at once (Sinclair 1991)
Qualitative: anaphoric, historic, past behaviour
Quantitative anaphoric and cataphoric; enough data with which to infer
If primed >> psychologically fixed >> reproduced
Evaluation as prototypicality: inner circle obvious, outer circle non-obvious
Prosody can depend on grammar (Louw 1993), pov, literal vs figurative use and on field of register
Embedding is an important factor to interpret prosody
The added value of CL in discourse studies
Looking at language at different levels of abstraction: overview & close reading
Data are not sacred
Much of textual meaning is accretional
Positive cherry-picking: find counter examples
Almost all explanation in DA is informed speculation: in human science this is the closest you get to explanation
Moral panics have evolved over the years (globesity in 2015)