Categories
Cambridge uk

Guía básica para vivir en Cambridge @Bristolenos

 

IMG_20150911_110337

Información básica para visitantes y futuros habitantes de Cambridge.

Link.

Categories
CFP

Corpus linguistics in the south 13: presentations #cls13

 

img_20161126_103041

Corpus Linguistics on the South 13

The Hillary Clinton emails: corpus linguistics meets the real world

Rachele de Felice, University College London

Link

AntFileConverter: Pdf to plain text

Redactions. Materially annoying, so much is deleted that POS tagging becomes problematic.

Lots of repeated materials because of e-mail chains.

What about names and entities? H, HC, Hillary, MS is the same referent.

Which device was used? iPad, desktop, phone?

What about times?

Unique Ids for everything (G. Garretson): each e-mail given unique id

 

Grain and scale: Looking at small data sets in broader sociocultural contexts

Colleen Cotter, Lisa McEntee-Atalianis and Danniella Samos
Queen Mary University of London and (LMA) Birkbeck, University of London

Tracing the migrant voice in UK media

Voice: linguistic construction of social personae (Keane 2003: 268)

They looked at quotations.

Jan-Dec 2015; migrants & Calais; Lexis Nexis

Investigating obesity

Corpus linguistics and news representations: a corpus-assisted framing analysis of mental health and arts participation messages in the British press
Dimitrinka Atanasova and Nelya Koteyko, Queen Mary University of London

Framing analysis: QUAL Framing is selecting aspects of reality and make them more salient (Entman 1993)

Keywords play a role in both framing analysis and corpus linguistics

Recovery was the most prominent frame

CL gives more opportunities to look at a wider range of articles (i.e. local newspapers cater for smaller communities).

 

 

 

 

 

 

Categories
corpora corpus linguistics learner corpus learner language text analysis text tools Tools

Graphic Online Language Diagnostic

 

Graph-Magnifier-icon

The Graphic Online Language Diagnostic (“GOLD”) is a corpus tool that allows language educators to submit and analyze language data. GOLD was developed by the Center for Advanced Language Proficiency Education and Research (“CALPER”) at The Pennsylvania State University (“PSU”), University Park, PA, USA under a grant from the U.S. Department of Education (Title VI, P229A060003 and P229A020010).

Link here: http://gold.gwserver1.net

Categories
Sketch Engine software text tools Traducción

Some resources for the building of parallel corpora

 

parallel_

Michael Barlow’s site (Link).

European Parallel Corpus (Link)

OPUS (Link)

Parallel corpora in Sketch Engine (Link)

AntPConc software (Link)

 

Categories
analysis of language applied linguistics automated tools computational linguistics English English Language Knowledge engineering language analysis linguistics text analysis text tools vocabulary

TAALES 2.2 is out : automatic analysis of lexical sophistication, Windows and Mac

From the TAALES website:

Kyle, K. & Crossley, S. A. (2015). Automatically assessing lexical sophistication: Indices, tools, findings, and application. TESOL Quarterly 49(4), pp. 757-786. doi: 10.1002/tesq.194

TAALES is a tool that measures over 400 classic and new indices of lexical sophistication, and includes indices related to a wide range of sub-constructs. TAALES indices have been used to inform models of second language (L2) speaking proficiency, first language (L1) and L2 writing proficiency, spoken and written lexical proficiency, genre differences, and satirical language.

Starting with version 2.2, TAALES provides comprehensive index diagnostics, including text-level coverage output (i.e., the percent of words/bigrams/trigrams in a text covered by the index) AND individual word/bigram/trigram index coverage information.

TAALES takes plain text files as input (it will process all plain text files in a particular folder) and produces a comma separated values (.csv) spreadsheet that is easily read by any spreadsheet software.

 

You can find all the info here. Windows and Mac versions available for free.

Categories
analysis of language applied linguistics CFP conferences corpus linguistics language analysis learner corpus learner language

4th Learner Corpus Research Conference, Bolzano, Italy, 5‐7 October 2017

4th Learner Corpus Research Conference
Bolzano/Bozen, Italy, 5‐7 October 2017

http://lcr2017.eurac.edu

Abstracts should be submitted through EasyChair by Sunday 15 January 2017.

Notification of the outcome of the review process will be sent by 31 March 2017.

Call for Papers

Following the successful conferences in Louvain‐la‐Neuve (Belgium) in 2011, Bergen (Norway) in 2013 and Nijmegen (the Netherlands) in 2015, the 4th Learner Corpus Research Conference will be hosted by the Institute for Specialised Communication and Multilingualism at EURAC Research, Bolzano/Bozen, Italy. The conference, organized under the aegis of the Learner Corpus Association, aims to be a showcase for the latest developments in the field and will feature full paper presentations, work in progress reports, poster presentations, software demos and a book exhibition.

The theme of LCR 2017 is “Widening the Scope of Learner Corpus Research”.

Conference Venue: European Academy Bozen/Bolzano – EURAC Research

Confirmed keynote speakers:

Philip Durrant (University of Exeter, United Kingdom)
Stefan Th. Gries (University of California, Santa Barbara, U.S.A.)
Stefania Spina (Università per Stranieri Perugia, Italy)
The keynote speakers will adress the theme of LCR 2017 in their respective lectures on L1 writing  development and Learner Corpus Research, quantitative methods in Learner Corpus Research, and Learner Corpus Research and Italian as L2. We welcome papers that address all aspects of Learner  Corpus Research, in particular the following ones:

* Corpora as pedagogical resources
* Corpus‐based transfer studies
* Data mining and other explorative approaches to learner corpora
* English as a Lingua Franca
* Error detection and correction of learner language
* Extracting language features from learner corpora
* Innovative annotations in learner corpora
* Language for academic/specific purposes
* Learner varieties
* Learner corpora for less commonly taught languages
* Learner Corpus Research and the Common European Framework of Reference for Languages (CEFR)
* Learner Corpus Research and Natural Language Processing
* Links between Learner Corpus Research and other research methodologies (e.g. experimental methods)
* Search engines for learner corpora
* Statistical methods in learner corpus studies
* Task and learner variables

There will be four different categories of presentation:

* Full paper (20 minutes + 10 minutes for discussion)
* Work in Progress (WiP) report (10 minutes + 5 minutes for discussion)
* Corpus/software demonstration
* Poster
* The Work in Progress reports and posters are intended to present research still at a preliminary stage and on which researchers would like to get feedback.

The language of the conference is English.

Abstracts
Your abstract should be between 600 and 700 words (excluding a list of references). Abstracts should  provide the following:
* clearly articulated research question(s) and its/their relevance;
* the most important details about research approach, data and methods;
* the main results and their interpretation.

Abstracts should be submitted through EasyChair (https://easychair.org/conferences/?conf=lcr2017) by Sunday 15 January 2017. Please follow instructions provided on the conference website (http://lcr2017.eurac.edu).
Abstracts will be reviewed anonymously by the scientific committee. Notification of the outcome of  the review process will be sent by 31 March 2017.

The LCR 2017 organising committee
Andrea Abel (EURAC Research)
María Belén Díez‐Bedmar (Universidad de Jaén)
Daniela Gasser (EURAC Research)
Aivars Glaznieks (EURAC Research)
Verena Lyding (EURAC Research)
Lionel Nicolas (EURAC Research)

The LCR 2017 scientific committee
Andrea Abel (EURAC Research)
Katherine Ackerley (Università degil Studi di Padova)
Annelie Ädel (Dalarna University)
Nicolas Ballier (Université Paris Diderot – Paris 7)
María Belén Díez‐Bedmar (Universidad de Jaén)
Marcus Callies (Universität Bremen)
Erik Castello (Università degil Studi di Padova)
Francesca Coccetta (Università Ca’Foscari Venezia)
Pieter de Haan (Radboud Universiteit Nijmegen)
Hilde Hasselgård (Universitet i Oslo)
Sandra Deshors (New Mexico State University)
Ana Diaz‐Negrillo (Universidad de Granada)
Michael Flor (ETS)
John Flowerdew (City University of Hong Kong)
Lynne Flowerdew (independent researcher)
Fanny Forsberg Lundell (Stockholm University)
Gaëtanelle Gilquin (University of Louvain)
Sandra Götz (Justus Liebig Universität Gießen)
Solveig Granath (Karlstad University)
Sylviane Granger (Universtié catholique de Louvain)
Nicholas Groom (University of Birmingham)
Jirka Hana (Charles University Prague)
Shin’ichiro Ishikawa (Kobe University)
Jarmo Harri Jantunen (University of Jyväskylä)
Scott Jarvis (Ohio University)
Marie Källkvist (Lund University Sweden)
Agnieszka Lenko‐Szymanska (University of Warsaw)
Anke Lüdeling (Humboldt‐Universität Berlin)
Carla Marello (Università degil Studi Torino)
Fanny Meunier (Universtié catholique de Louvain)
Detmar Meurers (Universität Tübingen)
Florence Myles (University of Essex)
Susan Nacey (Hedmark University College)
Lionel Nicolas (EURAC Research)
Michael O’Donnell (Universidad Autónoma de Madrid)
Signe Oksefjell Ebeling (Universitetet i Oslo)
Magali Paquot (Universtié catholique de Louvain/FNRS)
Pascual Pérez‐Paredes (University of Cambridge)
Tom Rankin (Vienna University of Economics and Business)
Paul Rayson (UCREL, Lancaster University)
Ute Römer (University of Michigan)
Anna Siyanova‐Chanturia (Victoria University of Wellington)
Jennifer Thewissen (Universiteit Antwerpen)
Yukio Tono (Tokyo University of Foreign Studies)
Nina Vyatkina (University of Kansas)
Heike Zinsmeister (Universität Hamburg)

For inquiries, contact Andrea Abel: Andrea . Abel @ eurac . edu