New book: Corpus Linguistics for ELT

Corpus_Linguistics_for_ELT__Research_and_Practice_-_Ivor_Timmis_-_Google_Libros

 

Corpus Linguistics for ELT: Research and Practice
Ivor Timmis

From the introduction:

The challenge of fostering a fruitful relationship between corpus linguistics and ELT was clearly set out by Conrad (2000: 556):
Corpus grammarians must strive to reach more audiences that include
teachers and must emphasize concrete pedagogical applications … In fact,
the strongest force for change could be a new generation of ESL teachers who
were introduced to corpus-based research in their training programs [and]
have practiced conducting their own corpus investigations and designing materials based on corpus research.
Indeed, this comment by Conrad encapsulates the main aim of this book: to help move corpus linguistics from what Römer (2012) terms its ‘minority sport’ status in language teaching to a point where the ability to carry out and interpret corpus research is seen as a normal part of an English language teacher’s repertoire.
Familiarity with corpus research and practice should be a standard part of an English language teacher’s toolkit, I would argue, because most people in ELT will at some time have had thoughts like these:
• How many words do my learners need to learn?
• Why is everyone talking about lexical chunks and collocations?
• Do my students really need this grammar point?
• Which words should I use to exemplify this structure?
• Am I teaching my learners language they will need to use when they speak the language?
• Does the grammar explanation in the coursebook really reflect how we use this structure?
• What vocabulary do my English for dentistry students need to get their teeth into?

If you have had questions like these, this book is designed to help you to answer them by consulting corpora and corpus-informed literature. It is also designed to help you to generate and investigate similar questions. It is, however, important to keep corpora in perspective throughout this book.

The argument presented here is that corpora are a resource and a reference source and, as is the case with all resources, pedagogic judgement is vitally important in determining how and when
they are deployed to best effect.
The book does not assume prior knowledge or experience of corpus research; nor does it assume any technical expertise. Technophobes can relax: contemporary corpus interfaces and corpus software are user-friendly and often include tutorial packages. The tasks in this book will help to familiarise readers with publicly available user-friendly corpora such as the British National Corpus hosted at
http://corpus.byu.edu/bnc/

CFP International Journal of Learner Corpus Research

The International Journal of Learner Corpus Research (IJLCR) is a forum for researchers who collect, annotate, and analyse computer learner corpora and/or use them to investigate topics in Second Language Acquisition and linguistic theory in general, inform foreign language teaching, develop learner-corpus-informed tools (e.g. courseware, proficiency tests, dictionaries and grammars) or conduct natural language processing tasks (e.g. annotation, automatic spell- and grammar-checking , L1 identification).

 

IJLCR aims to highlight the multidisciplinary and broad scope of practice that characterizes the field and publishes original research covering methodological, theoretical and applied work in any area of learner corpus research.

IJLCR features research papers, shorter research notes and reviews of books, corpora and software tools. The language of the journal is English. The journal will also publish special issues. All contributions are peer-reviewed.

IJLCR is now inviting submissions for Vol. 2 No. 1 (to be published in spring 2016).

For more information, visit the official website of the journal.

 

Summer Schools in Corpus Linguistics

Through the Corpora List

::::::::::::::::::::::::::::::::::::

lancaster

Summer Schools in Corpus Linguistics / Statistics for Corpus Linguistics

http://ucrel.lancs.ac.uk/summerschool

Lancaster University, UK – 14th to 17th July 2015

 

Since 2010, Lancaster University has run a highly successful series of free-to attend summer training events. In 2015, we will for the first time be running two corpus linguistics events in parallel:

 

  • The UCREL Summer School in Corpus Linguistics
  • The UCREL/CASS Summer School in Statistics for Corpus Linguistics

 

Sponsored by UCREL at Lancaster University – one of the world’s leading and longest-established centres for corpus-based research – and by the ESRC-funded CASS project, these events’ aim is to support students of language and linguistics in the development of advanced skills in corpus methods.

Both are intended primarily for postgraduate research students (and secondarily for Masters-level students, postdoctoral researchers, and others); both assume at least a basic knowledge of corpus linguistics (but in the case of the Statistics Summer School, no knowledge of statistics is assumed).

The four-day programme consists of a series of intensive two-hour sessions, some involving practical work, others more discussion-oriented. Some sessions are shared across the two events. The instructors include, as well as speakers from Lancaster University, external guest speakers who are prominent specialists in their respective areas.

For a list of topics and speakers in the UCREL Summer School in Corpus Linguistics, see http://ucrel.lancs.ac.uk/summerschool/corpusling.php

 

For a list of topics and speakers in the UCREL/CASS Summer School in Statistics for Corpus Linguistics, see http://ucrel.lancs.ac.uk/summerschool/stats.php

These events are part of a larger set of five co-located Lancaster Summer Schools in Interdisciplinary Digital Methods; the other events include training in corpus methods directed at non-linguists; see the website for further information:

http://ucrel.lancs.ac.uk/summerschool

Note that the summer schools run the week immediately before the Corpus Linguistics 2015 conference, for the benefit of anyone who might wish to attend both.

 

How to register

Our Summer Schools are free to attend, but registration in advance is compulsory, as places are limited.

The deadline for registrations is Sunday 7th June 2015, but we cannot guarantee that places will still be available at that point!

The application forms are available on the event website here as is further information on the programme.

Corpus Linguistics 2015: @UCREL_Lancaster registration open

From the Corpora List
:::::::::::::::::::::::::::::::::::::::::::::::::::

Corpus Linguistics 2015: In honour of the life and work of Geoffrey Leech

 
 


The eighth international Corpus Linguistics conference (CL2015) will be held at Lancaster University from Tuesday 21st July 2015 to Friday 24th July 2015. The main conference will be preceded by a workshop day on Monday 20th July.

This series of conferences began in 2001 with an event celebrating the career of Professor Geoffrey Leech, on the occasion of his retirement. In August of 2014, we reported with great sadness Geoff’s sudden death.

By dedicating this eighth conference in the Corpus Linguistics series once again to a celebration of Geoff’s life, his career, and his truly remarkable influence on the field, we once more pay tribute to, and commemorate, a remarkable intellect and a sorely-missed colleague and friend.

Conference themes and topics

The goals of the conference are:

. To gather together current and developing research in the study and application of corpus linguistics; . To push the field forwards by promoting dialogue among the many different users of corpora across interconnected sub-disciplines of linguistics – be they descriptive, theoretical, applied or computational; . To explore new challenges both within corpus linguistics, and in the extension of corpus approaches to new fields of study.

CL2015 will have three thematic streams and a general programme.

Stream A: A tribute to Geoffrey Leech

For this stream we invite contributions using corpus methods in any of the branches of linguistics with which Geoffrey Leech’s research was especially closely associated, namely:

. Pragmatics
. Stylistics
. Description of English grammar and grammatical change . Grammatical annotation of corpus texts

Stream B: Discourse, Politics and Society

For this stream we invite contributions in the following areas:

. The use of corpora in discourse analysis . Corpus approaches to the study of new media . Applications of corpus approaches in the social sciences and humanities

Stream C: Language learning and teaching

For this stream we invite contributions in the following areas:

. Learner corpus research
. Corpus-based work in English language teaching, including ESP and EAP . Use of corpora in second language acquisition studies . Data-driven learning . Development of learner materials

General Programme

For the general programme, we invite contributions on as broad and inclusive a basis as possible. The areas in which we particularly welcome submissions include but are not limited to:

. Corpus methodology:
o Critical explorations of existing measures and methods in corpus linguistics; o New methods and techniques in corpus development, annotation and analysis; o New tools and techniques developed in corpus-based computational linguistics; o Advances in quantitative techniques.
. Theoretical corpus linguistics:
o The interface between corpus and linguistic theory; o Syntax, morphology, semantics; o Psycholinguistic and cognitive explorations; o Multi-lingual comparative and contrastive analysis; o Historical linguistics.
. Lexis and lexicon:
o Lexicography;
o Collocation and meaning in context.
. Sociolinguistics, language variation and applied linguistics:
o Regional and social variation in language; o Code-switching and bilingualism; o Forensic linguistics; o Genre, register and textual variation.

Plenary speakers

We are delighted to announce that the following speakers have accepted our invitation to give plenary lectures at CL2015:

. Douglas Biber (Northern Arizona University, USA) . Sylviane Granger (Université catholique de Louvain, Belgium) . Michaela Mahlberg (University of Nottingham, UK) . Alan Partington (Università di Bologna, Italy)

Call for pre-conference workshops

As noted above, CL2015 will include a workshop day on Monday 20th July 2015. We hereby issue a call for workshop proposals on any theme relevant to the conference.

“Workshops” may take two main forms.

The first type is the colloquium-style workshop, which operates as a mini-conference with its own programme committee and call for papers to be presented: proposals for this type of workshop should specify the scope of the workshop, who its organisers will be, and whether the creation of workshop proceedings is envisaged. Proposals should also provide an initial version of the text of the call for papers.

The other main type of workshop is a practical or applied workshop providing a demonstration of or training in some particular corpus linguistic technique or piece of software. In this case the proposal must explain the content of the workshop, provide an initial version of the text of a call for participation, and give an indication of the workshop’s IT requirements, if any.

We are also happy to consider innovative forms of workshop intermediate between colloquium-style workshop and practical workshop.

All proposals must in addition specify the proposed running time. Our timetable allows for the following lengths of workshop:

. Full-day workshop – up to 7 hours (plus lunch/breaks) . Half-day workshop – up to 3.5 hours (plus break) . Short workshop – up to 2 hours (single session)

There is no fixed format for workshop proposals, as long as they include all the details specified above. Proposals should be sent by email to Andrew Hardie by 15th December. We are happy to respond to informal expressions of interest in advance of formal submission of a proposal.

Call for papers, posters and panels

We invite submission of abstracts for papers, posters and panels on any topic relevant to the conference themes.

For this conference, we are requesting extended abstracts (750-1500 words), as we do not plan to produce a volume of conference proceedings. All abstracts will be peer-reviewed by the conference programme committee.

Paper presentations will consist of a 20 minute talk followed by 10 minutes for questions and discussion. Please note: paper submissions should present either complete research, or research in progress where at least some substantial results have been achieved. Work in progress which has yet to produce results can instead be submitted as a poster abstract.

Submissions for panel discussions should take the form of a single 1500 word abstract on behalf of all speakers to be on the panel. The abstract should include a note to specify whether the panel is intended to be 1 hour or 1.5 hours in length.

Submissions for poster presentations should be shorter (400-750 words). We especially welcome poster abstracts that (a) report on innovative research that is in its very earliest phases (b) report on new software or corpus data resources.

We especially encourage abstract submissions from early-career researchers, including postgraduate research students and postdoctoral researchers.

All abstracts must be submitted via the conference website; the submission system is now live (see http://ucrel.lancs.ac.uk/cl2015/call.php ). Details on how to submit an abstract to a specific conference stream are available on the website.

Key dates

. End October 2014 – call for papers; call for proposals for pre-conference workshops . 
7th January 2015 – deadline for abstract submission . 
16th January 2015 – earlybird registration opens . 
24th January 2015 – all abstract review outcomes will be returned by this date . 
30th March 2015 – end of earlybird registration (rates rise) .
 21st June 2015 – end of main registration (late registration not guaranteed, though we’ll try) . 
21st June 2015 – final deadline for cancellation with refund of registration fees . 
20th July 2015 – pre-conference workshop day . 
21st July to 24th July 2015 – main conference

General information

For information on registration, accommodation travel etc., see the conference website: http://ucrel.lancs.ac.uk/cl2015 ; email: cl2015@lancaster.ac.uk

The conference is hosted by the UCREL research centre (http://ucrel.lancs.ac.uk), which brings together the Department of Linguistics and English Language (http://www.ling.lancs.ac.uk/) with the School of Computing and Communications (http://www.scc.lancs.ac.uk/).

Local organising committee of CL 2015: Andrew Hardie (chair), Tony McEnery, Paul Rayson.

Innovative Corpus Query and Visualization Tools

QueryVis – Workshop on Innovative Corpus Query and Visualization Tools

at Nodalida 2015, Vilnius (Lithuania), May, 11th, 2015

Recent years have seen an increased interest in and availability of many different kinds of corpora. These range from small, but carefully annotated treebanks to large parallel corpora and very large monolingual corpora for big data research. It remains a challenge to query the multilayer annotations of small corpora, to efficiently access large corpora as well as to visualize the query results.

Invited Speaker

We are proud to announce that the plenary speech “Scaling out corpus technology: the open source query and analysis engine KorAP” will be presented by Marc Kupietz (Institut für Deutsche Sprache, Mannheim).

Topics to be covered
Querying corpora with multiple levels of annotation
Querying parallel and multi-parallel corpora
Visualization of annotation and alignment
Visualization of query results over very large corpora
Querying by example
Querying multimodal corpora

Workshop Format

We are planning for a half-day workshop with paper presentations, demos and an invited talk.
Call for Papers

We are seeking short papers (4 pages) or long papers (8 pages) on the topics specified above. Papers should be formated according to the Nodalida guidelines. Submission will be through EasyChair.
Important Dates

Paper submission 16. March 2015
Information to authors 17. April 2015
Final papers due 1. May 2015

Contact

Gintarė Grigonytė (Stockholm University)
queryvis_nodalida@ifi.uzh.ch

 

 

1st Intl. NLP for Informal Text- Deadline 17/4

Graph-Magnifier-icon

The 1st International Workshop on Natural Language Processing for Informal Text (NLPIT 2015)
In conjunction with The International Conference on Web Engineering(ICWE 2015)
June 23, 2015, Rotterdam, The Netherlands
http://wwwhome.cs.utwente.nl/~badiehm/nlpit2015/

Overview
The rapid growth of Internet usage in the last two decades adds new challenges to understand the informal user generated content (UGC) on the Internet. Textual UGC refers to textual posts on social media, blogs, emails, chat conversations, instant messages, forums, reviews, or advertisements that are created by end-users of an online system. A large portion of language used on textual UGC is informal. Informal text is the style of writing that disregard language grammars and uses a mixture of abbreviations and context dependent terms. The straightforward application of state-of-the-art Natural Language Processing approaches on informal text typically results in significantly degraded performance due to the following reasons: the lack of sentence structure; the lack of enough context required; the seldom entities involved; the noisy sparse contents of users’ contributions; and the untrusted facts contained. It is the aim of this work- shop to bring the attention of researchers to the opportunities and challenges involved in informal text processing. In particular, we are interested in discussing informal text modeling, normalization, mining, and understanding in addition to various application areas in which UGC is involved.

Topics

We invite submissions on topics that include, but are not limited to, the following core NLP approaches for informal UGC: language identification, classification, clustering, filtering, summarization, tokenization, segmentation, morphological analysis, POS tagging, parsing, named entity extraction, named entity disambiguation, relation/fact extraction, semantic annotation, sentiment analysis, language normalization, informality modeling and measuring, language generation, handling uncertainties, machine translation, ontology construction, dictionary construction, etc.

Submission

Authors are invited to submit original work not submitted to another conference or workshop. Workshop submissions could be a full paper or short paper. Paper length should not exceed 12 pages for full papers and 6 pages for short papers. All papers should follow the Springer’s LNCS format. Papers in PDF can be sent via the EasyChair Conference System https://easychair.org/conferences/?conf=nlpit2015. Each submission will receive, in addition to a meta-review, at least 2 peer double-blind reviews. Each full paper will get 25 minutes presentation time. Short papers will get 5 minutes presentation time in addition to a poster. Beside papers, we also plan to have an invited talk by a renowned scientist on a topic relevant for the workshop. Workshop proceedings will be published as part of the ICWE2015 workshop proceedings. To contact the NLPIT 2015 organization team, please send an e-mail to: nlpit2015@easychair.org.

Deadlines

– Submission deadline: April 17, 2015
– Notification deadline: May 17, 2015
– Camera-ready version: May 24, 2015
– Workshop date: June 23, 2015

Msg. distributed through the corpora list