Categories
applied linguistics corpus linguistics

the logDice score in Word Sketches

Dice score gives very good results of collocation candidates. The only problem is that the values of the Dice score are usually very small numbers. We have defined logDice to fix this problem.

Values of the logDice have the following features:
– Theoretical maximum is 14, in case when all occurrences of X co-occur with Y and all occurrences of Y co-occur with X. Usually the value is less then 10.

– Value 0 means there is less than 1 co-occurrence of XY per 16,000 X or 16,000 Y. We can say that negative values means there is no statistical significance of XY collocation.

– Comparing two scores, plus 1 point means twice as often collocation, plus 7 points means roughly 100 times frequent collocation.

– The score does not depend on the total size of a corpus. The score combine relative frequencies of XY in relation to X and Y.

All these characteristics are useful orientation points for any field linguist working with collocation candidate lists.

From: A Lexicographer-Friendly Association Score, by Pavel Rychlý

Categories
EU European projects research research project

@eu_commission Assessing the Research Management Performance

 

Study on Assessing the Research Management Performance of Framework Programmes Projects

 

Categories
Academic discourse COCA corpora corpus linguistics text analysis text tools writing

Videos: Corpus Contemporary American English

This is a follow-up to our post Writing tools for researchers.

The basics

Using POS tags

Collocations

 

BNC & COCA Basic Query Syntax

COCA-basic-query-syntax

Categories
CFP conferences conferencias educación superior TICs

III Congreso Intl. Aprendizaje, Innovación y Competitividad

Graph-Magnifier-icon

La tercera edición de CINAIC se celebrará del 14 al 16 de Octubre de 2015 en Madrid. La organización del congreso corre a cargo de la Universidad Politécnica de Madrid, la Universidad de Zaragoza, la Universidad de las Palmas de Gran Canaria, la Universidad de Alicante, el CDTI (Ministerio de Economía y Competitividad), la Dirección General de Universidades (Ministerio de Educación, Cultura y Deporte) y los grupos de investigación GIDTIC (Universidad de Zaragoza) y GRIAL (Universidad de Salamanca).

El plazo para presentar trabajos, en cualquiera de sus áreas temáticas, finalizará el 2 de Mayo de 2015.

Como en ediciones anteriores, CINAIC 2015 promueve el intercambio de conocimiento entre los asistentes a través de las distintas actividades de socialización y dinamización.

Así mismo, CINAIC 2015 trabaja para que el máximo número de trabajos aceptados sean publicados en revistas científicas indexadas en los principales índices de referencia (JCR, Scopus y otras). Puede consultar el listado de revistas científicas ya confirmadas (se irá ampliando). En la edición anterior el 21% de los trabajos aceptados fueron publicados en revistas científicas. Así mismo, todos los trabajos aceptados serán publicados tanto en las actas del congreso (con ISBN) y en el Repositorio de Buenas Prácticas de Innovación Educativa (financiado por el Ministerio de Educación, Cultura y Deporte).

Correo electrónico: congresocinaic@gmail.com

Categories
social media social networks The Guardian

Life before and after Facebook @guardian

 

Children are reluctant to admit it, but after a bad day at school it might be a relief to have a place where the online world cannot reach you. The problem is whether the home can any longer offer that peace and quiet. It is as if the walls are no longer solid but permeable. Somehow the outside world now penetrates inside the average family home because of this continual contact with peers and others.

Read the whole story.

Categories
analysis of language applied linguistics CALL corpus linguistics research

A taxonomy of learner searches in DDL

 

Learners’ search patterns during corpus-based focus-on-form activities: A study on hands-on concordancing

Authors: Pérez-Paredes, Pascual; Sánchez-Tornel, María; Calero, Jose M. Alcaraz
Source: International Journal of Corpus Linguistics, Volume 17, Number 4, 2012, pp. 482-515(34)
Publisher: John Benjamins Publishing Company

Abstract:
Our research explores the search behaviour of EFL learners (n=24) by tracking their interaction with corpus-based materials during focus-on-form activities (Observe, Search the corpus, Rewriting). One set of learners made no use of web services other than the BNC during the central Search the corpus activity while the other set resorted to other web services and/or consultation guidelines. The performance of the second group was higher, the learners’ formulation of corpus queries on the BNC was unsophisticated and the students tended to use the BNC search interface to a great extent in the same way as they used Google or similar services. Our findings suggest that careful consideration should be given to the cognitive aspects concerning the initiation of corpus searches, the role of computer search interfaces, as well as the implementation of corpus-based language learning. Our study offers a taxonomy of learner searches that may be of interest in future research.