Author of the publication

EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation.

, , , , and . LREC, European Language Resources Association, (2002)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Dealing with heterogeneous big data when geoparsing historical corpora., , , , , and . BigData, page 80-83. IEEE Computer Society, (2014)EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation., , , , and . LREC, European Language Resources Association, (2002)Corpus Linguistics and South Asian Languages: Corpus Creation and Tool Development., , , , , , , , , and 3 other author(s). LLC, 19 (4): 509-524 (2004)Visual GISting: bringing together corpus linguistics and Geographical Information Systems., and . LLC, 26 (3): 297-314 (2011)Customising geoparsing and georeferencing for historical texts., , , , , , and . BigData, page 59-62. IEEE Computer Society, (2013)Combining Documentation and Research: Ongoing Work on an Endangered Language., , , and . IALP, page 169-172. IEEE Computer Society, (2012)From legacy encodings to Unicode: the graphical and logical principles in the scripts of South Asia.. Language Resources and Evaluation, 41 (1): 1-25 (2007)Automatically Analyzing Large Texts in a GIS Environment: The Registrar General's Reports and Cholera in the 19th Century., , , , and . Trans. GIS, 19 (2): 296-320 (2015)The Were -Subjunctive in British Rural Dialects: Marrying Corpus and Questionnaire Data., and . Computers and the Humanities, 37 (2): 205-228 (2003)