Challenges of Computational Processing of Code-Switching

Abstract

This paper addresses the challenges of Natural Language Processing (NLP) on non-canonical multilingual data in which two or more languages are mixed. This phenomenon, known as code-switching, has become increasingly common in daily life and is therefore attracting growing attention from the research community. We report on our experience, which covers not only core NLP tasks such as normalisation, language identification, language modelling, part-of-speech tagging, and dependency parsing, but also more downstream ones such as machine translation and automatic speech recognition. We highlight and discuss the key problems for each task, with supporting examples from different language pairs and relevant previous work.

BibTeX key: cetinoglu2016challanges
entry type: inproceedings
address: Austin, Texas, USA
booktitle: Proceedings of EMNLP Workshop on Computational Approaches to Linguistic Code Switching (CALCS 2016) @EMNLP
year: 2016
month: November
