Challenges of Computational Processing of Code-Switching
Ö. Çetinoğlu, S. Schulz, and N. Vu. Proceedings of EMNLP Workshop on Computational Approaches to Linguistic Code Switching (CALCS 2016) @EMNLP, Austin, Texas, USA (November 2016)
Abstract
This paper addresses challenges of Natural Language Processing (NLP) on non-canonical multilingual data in which two or more languages are mixed. It refers to code-switching which has become more popular in our daily life and therefore obtains an increasing amount of attention from the research community. We report our experience that covers not only core NLP tasks such as normalisation, language identification, language modelling, part-of-speech tagging and dependency parsing but also more downstream ones such as machine translation and automatic speech recognition. We highlight and discuss the key problems for each of the tasks with supporting examples from different language pairs and relevant previous work.
%0 Conference Paper
%1 cetinoglu2016challanges
%A Çetinoğlu, Özlem
%A Schulz, Sarah
%A Vu, Ngoc Thang
%B Proceedings of EMNLP Workshop on Computational Approaches to Linguistic Code Switching (CALCS 2016) @EMNLP
%C Austin, Texas, USA
%D 2016
%K myown
%T Challenges of Computational Processing of Code-Switching
%X This paper addresses challenges of Natural Language Processing (NLP) on non-canonical multilingual data in which two or more languages are mixed. It refers to code-switching which has become more popular in our daily life and therefore obtains an increasing amount of attention from the research community. We report our experience that covers not only core NLP tasks such as normalisation, language identification, language modelling, part-of-speech tagging and dependency parsing but also more downstream ones such as machine translation and automatic speech recognition. We highlight and discuss the key problems for each of the tasks with supporting examples from different language pairs and relevant previous work.
@inproceedings{cetinoglu2016challanges,
abstract = {This paper addresses challenges of Natural
Language Processing (NLP) on non-canonical
multilingual data in which two or more languages are mixed. It refers to code-switching
which has become more popular in our
daily life and therefore obtains an increasing
amount of attention from the research community. We report our experience that covers not only core NLP tasks such as normalisation, language identification, language modelling, part-of-speech tagging and dependency
parsing but also more downstream ones such
as machine translation and automatic speech
recognition. We highlight and discuss the key
problems for each of the tasks with supporting
examples from different language pairs and
relevant previous work.},
added-at = {2016-09-21T15:32:03.000+0200},
address = {Austin, Texas, USA},
author = {Çetinoğlu, Özlem and Schulz, Sarah and Vu, Ngoc Thang},
biburl = {https://puma.ub.uni-stuttgart.de/bibtex/2a7f7c93fe429132dcff932d7433004be/sarahschulz},
booktitle = {Proceedings of EMNLP Workshop on Computational Approaches to Linguistic Code Switching (CALCS 2016) @EMNLP},
interhash = {06a3e19cd0f88c4d336ba80591d5b1fb},
intrahash = {a7f7c93fe429132dcff932d7433004be},
keywords = {myown},
month = {November},
timestamp = {2016-12-14T14:43:01.000+0100},
title = {Challenges of Computational Processing of Code-Switching},
year = 2016
}