{"b5c8e90eb9dff3fc51414c4ad406eb2asarahschulz":{"DOI":"","ISBN":"","ISSN":"","URL":"http://dx.doi.org/10.1145/2850422","abstract":"As social media constitutes a valuable source for data analysis for a wide range of applications, the need\r\nfor handling such data arises. However, the nonstandard language used on social media poses problems\r\nfor natural language processing (NLP) tools, as these are typically trained on standard language material.\r\nWe propose a text normalization approach to tackle this problem. More specifically, we investigate the\r\nusefulness of a multimodular approach to account for the diversity of normalization issues encountered in\r\nuser-generated content (UGC). We consider three different types of UGC written in Dutch (SNS, SMS, and\r\ntweets) and provide a detailed analysis of the performance of the different modules and the overall system.\r\nWe also apply an extrinsic evaluation by evaluating the performance of a part-of-speech tagger, lemmatizer,\r\nand named-entity recognizer before and after normalization.","annote":"","author":[{"family":"Schulz","given":"Sarah"},{"family":"De Pauw","given":"Guy"},{"family":"De Clercq","given":"Orphée"},{"family":"Desmet","given":"Bart"},{"family":"Hoste","given":"Véronique"},{"family":"Daelemans","given":"Walter"},{"family":"Macken","given":"Lieve"}],"citation-label":"Schulz2016b","collection-editor":[],"collection-title":"","container-author":[],"container-title":"ACM TIST","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2016","July"]],"literal":"2016"},"event-place":"","id":"b5c8e90eb9dff3fc51414c4ad406eb2asarahschulz","interhash":"825bae0683727e2fb59c3bde118b30ec","intrahash":"b5c8e90eb9dff3fc51414c4ad406eb2a","issue":"4","issued":{"date-parts":[["2016","July"]],"literal":"2016"},"keyword":"content modular normalization user-generate","note":"","number":"4","page":"61","page-first":"61","publisher":"","publisher-place":"","status":"","title":"Multimodular Text Normalization of Dutch User-Generated Content","type":"article-journal","username":"sarahschulz","version":"","volume":"7"}}