@sp

DErivBase: Inducing and Evaluating a Derivational Morphology Resource for German

, , and . Proceedings of ACL, page 1201--1211. Sofia, Bulgaria, (2013)

Abstract

Derivational models are still an under-researched area in computational morphology. Even for German, a rather resource-rich language, there is a lack of large-coverage derivational knowledge. This paper describes a rule-based framework for inducing derivational families (i.e., clusters of lemmas in derivational relationships) and its application to create a high-coverage German resource, DERIVBASE, mapping over 280k lemmas into more than 17k non-singleton clusters. We focus on the rule component and a qualitative and quantitative evaluation. Our approach achieves up to 93\% precision and 71\% recall. We attribute the high precision to the fact that our rules are based on information from grammar books.

Links and resources

Tags