Inproceedings,

DiaHClust: an Iterative Hierarchical Clustering Approach for Identifying Stages in Language Change

, and .
Proceedings of the International Workshop on Computational Approaches to Historical Language Change, page 126-135. Association for Computational Linguistics, (2019)
DOI: 10.18653/v1/W19-4716

Abstract

Language change is often assessed against a set of pre-determined time periods in order to be able to trace its diachronic trajectory. This is problematic, since a pre-determined periodization might obscure significant developments and lead to false assumptions about the data. Moreover, these time periods can be based on factors which are either arbitrary or non-linguistic, e.g., dividing the corpus data into equidistant stages or taking into account language-external events. Addressing this problem, in this paper we present a data-driven approach to periodization: `DiaHClust'. DiaHClust is based on iterative hierarchical clustering and offers a multi-layered perspective on change from text-level to broader time periods. We demonstrate the usefulness of DiaHClust via a case study investigating syntactic change in Icelandic, modelling the syntactic system of the language in terms of vectors of syntactic change.

Tags

Users

  • @sfbtrr161
  • @leonkokkoliadis

Comments and Reviews