Inproceedings,

New Domain, Major Effort? How Much Data is Necessary to Adapt a Temporal Tagger To the Voice Assistant Domain

T. Alam, A. Zarcone, and S. Padó.
Proceedings of IWCS, page 144--154. Online, (2021)

Abstract

Reliable tagging of Temporal Expressions (TEs, e.g., Book a table at L’Osteria for Sunday evening) is a central requirement for Voice Assistants (VAs). However, there is a dearth of resources and systems for the VA domain, since publicly-available temporal taggers are trained only on substantially different domains, such as news and clinical text. Since the cost of annotating large datasets is prohibitive, we investigate the trade-off between in-domain data and performance in DA-Time, a hybrid temporal tagger for the English VA domain which combines a neural architecture for robust TE recognition, with a parser-based TE normalizer. We find that transfer learning goes a long way even with as little as 25 in-domain sentences: DA-Time performs at the state of the art on the news domain, and substantially outperforms it on the VA domain.

BibTeX key: alam21:_new_domain_major_effor
entry type: inproceedings
address: Online
booktitle: Proceedings of IWCS
year: 2021
pages: 144--154
Document: https://iwcs2021.github.io/proceedings/iwcs/pdf/2021.iwcs-1.14.pdf

PUMA

New Domain, Major Effort? How Much Data is Necessary to Adapt a Temporal Tagger To the Voice Assistant Domain

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on