Inproceedings,

Demand-Driven Data Provisioning in Data Lakes: BARENTS - A Tailorable Data Preparation Zone

Proceedings of the 23rd International Conference on Information Integration and Web Intelligence, pages 191–202. Linz, ACM (November 2021). iiWAS 2021 Best Paper Award.
DOI: 10.1145/3487664.3487784

Abstract

Data has never been as significant as it is today. It can be acquired virtually at will on any subject. Yet, this poses new challenges towards data management, especially in terms of storage (data is not consumed during processing, i.e., the data volume keeps growing), flexibility (new applications emerge), and operability (analysts are no IT experts). The goal has to be a demand-driven data provisioning, i.e., the right data must be available in the right form at the right time. Therefore, we introduce a tailorable data preparation zone for Data Lakes called BARENTS. It enables users to model in an ontology how to derive information from data and assign the information to use cases. The data is automatically processed based on this model and the refined data is made available to the appropriate use cases. Here, we focus on a resource-efficient data management strategy. BARENTS can be embedded seamlessly into established Big Data infrastructures, e.g., Data Lakes.
