Inproceedings

Challenges of Research Data Management for High Performance Computing

International Conference on Theory and Practice of Digital Libraries, pages 140-151. Springer (2017)

Abstract

This paper addresses the challenges of research data management with a focus on High Performance Computing (HPC) and simulation data. Three main challenges are discussed: the Big Data qualities of HPC research data, technical data management, and organizational and administrative issues. From these challenges, requirements for feasible HPC research data management are derived and an alternative data life cycle is proposed. The requirements analysis includes recommendations based on a modified OAIS architecture: to meet the HPC requirement of a scalable system, metadata and data must not be stored together. Metadata keys are defined and organizational actions are recommended. Moreover, this paper introduces the role of a Scientific Data Manager, who is responsible for the institution’s data management and for taking stewardship of the data.

Users

  • @diglezakis
  • @bjoernschembera
