<?xml version="1.0" encoding="UTF-8"?>
<rdf:RDF xmlns:community="http://www.bibsonomy.org/ontologies/2008/05/community#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:admin="http://webns.net/mvcb/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:syn="http://purl.org/rss/1.0/modules/syndication/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" xmlns:cc="http://web.resource.org/cc/" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" xmlns:swrc="http://swrc.ontoware.org/ontology#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns="http://purl.org/rss/1.0/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xml:base="https://puma.ub.uni-stuttgart.de/group/researchcode/bigdata"><owl:Ontology rdf:about=""><rdfs:comment>PUMA publications for /group/researchcode/bigdata</rdfs:comment><owl:imports rdf:resource="http://swrc.ontoware.org/ontology/portal"/></owl:Ontology><rdf:Description rdf:about="https://puma.ub.uni-stuttgart.de/bibtex/23bfc5faff91370571e13df24f7ac713a/diglezakis"><owl:sameAs rdf:resource="/uri/bibtex/23bfc5faff91370571e13df24f7ac713a/diglezakis"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InProceedings"/><swrc:date>Mon Aug 21 13:26:42 CEST 2017</swrc:date><swrc:booktitle>22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing</swrc:booktitle><swrc:pages>317-321</swrc:pages><swrc:title>Device-Driven Metadata Management Solutions for Scientific Big Data Use Cases</swrc:title><swrc:year>2014</swrc:year><swrc:keywords>bigdata forschungsdaten metadata </swrc:keywords><swrc:abstract>Big Data applications in science are producing
huge amounts of data, which require advanced processing, handling,
and analysis capabilities. For the organization of large
scale data sets it is essential to annotate these with metadata,
index them, and make them easily findable. In this paper we
investigate two scientific use cases from biology and photon
science, which entail complex situations in regard to data
volume, data rates and analysis requirements. The LSDMA
project provides an ideal context for this research, combining
both innovative R&amp;D on the processing, handling, and analysis
level and a wide range of research communities in need of
scalable solutions. To facilitate the advancement of data life
cycles we present preferred metadata management strategies.
In biology the Open Microscopy Environment (OME) and in
photon science NeXus/ICAT are presented. We show that these
are well suited for the respective data life cycles. To facilitate
searching across communities we discuss solutions involving
the Open Archive Initiative - Protocol for Metadata Harvesting
(OAI-PMH) and Apache Lucene/Solr.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing" swrc:key="eventtitle"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Richard Grunzke"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Jürgen Hesser"/></rdf:_2><rdf:_3><swrc:Person swrc:name="Jürgen Starek"/></rdf:_3><rdf:_4><swrc:Person swrc:name="Nick Kepper"/></rdf:_4><rdf:_5><swrc:Person swrc:name="Sandra Gesing"/></rdf:_5><rdf:_6><swrc:Person swrc:name="Marcus Hardt"/></rdf:_6><rdf:_7><swrc:Person swrc:name="Volker Hartmann"/></rdf:_7><rdf:_8><swrc:Person swrc:name="Stephan Kindermann"/></rdf:_8><rdf:_9><swrc:Person swrc:name="Jan Potthoff"/></rdf:_9><rdf:_10><swrc:Person swrc:name="Michael Hausmann"/></rdf:_10><rdf:_11><swrc:Person swrc:name="Ralph Müller-Pfefferkorn"/></rdf:_11><rdf:_12><swrc:Person swrc:name="René Jäkel"/></rdf:_12></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="https://puma.ub.uni-stuttgart.de/bibtex/23f6c27b8af75462b9f1026355ced1233/diglezakis"><owl:sameAs rdf:resource="/uri/bibtex/23f6c27b8af75462b9f1026355ced1233/diglezakis"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Article"/><owl:sameAs rdf:resource="http://stacks.iop.org/1742-6596/513/i=3/a=032047"/><swrc:date>Mon Aug 21 13:03:30 CEST 2017</swrc:date><swrc:journal>Journal of Physics: Conference Series</swrc:journal><swrc:number>3</swrc:number><swrc:pages>032047</swrc:pages><swrc:title>Optimization of data life cycles</swrc:title><swrc:volume>513</swrc:volume><swrc:year>2014</swrc:year><swrc:keywords>bigdata forschungsdaten </swrc:keywords><swrc:abstract>Data play a central role in most fields of science. In recent years, the amount of data from experiment, observation, and simulation has increased rapidly and data complexity has grown. Also, communities and shared storage have become geographically more distributed. Therefore, methods and techniques applied to scientific data need to be revised and partially be replaced, while keeping the community-specific needs in focus. The German Helmholtz Association project &#034;Large Scale Data Management and Analysis&#034; (LSDMA) aims to maximize the efficiency of data life cycles in different research areas, ranging from high energy physics to systems biology. In its five Data Life Cycle Labs (DLCLs), data experts closely collaborate with the communities in joint research and development to optimize the respective data life cycle. In addition, the Data Services Integration Team (DSIT) provides data analysis tools and services which are common to several DLCLs. This paper describes the various activities within LSDMA and focuses on the work performed in the DLCLs.</swrc:abstract><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="C Jung"/></rdf:_1><rdf:_2><swrc:Person swrc:name="M Gasthuber"/></rdf:_2><rdf:_3><swrc:Person swrc:name="A Giesler"/></rdf:_3><rdf:_4><swrc:Person swrc:name="M Hardt"/></rdf:_4><rdf:_5><swrc:Person swrc:name="J Meyer"/></rdf:_5><rdf:_6><swrc:Person swrc:name="F Rigoll"/></rdf:_6><rdf:_7><swrc:Person swrc:name="K Schwarz"/></rdf:_7><rdf:_8><swrc:Person swrc:name="R Stotzka"/></rdf:_8><rdf:_9><swrc:Person swrc:name="A Streit"/></rdf:_9></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="https://puma.ub.uni-stuttgart.de/bibtex/28e03b4aeb58ea1b36ed9a9a6c72d54d1/diglezakis"><owl:sameAs rdf:resource="/uri/bibtex/28e03b4aeb58ea1b36ed9a9a6c72d54d1/diglezakis"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InBook"/><swrc:date>Tue Jul 18 09:41:03 CEST 2017</swrc:date><swrc:address>Switzerland</swrc:address><swrc:booktitle>Whither turbulence and big data in the 21st century?</swrc:booktitle><swrc:chapter>27</swrc:chapter><swrc:pages>497-507</swrc:pages><swrc:publisher><swrc:Organization swrc:name="Springer"/></swrc:publisher><swrc:title>Turbulence in the Era of Big Data: Recent Experiences with Sharing Large Datasets</swrc:title><swrc:year>2017</swrc:year><swrc:keywords>bigdata forschungsdaten obib engineering iag </swrc:keywords><swrc:abstract>In the context of the contemporary push for “big data” in many fields, we review recent experiences building large databases for turbulence research. We consider data from direct numerical simulations (DNS) of various canonical flows and from experimental studies and related numerical simulations of wall-bounded turbulence, where the data storage needs are particularly challenging due to the very large range of length and time scales that exists in these flows at high Reynolds numbers. The focus is on a move from the traditional approach of data-handling and analysis where datasets are moved to individual computers, to one where much of the analysis is moved to the hosting system that stores these data. In this context we give a summary of a unique open numerical laboratory that archives over 200 Terabytes of DNS data, including full spatio-temporal flow fields of various canonical flows. Particular attention is given to the unique access requirements for large datasets to become open to the research community and the success the system has had in democratizing access to large datasets.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="978-3-319-41217-7" swrc:key="isbn"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="http://dx.doi.org/10.1007/978-3-319-41217-7" swrc:key="doi"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Charles Meneveau"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Ivan Marusic"/></rdf:_2></rdf:Seq></swrc:author><swrc:editor><rdf:Seq><rdf:_1><swrc:Person swrc:name="Andrew Pollard"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Luciano Castillo"/></rdf:_2><rdf:_3><swrc:Person swrc:name="Luminita Danaila"/></rdf:_3><rdf:_4><swrc:Person swrc:name="Mark Glauser"/></rdf:_4></rdf:Seq></swrc:editor></rdf:Description><rdf:Description rdf:about="https://puma.ub.uni-stuttgart.de/bibtex/299e8a3d8e32d794bbb8872e3e32bf103/diglezakis"><owl:sameAs rdf:resource="/uri/bibtex/299e8a3d8e32d794bbb8872e3e32bf103/diglezakis"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InBook"/><swrc:date>Tue Jul 18 09:33:00 CEST 2017</swrc:date><swrc:address>Switzerland</swrc:address><swrc:booktitle>Whither turbulence and big data in the 21st century?</swrc:booktitle><swrc:chapter>28</swrc:chapter><swrc:pages>509-515</swrc:pages><swrc:publisher><swrc:Organization swrc:name="Springer"/></swrc:publisher><swrc:title>Public Dissemination of Raw Turbulence Data</swrc:title><swrc:year>2017</swrc:year><swrc:keywords>bigdata forschungsdaten reuse dissemination simulation obib engineering iag </swrc:keywords><swrc:abstract>It is argued that there is a certain urgency to the discussion of whether raw data should be made publicly available within the turbulence community, and about the best ways, technology and rules for possible dissemination. Besides expressing the personal opinion that such sharing would be advantageous for the field, the urgency mostly arises from the danger that funding agencies or other institutions would otherwise set standards without proper community input. This paper is in part a plea for community action in that direction. As an example, the experience of the Madrid school of Aeronautics with the dissemination of numerical simulation results is briefly reviewed, including the present technological solutions and usage statistics.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="978-3-319-41215-3" swrc:key="isbn"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="10.1007/978-3-3 19-41217-7_28" swrc:key="doi"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Juan A. Sillero"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Javier Jiminéz"/></rdf:_2></rdf:Seq></swrc:author><swrc:editor><rdf:Seq><rdf:_1><swrc:Person swrc:name="Andrew Pollard"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Luciano Castillo"/></rdf:_2><rdf:_3><swrc:Person swrc:name="Luminita Danaila"/></rdf:_3><rdf:_4><swrc:Person swrc:name="Mark Glauser"/></rdf:_4></rdf:Seq></swrc:editor></rdf:Description></rdf:RDF>