{"fe5c27bfa323a45b3c97de8f81f88da2diglezakis":{"DOI":"","ISBN":"","ISSN":"","URL":"https://gitlab.lrz.de/nfdi4ing/crawler","abstract":"HOMER (HPMC tool for Ontology-based Metadata Extraction and Re-use).","annote":"","author":[{"family":"Stemmer","given":"Christian"},{"family":"Hoppe","given":"Nils"},{"family":"Farnbacher","given":"Benjamin"},{"family":"Radoslav","given":"Ralev"},{"family":"Chiapparino","given":"Giuseppe"}],"citation-label":"stemmer_metadata_2021","collection-editor":[],"collection-title":"","container-author":[],"container-title":"","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2021"]],"literal":"2021"},"event-place":"","id":"fe5c27bfa323a45b3c97de8f81f88da2diglezakis","interhash":"960db829af8e246ca25a1582ec3583a7","intrahash":"fe5c27bfa323a45b3c97de8f81f88da2","issue":"","issued":{"date-parts":[["2021"]],"literal":"2021"},"keyword":"hpc-computing metadata crawler NFDI4ING extraction","misc":{"urldate":"2023-08-11"},"note":"[000][29*][000]","number":"","page":"","page-first":"","publisher":"","publisher-place":"","status":"","title":"A metadata crawler for simulation data management","type":"article","username":"diglezakis","version":"","volume":""},"a01fba3c690fedfaaa51e2be254cafb6diglezakis":{"DOI":"https://doi.org/10.1007/s11227-020-03602-6","ISBN":"","ISSN":"","URL":"https://link.springer.com/article/10.1007/s11227-020-03602-6","abstract":"The deluge of dark data is about to happen. Lacking data management capabilities, especially in the field of supercomputing, and missing data documentation (i.e., missing metadata annotation) constitute a major source of dark data. The present work contributes to addressing this challenge by presenting ExtractIng, a generic automated metadata extraction toolkit. Existing metadata information of simulation output files scattered through the file system, can be aggregated, parsed and converted to the EngMeta metadata model. Use cases from computational engineering are considered to demonstrate the viability of ExtractIng. The evaluation results show that the metadata extraction is simulation-code independent in the sense that it can handle data outputs from various fields of science, is easy to integrate into simulation workflows and compatible with a multitude of computational environments.","annote":"","author":[{"family":"Schembera","given":"Björn"}],"citation-label":"schembera2021rainbow","collection-editor":[{"family":"Arabnia","given":"Hamid"}],"collection-title":"","container-author":[{"family":"Arabnia","given":"Hamid"}],"container-title":"The Journal of Supercomputing","documents":[],"edition":"","editor":[{"family":"Arabnia","given":"Hamid"}],"event-date":{"date-parts":[["2021"]],"literal":"2021"},"event-place":"","id":"a01fba3c690fedfaaa51e2be254cafb6diglezakis","interhash":"b6eac40c2b6d9e29e99c8eece7d9e01b","intrahash":"a01fba3c690fedfaaa51e2be254cafb6","issue":"","issued":{"date-parts":[["2021"]],"literal":"2021"},"keyword":"forschungsdaten hpc-computing metadata tools automated","misc":{"doi":"https://doi.org/10.1007/s11227-020-03602-6"},"note":"","number":"","page":"","page-first":"","publisher":"","publisher-place":"","status":"","title":"Like a rainbow in the dark: metadata annotation for HPC applications in the age of dark data","type":"article-journal","username":"diglezakis","version":"","volume":""},"ab2ddb1039b00ff1b73c9ab682a82aaediglezakis":{"DOI":"10.1021/acs.jced.9b00739","ISBN":"","ISSN":"","URL":"https://doi.org/10.1021%2Facs.jced.9b00739","abstract":"","annote":"","author":[{"family":"Horsch","given":"Martin Thomas"},{"family":"Niethammer","given":"Christoph"},{"family":"Boccardo","given":"Gianluca"},{"family":"Carbone","given":"Paola"},{"family":"Chiacchiera","given":"Silvia"},{"family":"Chiricotto","given":"Mara"},{"family":"Elliott","given":"Joshua D."},{"family":"Lobaskin","given":"Vladimir"},{"family":"Neumann","given":"Philipp"},{"family":"Schiffels","given":"Peter"},{"family":"Seaton","given":"Michael A."},{"family":"Todorov","given":"Ilian T."},{"family":"Vrabec","given":"Jadran"},{"family":"Cavalcanti","given":"Welchy Leite"}],"citation-label":"Horsch_2019","collection-editor":[],"collection-title":"","container-author":[],"container-title":"Journal of Chemical & Engineering Data","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2019","12"]],"literal":"2019"},"event-place":"","id":"ab2ddb1039b00ff1b73c9ab682a82aaediglezakis","interhash":"bcb2f3681bb3cff5c214b58d19429556","intrahash":"ab2ddb1039b00ff1b73c9ab682a82aae","issue":"3","issued":{"date-parts":[["2019","12"]],"literal":"2019"},"keyword":"forschungsdaten hpc-computing metadata ontologie molecularSimulation","misc":{"doi":"10.1021/acs.jced.9b00739"},"note":"","number":"3","number-of-pages":"16","page":"1313--1329","page-first":"1313","publisher":"American Chemical Society (ACS)","publisher-place":"","status":"","title":"Semantic Interoperability and Characterization of Data Provenance in Computational Molecular Engineering","type":"article-journal","username":"diglezakis","version":"","volume":"65"},"c7971d8d6a474e4c5bf8a30d62d04edediglezakis":{"DOI":"https://doi.org/10.1016/j.commatsci.2015.09.013","ISBN":"","ISSN":"0927-0256","URL":"http://www.sciencedirect.com/science/article/pii/S0927025615005820","abstract":"Abstract Computational science has seen in the last decades a spectacular rise in the scope, breadth, and depth of its efforts. Notwithstanding this prevalence and impact, it is often still performed using the renaissance model of individual artisans gathered in a workshop, under the guidance of an established practitioner. Great benefits could follow instead from adopting concepts and tools coming from computer science to manage, preserve, and share these computational efforts. We illustrate here our paradigm sustaining such vision, based around the four pillars of Automation, Data, Environment, and Sharing. We then discuss its implementation in the open-source AiiDA platform (http://www.aiida.net), that has been tuned first to the demands of computational materials science. AiiDA’s design is based on directed acyclic graphs to track the provenance of data and calculations, and ensure preservation and searchability. Remote computational resources are managed transparently, and automation is coupled with data storage to ensure reproducibility. Last, complex sequences of calculations can be encoded into scientific workflows. We believe that AiiDA’s design and its sharing capabilities will encourage the creation of social ecosystems to disseminate codes, data, and scientific workflows.","annote":"","author":[{"family":"Pizzi","given":"Giovanni"},{"family":"Cepellotti","given":"Andrea"},{"family":"Sabatini","given":"Riccardo"},{"family":"Marzari","given":"Nicola"},{"family":"Kozinsky","given":"Boris"}],"citation-label":"Pizzi2016218","collection-editor":[],"collection-title":"","container-author":[],"container-title":"Computational Materials Science","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2016"]],"literal":"2016"},"event-place":"","id":"c7971d8d6a474e4c5bf8a30d62d04edediglezakis","interhash":"8aa55fa398b71ec36da3eb98d8fff42e","intrahash":"c7971d8d6a474e4c5bf8a30d62d04ede","issue":"","issued":{"date-parts":[["2016"]],"literal":"2016"},"keyword":"framework forschungsdaten hpc-computing metadata software reproducibility softwareLizenzMotivation repository","misc":{"issn":"0927-0256","doi":"https://doi.org/10.1016/j.commatsci.2015.09.013"},"note":"","number":"","number-of-pages":"12","page":"218 - 230","page-first":"218","publisher":"","publisher-place":"","status":"","title":"AiiDA: automated interactive infrastructure and database for computational science","type":"article-journal","username":"diglezakis","version":"","volume":"111"}}