{"4af36dbc29da579a3941c7efb2cce177diglezakis":{"DOI":"","ISBN":"","ISSN":"","URL":"","abstract":"","annote":"","author":[{"family":"Zaveri","given":"Amrapali"},{"family":"Rula","given":"Anisa"},{"family":"Maurino","given":"Andrea"},{"family":"Pietrobon","given":"Ricardo"},{"family":"Lehmann","given":"Jens"},{"family":"Auer","given":"Sören"}],"citation-label":"zaveri2016quality","collection-editor":[],"collection-title":"","container-author":[],"container-title":"Semantic Web","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2016"]],"literal":"2016"},"event-place":"","id":"4af36dbc29da579a3941c7efb2cce177diglezakis","interhash":"5d306174a9c00920835987cec4d4c7f3","intrahash":"4af36dbc29da579a3941c7efb2cce177","issue":"1","issued":{"date-parts":[["2016"]],"literal":"2016"},"keyword":"metadata linkedData quality","note":"","number":"1","number-of-pages":"30","page":"63--93","page-first":"63","publisher":"IOS Press","publisher-place":"","status":"","title":"Quality assessment for linked data: A survey","type":"article-journal","username":"diglezakis","version":"","volume":"7"},"82a59066ca07363abaae321592803ba4diglezakis":{"DOI":"10.1145/2566486.2568002","ISBN":"9781450327442","ISSN":"","URL":"https://doi.org/10.1145/2566486.2568002","abstract":"Linked Open Data (LOD) comprises an unprecedented volume of structured data on the Web. However, these datasets are of varying quality ranging from extensively curated datasets to crowdsourced or extracted data of often relatively low quality. We present a methodology for test-driven quality assessment of Linked Data, which is inspired by test-driven software development. We argue that vocabularies, ontologies and knowledge bases should be accompanied by a number of test cases, which help to ensure a basic level of quality. We present a methodology for assessing the quality of linked data resources, based on a formalization of bad smells and data quality problems. Our formalization employs SPARQL query templates, which are instantiated into concrete quality test case queries. Based on an extensive survey, we compile a comprehensive library of data quality test case patterns. We perform automatic test case instantiation based on schema constraints or semi-automatically enriched schemata and allow the user to generate specific test case instantiations that are applicable to a schema or dataset. We provide an extensive evaluation of five LOD datasets, manual test case instantiation for five schemas and automatic test case instantiations for all available schemata registered with Linked Open Vocabularies (LOV). One of the main advantages of our approach is that domain specific semantics can be encoded in the data quality test cases, thus being able to discover data quality problems beyond conventional quality heuristics.","annote":"","author":[{"family":"Kontokostas","given":"Dimitris"},{"family":"Westphal","given":"Patrick"},{"family":"Auer","given":"Sören"},{"family":"Hellmann","given":"Sebastian"},{"family":"Lehmann","given":"Jens"},{"family":"Cornelissen","given":"Roland"},{"family":"Zaveri","given":"Amrapali"}],"citation-label":"10.1145/2566486.2568002","collection-editor":[],"collection-title":"WWW '14","container-author":[],"container-title":"Proceedings of the 23rd International Conference on World Wide Web","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2014"]],"literal":"2014"},"event-place":"Seoul, Korea","id":"82a59066ca07363abaae321592803ba4diglezakis","interhash":"66a6d782062b615d9b4fa141ceb2473a","intrahash":"82a59066ca07363abaae321592803ba4","issue":"","issued":{"date-parts":[["2014"]],"literal":"2014"},"keyword":"metadata linkedData quality","misc":{"isbn":"9781450327442","numpages":"12","location":"Seoul, Korea","doi":"10.1145/2566486.2568002"},"note":"","number":"","number-of-pages":"11","page":"747–758","page-first":"747","publisher":"Association for Computing Machinery","publisher-place":"Seoul, Korea","status":"","title":"Test-driven evaluation of linked data quality","type":"paper-conference","username":"diglezakis","version":"","volume":""},"903a581ae186a899dea8daa2e1ccdadfdiglezakis":{"DOI":"10.1177/01655515211027775","ISBN":"","ISSN":"","URL":"/brokenurl#             https://doi.org/10.1177/01655515211027775","abstract":"Open Government Data (OGD) have the potential to support social and economic progress. However, this potential can be frustrated if these data remain unused. Although the literature suggests that OGD data sets’ metadata quality is one of the main factors affecting their use, to the best of our knowledge, no quantitative study provided evidence of this relationship. Considering about 400,000 data sets of 28 national, municipal and international OGD portals, we have programmatically analysed their usage, their metadata quality and the relationship between the two. Our analysis has highlighted three main findings. First, regardless of their size, the software platform adopted, and their administrative and territorial coverage, most OGD data sets are underutilised. Second, OGD portals pay varying attention to the quality of their data sets’ metadata. Third, we did not find clear evidence that data sets’ usage is positively correlated to better metadata publishing practices. Finally, we have considered other factors, such as data sets’ category, and some demographic characteristics of the OGD portals, and analysed their relationship with data sets’ usage, obtaining partially affirmative answers.","annote":"","author":[{"family":"Quarati","given":"Alfonso"}],"citation-label":"doi:10.1177/01655515211027775","collection-editor":[],"collection-title":"","container-author":[],"container-title":"Journal of Information Science","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2023"]],"literal":"2023"},"event-place":"","id":"903a581ae186a899dea8daa2e1ccdadfdiglezakis","interhash":"207dae856d6ae8d5c93fc013e5bfa77f","intrahash":"903a581ae186a899dea8daa2e1ccdadf","issue":"4","issued":{"date-parts":[["2023"]],"literal":"2023"},"keyword":"forschungsdaten metadata governmentalData usage openData quality","misc":{"eprint":"https://doi.org/10.1177/01655515211027775","doi":"10.1177/01655515211027775"},"note":"","number":"4","number-of-pages":"23","page":"887-910","page-first":"887","publisher":"","publisher-place":"","status":"","title":"Open Government Data: Usage trends and metadata quality","type":"article-journal","username":"diglezakis","version":"","volume":"49"},"8c5785f85de11a56c81a487b86b8efcddiglezakis":{"DOI":"10.1371/journal.pone.0246099","ISBN":"","ISSN":"","URL":"https://doi.org/10.1371/journal.pone.0246099","abstract":"The increasing amount of publicly available research data provides the opportunity to link and integrate data in order to create and prove novel hypotheses, to repeat experiments or to compare recent data to data collected at a different time or place. However, recent studies have shown that retrieving relevant data for data reuse is a time-consuming task in daily research practice. In this study, we explore what hampers dataset retrieval in biodiversity research, a field that produces a large amount of heterogeneous data. In particular, we focus on scholarly search interests and metadata, the primary source of data in a dataset retrieval system. We show that existing metadata currently poorly reflect information needs and therefore are the biggest obstacle in retrieving relevant data. Our findings indicate that for data seekers in the biodiversity domain environments, materials and chemicals, species, biological and chemical processes, locations, data parameters and data types are important information categories. These interests are well covered in metadata elements of domain-specific standards. However, instead of utilizing these standards, large data repositories tend to use metadata standards with domain-independent metadata fields that cover search interests only to some extent. A second problem are arbitrary keywords utilized in descriptive fields such as title, description or subject. Keywords support scholars in a full text search only if the provided terms syntactically match or their semantic relationship to terms used in a user query is known.","annote":"","author":[{"family":"Löffler","given":"Felicitas"},{"family":"Wesp","given":"Valentin"},{"family":"König-Ries","given":"Birgitta"},{"family":"Klan","given":"Friederike"}],"citation-label":"10.1371/journal.pone.0246099","collection-editor":[],"collection-title":"","container-author":[],"container-title":"PLOS ONE","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2021","03"]],"literal":"2021"},"event-place":"","id":"8c5785f85de11a56c81a487b86b8efcddiglezakis","interhash":"f94a187d839d2d70d3b0a6c7a58a635b","intrahash":"8c5785f85de11a56c81a487b86b8efcd","issue":"3","issued":{"date-parts":[["2021","03"]],"literal":"2021"},"keyword":"forschungsdaten biodiversity metadata retrievability quality","misc":{"doi":"10.1371/journal.pone.0246099"},"note":"","number":"3","number-of-pages":"35","page":"1-36","page-first":"1","publisher":"Public Library of Science","publisher-place":"","status":"","title":"Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs?","type":"article-journal","username":"diglezakis","version":"","volume":"16"},"5d791fe76a865e8a09b6764d3df16ed7diglezakis":{"DOI":"10.1145/2964909","ISBN":"","ISSN":"1936-1955","URL":"https://doi.org/10.1145/2964909","abstract":"The Open Data movement has become a driver for publicly available data on the Web. More and more data—from governments and public institutions but also from the private sector—are made available online and are mainly published in so-called Open Data portals. However, with the increasing number of published resources, there is a number of concerns with regards to the quality of the data sources and the corresponding metadata, which compromise the searchability, discoverability, and usability of resources.In order to get a more complete picture of the severity of these issues, the present work aims at developing a generic metadata quality assessment framework for various Open Data portals: We treat data portals independently from the portal software frameworks by mapping the specific metadata of three widely used portal software frameworks (CKAN, Socrata, OpenDataSoft) to the standardized Data Catalog Vocabulary metadata schema. We subsequently define several quality metrics, which can be evaluated automatically and in an efficient manner. Finally, we report findings based on monitoring a set of over 260 Open Data portals with 1.1M datasets. This includes the discussion of general quality issues, for example, the retrievability of data, and the analysis of our specific quality metrics.","annote":"","author":[{"family":"Neumaier","given":"Sebastian"},{"family":"Umbrich","given":"Jürgen"},{"family":"Polleres","given":"Axel"}],"citation-label":"10.1145/2964909","collection-editor":[],"collection-title":"","container-author":[],"container-title":"J. Data and Information Quality","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2016","10"]],"literal":"2016"},"event-place":"New York, NY, USA","id":"5d791fe76a865e8a09b6764d3df16ed7diglezakis","interhash":"3dd0f746b1beab1577af8e91c1fd452a","intrahash":"5d791fe76a865e8a09b6764d3df16ed7","issue":"1","issued":{"date-parts":[["2016","10"]],"literal":"2016"},"keyword":"forschungsdaten opendata metadata metrics quality","misc":{"numpages":"29","articleno":"2","issn":"1936-1955","issue_date":"November 2016","doi":"10.1145/2964909"},"note":"","number":"1","page":"","page-first":"","publisher":"Association for Computing Machinery","publisher-place":"New York, NY, USA","status":"","title":"Automated Quality Assessment of Metadata across Open Data Portals","type":"article-journal","username":"diglezakis","version":"","volume":"8"},"c2a901734515cb590f0d089b4ffae8dddiglezakis":{"DOI":"10.1109/COMPSACW.2013.32","ISBN":"","ISSN":"","URL":"","abstract":"","annote":"","author":[{"family":"Reiche","given":"Konrad Johannes"},{"family":"Höfig","given":"Edzard"}],"citation-label":"6605795","collection-editor":[],"collection-title":"","container-author":[],"container-title":"2013 IEEE 37th Annual Computer Software and Applications Conference Workshops","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2013"]],"literal":"2013"},"event-place":"","id":"c2a901734515cb590f0d089b4ffae8dddiglezakis","interhash":"0a1a12ca451498b1c6c919498f05aab9","intrahash":"c2a901734515cb590f0d089b4ffae8dd","issue":"","issued":{"date-parts":[["2013"]],"literal":"2013"},"keyword":"forschungsdaten metadata metrics quality","misc":{"doi":"10.1109/COMPSACW.2013.32"},"note":"","number":"","number-of-pages":"5","page":"236-241","page-first":"236","publisher":"","publisher-place":"","status":"","title":"Implementation of Metadata Quality Metrics and Application on Public Government Data","type":"paper-conference","username":"diglezakis","version":"","volume":""},"857617e681c5056771403a3fb7787eb3diglezakis":{"DOI":"10.1177/0165551520961048","ISBN":"","ISSN":"","URL":"/brokenurl#             https://doi.org/10.1177/0165551520961048","abstract":"Open research data (ORD) have been considered a driver of scientific transparency. However, data friction, as the phenomenon of data underutilisation for several causes, has also been pointed out. A factor often called into question for ORD low usage is the quality of the ORD and associated metadata. This work aims to illustrate the use of ORD, published by the Figshare scientific repository, concerning their scientific discipline, their type and compared with the quality of their metadata. Considering all the Figshare resources and carrying out a programmatic quality assessment of their metadata, our analysis highlighted two aspects. First, irrespective of the scientific domain considered, most ORD are under-used, but with exceptional cases which concentrate most researchers’ attention. Second, there was no evidence that the use of ORD is associated with good metadata publishing practices. These two findings opened to a reflection about the potential causes of such data friction.","annote":"","author":[{"family":"Quarati","given":"Alfonso"},{"family":"Raffaghelli","given":"Juliana E"}],"citation-label":"doi:10.1177/0165551520961048","collection-editor":[],"collection-title":"","container-author":[],"container-title":"Journal of Information Science","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2022"]],"literal":"2022"},"event-place":"","id":"857617e681c5056771403a3fb7787eb3diglezakis","interhash":"c9c4ee5e7d7fdec159be06346c7c3332","intrahash":"857617e681c5056771403a3fb7787eb3","issue":"4","issued":{"date-parts":[["2022"]],"literal":"2022"},"keyword":"forschungsdaten metadata usage quality","misc":{"eprint":"https://doi.org/10.1177/0165551520961048","doi":"10.1177/0165551520961048"},"note":"","number":"4","number-of-pages":"25","page":"423-448","page-first":"423","publisher":"","publisher-place":"","status":"","title":"Do researchers use open research data? Exploring the relationships between usage trends and metadata quality across scientific disciplines from the Figshare case","type":"article-journal","username":"diglezakis","version":"","volume":"48"},"63bc735a5e4636f62c50ead44a3525ebdiglezakis":{"DOI":"10.1186/s12874-021-01252-7","ISBN":"","ISSN":"14712288","URL":"https://doi.org/10.1186/s12874-021-01252-7","abstract":"No standards exist for the handling and reporting of data quality in health research. This work introduces a data quality framework for observational health research data collections with supporting software implementations to facilitate harmonized data quality assessments.","annote":"","author":[{"family":"Schmidt","given":"Carsten Oliver"},{"family":"Struckmann","given":"Stephan"},{"family":"Enzenbach","given":"Cornelia"},{"family":"Reineke","given":"Achim"},{"family":"Stausberg","given":"Jürgen"},{"family":"Damerow","given":"Stefan"},{"family":"Huebner","given":"Marianne"},{"family":"Schmidt","given":"Börge"},{"family":"Sauerbrei","given":"Willi"},{"family":"Richter","given":"Adrian"}],"citation-label":"schmidt2021facilitating","collection-editor":[],"collection-title":"","container-author":[],"container-title":"BMC Medical Research Methodology","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2021"]],"literal":"2021"},"event-place":"","id":"63bc735a5e4636f62c50ead44a3525ebdiglezakis","interhash":"591e500ac426b3bbd6c03f77f5e6476f","intrahash":"63bc735a5e4636f62c50ead44a3525eb","issue":"1","issued":{"date-parts":[["2021"]],"literal":"2021"},"keyword":"framework forschungsdaten health quality","misc":{"issn":"14712288","refid":"Schmidt2021","doi":"10.1186/s12874-021-01252-7"},"note":"","number":"1","page":"63--","page-first":"63","publisher":"","publisher-place":"","status":"","title":"Facilitating harmonized data quality assessments. A data quality framework for observational health research data collections with software implementations in R","type":"article-journal","username":"diglezakis","version":"","volume":"21"},"040f19393ccedf72d92ebc691c1ac2f5diglezakis":{"DOI":"10.1109/ACCESS.2021.3073455","ISBN":"","ISSN":"","URL":"","abstract":"","annote":"","author":[{"family":"Nogueras-Iso","given":"Javier"},{"family":"Lacasta","given":"Javier"},{"family":"Ureña-Cámara","given":"Manuel Antonio"},{"family":"Ariza-López","given":"Francisco Javier"}],"citation-label":"9405650","collection-editor":[],"collection-title":"","container-author":[],"container-title":"IEEE Access","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2021"]],"literal":"2021"},"event-place":"","id":"040f19393ccedf72d92ebc691c1ac2f5diglezakis","interhash":"2b0220507b5e4051a4e609bdce0375b2","intrahash":"040f19393ccedf72d92ebc691c1ac2f5","issue":"","issued":{"date-parts":[["2021"]],"literal":"2021"},"keyword":"opendata metadata quality","misc":{"doi":"10.1109/ACCESS.2021.3073455"},"note":"","number":"","number-of-pages":"18","page":"60364-60382","page-first":"60364","publisher":"","publisher-place":"","status":"","title":"Quality of Metadata in Open Data Portals","type":"article-journal","username":"diglezakis","version":"","volume":"9"},"0eee8583fec47b76dad4bbe11e175f4bdiglezakis":{"DOI":"10.2218/ijdc.v15i1.698","ISBN":"","ISSN":"","URL":"https://doi.org/10.2218%2Fijdc.v15i1.698","abstract":"","annote":"","author":[{"family":"Cosmo","given":"Roberto Di"},{"family":"Gruenpeter","given":"Morane"},{"family":"Marmol","given":"Bruno"},{"family":"Monteil","given":"Alain"},{"family":"Romary","given":"Laurent"},{"family":"Sadowska","given":"Jozefina"}],"citation-label":"Di_Cosmo_2020","collection-editor":[],"collection-title":"","container-author":[],"container-title":"International Journal of Digital Curation","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2020","08"]],"literal":"2020"},"event-place":"","id":"0eee8583fec47b76dad4bbe11e175f4bdiglezakis","interhash":"01b34332646c28914f666f59cb082442","intrahash":"0eee8583fec47b76dad4bbe11e175f4b","issue":"1","issued":{"date-parts":[["2020","08"]],"literal":"2020"},"keyword":"forschungsdaten software curation quality repository","misc":{"doi":"10.2218/ijdc.v15i1.698"},"note":"","number":"1","page":"16","page-first":"16","publisher":"Edinburgh University Library","publisher-place":"","status":"","title":"Curated Archiving of Research Software Artifacts: Lessons Learned from the French Open Archive (HAL)","type":"article-journal","username":"diglezakis","version":"","volume":"15"},"0eee8583fec47b76dad4bbe11e175f4bresearchcode":{"DOI":"10.2218/ijdc.v15i1.698","ISBN":"","ISSN":"","URL":"https://doi.org/10.2218%2Fijdc.v15i1.698","abstract":"","annote":"","author":[{"family":"Cosmo","given":"Roberto Di"},{"family":"Gruenpeter","given":"Morane"},{"family":"Marmol","given":"Bruno"},{"family":"Monteil","given":"Alain"},{"family":"Romary","given":"Laurent"},{"family":"Sadowska","given":"Jozefina"}],"citation-label":"Di_Cosmo_2020","collection-editor":[],"collection-title":"","container-author":[],"container-title":"International Journal of Digital Curation","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2020","08"]],"literal":"2020"},"event-place":"","id":"0eee8583fec47b76dad4bbe11e175f4bresearchcode","interhash":"01b34332646c28914f666f59cb082442","intrahash":"0eee8583fec47b76dad4bbe11e175f4b","issue":"1","issued":{"date-parts":[["2020","08"]],"literal":"2020"},"keyword":"forschungsdaten software curation quality repository from:diglezakis","misc":{"doi":"10.2218/ijdc.v15i1.698"},"note":"","number":"1","page":"16","page-first":"16","publisher":"Edinburgh University Library","publisher-place":"","status":"","title":"Curated Archiving of Research Software Artifacts: Lessons Learned from the French Open Archive (HAL)","type":"article-journal","username":"researchcode","version":"","volume":"15"},"1e3eaeadaa563b6b95f6a4f63a71cce8diglezakis":{"DOI":"","ISBN":"","ISSN":"","URL":"http://arxiv.org/abs/2007.11298","abstract":"As scientific progress highly depends on the quality of research data, there\r\nare strict requirements for data quality coming from the scientific community.\r\nA major challenge in data quality assurance is to localise quality problems\r\nthat are inherent to data. Due to the dynamic digitalisation in specific\r\nscientific fields, especially the humanities, different database technologies\r\nand data formats may be used in rather short terms to gain experiences. We\r\npresent a model-driven approach to analyse the quality of research data. It\r\nallows abstracting from the underlying database technology. Based on the\r\nobservation that many quality problems show anti-patterns, a data engineer\r\nformulates analysis patterns that are generic concerning the database format\r\nand technology. A domain expert chooses a pattern that has been adapted to a\r\nspecific database technology and concretises it for a domain-specific database\r\nformat. The resulting concrete patterns are used by data analysts to locate\r\nquality problems in their databases. As proof of concept, we implemented tool\r\nsupport that realises this approach for XML databases. We evaluated our\r\napproach concerning expressiveness and performance in the domain of cultural\r\nheritage based on a qualitative study on quality problems occurring in cultural\r\nheritage data.","annote":"","author":[{"family":"Kesper","given":"Arno"},{"family":"Wenz","given":"Viola"},{"family":"Taentzer","given":"Gabriele"}],"citation-label":"kesper2020detecting","collection-editor":[],"collection-title":"","container-author":[],"container-title":"","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2020"]],"literal":"2020"},"event-place":"","id":"1e3eaeadaa563b6b95f6a4f63a71cce8diglezakis","interhash":"50fba9fa611865cb94f67d4ae6d6e025","intrahash":"1e3eaeadaa563b6b95f6a4f63a71cce8","issue":"","issued":{"date-parts":[["2020"]],"literal":"2020"},"keyword":"forschungsdaten metadata quality","note":"cite arxiv:2007.11298Comment: 28 pages. This paper is an extended version of a paper to be  published in \"ACM/IEEE 23rd International Conference on Model Driven  Engineering Languages and Systems (MODELS '20)\". Added subtitle","number":"","page":"","page-first":"","publisher":"","publisher-place":"","status":"","title":"Detecting Quality Problems in Research Data: A Model-Driven Approach","type":"article","username":"diglezakis","version":"","volume":""},"b399927cecc700a8d82e28f0b6304780diglezakis":{"DOI":"","ISBN":"","ISSN":"","URL":"","abstract":"Insights based on data are omnipresent. However, in particular in modern data analytics applications, information about the underlying data often remain obscure, hindering accountable data analytics.\r\nRecent efforts have been put into better describing such data based on\r\nmetadata, similarly to what has been done in various scientific disciplines for transparent and reproducible research. Based on a detailed\r\nstudy of various metadata standards and proposals, we observe that existing metadata models do not yet sufficiently cover information that\r\nis relevant for data accountability. To fill this gap, this paper proposes\r\nLiQuID, a novel metadata model to make datasets accountable throughout their life cycle. It is more general than existing metadata models,\r\nwhich can be mapped to LiQuID. We validate LiQuID for the purpose\r\nof dataset accountability based on a real-world workload we created.","annote":"","author":[{"family":"Oppold","given":"Sarah"},{"family":"Herschel","given":"Melanie"}],"citation-label":"oppold2020accountable","collection-editor":[],"collection-title":"","container-author":[],"container-title":"","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2020"]],"literal":"2020"},"event-place":"","id":"b399927cecc700a8d82e28f0b6304780diglezakis","interhash":"ed7235584594c29440adfe8a80e92cbd","intrahash":"b399927cecc700a8d82e28f0b6304780","issue":"","issued":{"date-parts":[["2020"]],"literal":"2020"},"keyword":"metadata rechtlicheFragestellungen ethischeFragestellungen quality","note":"","number":"","page":"","page-first":"","publisher":"","publisher-place":"","status":"","title":"Accountable Data Analytics Start with Accountable Data: The LiQuID Metadata Model","type":"article-journal","username":"diglezakis","version":"","volume":""},"55ebfe6b1d200c784a9bfa47bc58496ediglezakis":{"DOI":"","ISBN":"","ISSN":"","URL":"","abstract":"","annote":"","author":[{"family":"Bruce","given":"Thomas R."},{"family":"Hillmann","given":"Diane I."}],"citation-label":"bruce2004continuum","collection-editor":[],"collection-title":"","container-author":[],"container-title":"","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2004"]],"literal":"2004"},"event-place":"","id":"55ebfe6b1d200c784a9bfa47bc58496ediglezakis","interhash":"e38c1f535c1eb3b7bc55e3e821ef90a8","intrahash":"55ebfe6b1d200c784a9bfa47bc58496e","issue":"","issued":{"date-parts":[["2004"]],"literal":"2004"},"keyword":"forschungsdaten metadata quality","note":"","number":"","page":"","page-first":"","publisher":"ALA Editions","publisher-place":"","status":"","title":"The Continuum of Metadata Quality: Defning, Expression, Exploiting","type":"chapter","username":"diglezakis","version":"","volume":"Metadata in Practice"},"6cd3c4534dc753ab35a879553d07827bdiglezakis":{"DOI":"10.5334/dsj-2015-002","ISBN":"","ISSN":"","URL":"https://doi.org/10.5334%2Fdsj-2015-002","abstract":"","annote":"","author":[{"family":"Cai","given":"Li"},{"family":"Zhu","given":"Yangyong"}],"citation-label":"Cai_2015","collection-editor":[],"collection-title":"","container-author":[],"container-title":"Data Science Journal","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2015","05"]],"literal":"2015"},"event-place":"","id":"6cd3c4534dc753ab35a879553d07827bdiglezakis","interhash":"5fa860942425dc3b213b7f9106ecb024","intrahash":"6cd3c4534dc753ab35a879553d07827b","issue":"0","issued":{"date-parts":[["2015","05"]],"literal":"2015"},"keyword":"forschungsdaten metadata quality","misc":{"doi":"10.5334/dsj-2015-002"},"note":"","number":"0","page":"2","page-first":"2","publisher":"Ubiquity Press, Ltd.","publisher-place":"","status":"","title":"The Challenges of Data Quality and Data Quality Assessment in the Big Data Era","type":"article-journal","username":"diglezakis","version":"","volume":"14"},"81b16701efcf1910f1b9bf75a99b3724diglezakis":{"DOI":"","ISBN":"","ISSN":"","URL":"http://www.rfii.de/?p=4043","abstract":"","annote":"","author":[{"family":"für Informationsinfrastrukturen","given":"Rat"}],"citation-label":"furinformationsinfrastrukturen2019herausforderung","collection-editor":[{"family":"RfII","given":""}],"collection-title":"","container-author":[{"family":"RfII","given":""}],"container-title":"","documents":[],"edition":"Zweite","editor":[{"family":"RfII","given":""}],"event-date":{"date-parts":[["2019"]],"literal":"2019"},"event-place":"Göttingen","id":"81b16701efcf1910f1b9bf75a99b3724diglezakis","interhash":"64cbbd02e70e3c74899897265b2124cd","intrahash":"81b16701efcf1910f1b9bf75a99b3724","issue":"","issued":{"date-parts":[["2019"]],"literal":"2019"},"keyword":"forschungsdaten quality","misc":{"language":"de, en"},"note":"","number":"","page":"172 S.","page-first":"172","publisher":"","publisher-place":"Göttingen","status":"","title":"Herausforderung Datenqualität - Empfehlungen zur Zukunftsfähigkeit von Forschung im digitalen Wandel","type":"report","username":"diglezakis","version":"","volume":""},"2e41be7e64302ca5cc3684f1adbc4cf4diglezakis":{"DOI":"","ISBN":"","ISSN":"","URL":"http://dblp.uni-trier.de/db/journals/ijmso/ijmso13.html#BalatsoukasRG18","abstract":"Poor quality metadata can have negative impact not only on the way research datasets are retrieved, shared and used by scientists, but also on the way research data repositories are managed and audited. The aim of the research reported in this paper was to perform a descriptive analysis of the Dublin Core's Subject metadata element and identify its quality problems, if any, in the context of the Dryad research data repository following a novel data-preprocessing method using SQL queries. The findings showed quality problems related to the lack of controlled vocabulary and standardisation, like the inconsistent use of singular and plural forms, adjectives and synonyms. This study has both practical and methodological implications for the evaluation of metadata and the improvement of the quality of the research data annotation process in open research data repositories.","annote":"","author":[{"family":"Balatsoukas","given":"Panos"},{"family":"Rousidis","given":"Dimitris"},{"family":"Garoufallou","given":"Emmanouel"}],"citation-label":"journals/ijmso/BalatsoukasRG18","collection-editor":[],"collection-title":"","container-author":[],"container-title":"IJMSO","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2018"]],"literal":"2018"},"event-place":"","id":"2e41be7e64302ca5cc3684f1adbc4cf4diglezakis","interhash":"688c622b2dff1d3f47d3bb916a2d34cd","intrahash":"2e41be7e64302ca5cc3684f1adbc4cf4","issue":"1","issued":{"date-parts":[["2018"]],"literal":"2018"},"keyword":"forschungsdaten metadata quality","misc":{"ee":"https://doi.org/10.1504/IJMSO.2018.096444"},"note":"","number":"1","number-of-pages":"7","page":"1-8","page-first":"1","publisher":"","publisher-place":"","status":"","title":"A method for examining metadata quality in open research datasets using the OAI-PMH and SQL queries: the case of the Dublin Core 'Subject' element and suggestions for user-centred metadata annotation design.","type":"article-journal","username":"diglezakis","version":"","volume":"13"},"e369aee7f0127d6e42a6076a31fbf16adiglezakis":{"DOI":"","ISBN":"","ISSN":"","URL":"","abstract":"","annote":"","author":[{"family":"Neumaier","given":"Sebastian"},{"family":"Umbrich","given":"Jürgen"},{"family":"Polleres","given":"Axel"}],"citation-label":"neumaier2016automated","collection-editor":[],"collection-title":"","container-author":[],"container-title":"Journal of Data and Information Quality (JDIQ)","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2016"]],"literal":"2016"},"event-place":"","id":"e369aee7f0127d6e42a6076a31fbf16adiglezakis","interhash":"3dd0f746b1beab1577af8e91c1fd452a","intrahash":"e369aee7f0127d6e42a6076a31fbf16a","issue":"1","issued":{"date-parts":[["2016"]],"literal":"2016"},"keyword":"forschungsdaten metadata quality repository","note":"","number":"1","page":"2","page-first":"2","publisher":"ACM","publisher-place":"","status":"","title":"Automated quality assessment of metadata across open data portals","type":"article-journal","username":"diglezakis","version":"","volume":"8"},"798e1964f4113bd9a5954ce322a39a95diglezakis":{"DOI":"10.1080/01639370902737240","ISBN":"","ISSN":"","URL":"https://www.tandfonline.com/doi/full/10.1080/01639370902737240","abstract":"This study presents the current state of research and practice on metadata quality through focus on the functional perspective on metadata quality, measurement, and evaluation criteria coupled with mechanisms for improving metadata quality. Quality metadata reflect the degree to which the metadata in question perform the core bibliographic functions of discovery, use, provenance, currency, authentication, and administration. The functional perspective is closely tied to the criteria and measurements used for assessing metadata quality. Accuracy, completeness, and consistency are the most common criteria used in measuring metadata quality in the literature. Guidelines embedded within a Web form or template perform a valuable function in improving the quality of the metadata. Results of the study indicate a pressing need for the building of a common data model that is interoperable across digital repositories.","annote":"","author":[{"family":"Park","given":"Jung-Ran"}],"citation-label":"doi:10.1080/01639370902737240","collection-editor":[],"collection-title":"","container-author":[],"container-title":"Cataloging & Classification Quarterly","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2009"]],"literal":"2009"},"event-place":"","id":"798e1964f4113bd9a5954ce322a39a95diglezakis","interhash":"c2593fe92c622904bc8e16f77c93b50f","intrahash":"798e1964f4113bd9a5954ce322a39a95","issue":"3-4","issued":{"date-parts":[["2009"]],"literal":"2009"},"keyword":"imported forschungsdaten metadata survey quality repository","misc":{"eprint":"https://doi.org/10.1080/01639370902737240","doi":"10.1080/01639370902737240"},"note":"","number":"3-4","number-of-pages":"15","page":"213-228","page-first":"213","publisher":"Routledge","publisher-place":"","status":"","title":"Metadata Quality in Digital Repositories: A Survey of the Current State of the Art","type":"article-journal","username":"diglezakis","version":"","volume":"47"},"d1da899bf6683d519b0dbdd02dd336fddiglezakis":{"DOI":"10.1080/01639374.2017.1358786","ISBN":"","ISSN":"","URL":"https://www.tandfonline.com/doi/full/10.1080/01639374.2017.1358786","abstract":"ABSTRACTThis article documents the steps taken to assess metadata errors within the IDEALS repository. It describes the workflows established to create accurate and consistent metadata, focusing especially on the batch ingest and retroactive metadata remediation processes. It also seeks to address theoretical issues surrounding the concept of metadata quality.","annote":"","author":[{"family":"Stein","given":"Ayla"},{"family":"Applegate","given":"Kelly J."},{"family":"Robbins","given":"Seth"}],"citation-label":"doi:10.1080/01639374.2017.1358786","collection-editor":[],"collection-title":"","container-author":[],"container-title":"Cataloging & Classification Quarterly","documents":[],"edition":"","editor":[],"event-date":{"date-parts":[["2017"]],"literal":"2017"},"event-place":"","id":"d1da899bf6683d519b0dbdd02dd336fddiglezakis","interhash":"ac526d7704eb1769ac23b2a539394082","intrahash":"d1da899bf6683d519b0dbdd02dd336fd","issue":"7-8","issued":{"date-parts":[["2017"]],"literal":"2017"},"keyword":"forschungsdaten metadata quality repository","misc":{"eprint":"https://doi.org/10.1080/01639374.2017.1358786","doi":"10.1080/01639374.2017.1358786"},"note":"","number":"7-8","number-of-pages":"22","page":"644-666","page-first":"644","publisher":"Routledge","publisher-place":"","status":"","title":"Achieving and Maintaining Metadata Quality: Toward a Sustainable Workflow for the IDEALS Institutional Repository","type":"article-journal","username":"diglezakis","version":"","volume":"55"}}