Back to Search Start Over

Developing a standardized but extendable framework to increase the findability of infectious disease datasets.

Authors :
Tsueng G
Cano MAA
Bento J
Czech C
Kang M
Pache L
Rasmussen LV
Savidge TC
Starren J
Wu Q
Xin J
Yeaman MR
Zhou X
Su AI
Wu C
Brown L
Shabman RS
Hughes LD
Source :
Scientific data [Sci Data] 2023 Feb 23; Vol. 10 (1), pp. 99. Date of Electronic Publication: 2023 Feb 23.
Publication Year :
2023

Abstract

Biomedical datasets are increasing in size, stored in many repositories, and face challenges in FAIRness (findability, accessibility, interoperability, reusability). As a Consortium of infectious disease researchers from 15 Centers, we aim to adopt open science practices to promote transparency, encourage reproducibility, and accelerate research advances through data reuse. To improve FAIRness of our datasets and computational tools, we evaluated metadata standards across established biomedical data repositories. The vast majority do not adhere to a single standard, such as Schema.org, which is widely-adopted by generalist repositories. Consequently, datasets in these repositories are not findable in aggregation projects like Google Dataset Search. We alleviated this gap by creating a reusable metadata schema based on Schema.org and catalogued nearly 400 datasets and computational tools we collected. The approach is easily reusable to create schemas interoperable with community standards, but customized to a particular context. Our approach enabled data discovery, increased the reusability of datasets from a large research consortium, and accelerated research. Lastly, we discuss ongoing challenges with FAIRness beyond discoverability.<br /> (© 2023. The Author(s).)

Details

Language :
English
ISSN :
2052-4463
Volume :
10
Issue :
1
Database :
MEDLINE
Journal :
Scientific data
Publication Type :
Academic Journal
Accession number :
36823157
Full Text :
https://doi.org/10.1038/s41597-023-01968-9