SciDataFlow: a tool for improving the flow of data through science.

Authors :
Buffalo, Vince
Source :
Bioinformatics. Jan 2024, Vol. 40 Issue 1, p1-3. 3p.
Publication Year :
2024

Abstract

Motivation: Managing data and code in open scientific research is complicated by two key problems: large datasets often cannot be stored alongside code in repository platforms like GitHub, and iterative analysis can lead to unnoticed changes to data, increasing the risk that analyses are based on older versions of data.

Results: SciDataFlow is a fast, concurrent command-line tool paired with a simple Data Manifest specification that streamlines tracking data changes, uploading data to remote repositories, and pulling in all data necessary to reproduce a computational analysis.

Availability and implementation: SciDataFlow is available at https://github.com/vsbuffalo/scidataflow.

[ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1367-4803
Volume :
40
Issue :
1
Database :
Academic Search Index
Journal :
Bioinformatics
Publication Type :
Academic Journal
Accession number :
175158021
Full Text :
https://doi.org/10.1093/bioinformatics/btad754