1. Structuring data analysis projects in the Open Science era with Kerblam!
- Author
-
Visentin, Luca, Munaron, Luca, and Ruffinatti, Federico Alessandro
- Subjects
Computer Science - Human-Computer Interaction ,Quantitative Biology - Other Quantitative Biology - Abstract
Structuring data analysis projects, that is, defining the layout of files and folders needed to analyze data using existing tools and novel code, largely follows personal preferences. In this work, we look at the structure of several data analysis project templates and find little structural overlap. We highlight the parts that are similar between them, and propose guiding principles to keep in mind when one wishes to create a new data analysis project. Finally, we present Kerblam!, a project management tool that can expedite project data management, execution of workflow managers, and sharing of the resulting workflow and analysis outputs. We hope that, by following these principles and using Kerblam!, the landscape of data analysis projects can become more transparent, understandable, and ultimately useful to the wider community.
- Published
- 2024