1. BioConvert: a comprehensive format converter for life sciences
- Author
Hugo Caro, Sulyvan Dollin, Anne Biton, Bryan Brancotte, Dimitri Desvillechabrol, Yoann Dufresne, Blaise Li, Etienne Kornobis, Frédéric Lemoine, Nicolas Maillet, Amandine Perrin, Nicolas Traut, Bertrand Néron, Thomas Cokelaer, Biomics (plateforme technologique), Institut Pasteur [Paris] (IP)-Université Paris Cité (UPCité), Hub Bioinformatique et Biostatistique - Bioinformatics and Biostatistics HUB, Algorithmes pour les séquences biologiques - Sequence Bioinformatics, Génomique évolutive des virus à ARN - Evolutionary genomics of RNA viruses, Génomique évolutive des Microbes / Microbial Evolutionary Genomics, Institut Pasteur [Paris] (IP)-Centre National de la Recherche Scientifique (CNRS)-Université Paris Cité (UPCité), Neuroanatomie Appliquée et Théorique / Applied and Theoretical Neuroanatomy (NAAT), France Génomique Consortium (ANR 10-INBS-09-08) and IBISA and the Biomics Platform of Institut Pasteur, Paris, France, and ANR-10-INBS-0009, France-Génomique, Organisation et montée en puissance d'une Infrastructure Nationale de Génomique (2010)
- Subjects
[INFO.INFO-DS]Computer Science [cs]/Data Structures and Algorithms [cs.DS], [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]
- Abstract
Bioinformatics is a field known for the numerous standards and formats that have been developed over the years. This plethora of formats, sometimes complementary and often redundant, poses many challenges to bioinformatics data analysts. They constantly need to find the best tool to convert their data into a suitable format, which is often a complex, technical, and time-consuming task. Moreover, these small yet important tasks are often difficult to make reproducible. To overcome these difficulties, we initiated BioConvert, a collaborative project to facilitate the conversion of life science data from one format to another. BioConvert aggregates existing software within a single framework and complements it with original code when needed. It provides a common interface that streamlines the user experience, so that users do not have to learn dozens of separate tools. Currently, BioConvert supports about 50 formats and 100 direct conversions in areas such as alignment, sequencing, phylogeny, and variant calling. In addition to being useful for end users, BioConvert can also be used by developers as a universal benchmarking framework for evaluating and comparing numerous conversion tools. Additionally, we provide a web server implementing a user-friendly online interface to BioConvert, allowing the community to use it directly.
- Published
2023