1. Standardizing macromolecular structure files: further efforts are needed.
- Author
-
D'Arminio N, Giordano D, Scafuri B, Facchiano A, and Marabotti A
- Subjects
- Humans, SARS-CoV-2, Proteins chemistry, Molecular Structure, Databases, Protein, Protein Conformation, COVID-19
- Abstract
Investigating large datasets of biological information by automatic procedures may offer chances of progress in knowledge. Recently, tremendous improvements in structural biology have allowed the number of structures in the Protein Data Bank (PDB) archive to increase rapidly, in particular those for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)-associated proteins. However, their automatic analysis can be hampered by the nonuniform descriptors used by authors in some records of the PDB and PDBx/mmCIF files. In this opinion article we highlight the difficulties encountered in automating the analysis of hundreds of structures, suggesting that further standardization of the description of these molecular entities and of their attributes, generalized to the macromolecular structures contained in the PDB, might generate files more suitable for automatized analyses of a large number of structures., Competing Interests: Declaration of interests No interests are declared by the authors., (Copyright © 2023 Elsevier Ltd. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF