Back to Search Start Over

Probabilistic identification of saccharide moieties in biomolecules and their protein complexes.

Authors :
Dashti H
Westler WM
Wedell JR
Demler OV
Eghbalnia HR
Markley JL
Mora S
Source :
Scientific data [Sci Data] 2020 Jul 03; Vol. 7 (1), pp. 210. Date of Electronic Publication: 2020 Jul 03.
Publication Year :
2020

Abstract

The chemical composition of saccharide complexes underlies their biomedical activities as biomarkers for cardiometabolic disease, various types of cancer, and other conditions. However, because these molecules may undergo major structural modifications, distinguishing between compounds of saccharide and non-saccharide origin becomes a challenging computational problem that hinders the aggregation of information about their bioactive moieties. We have developed an algorithm and software package called "Cheminformatics Tool for Probabilistic Identification of Carbohydrates" (CTPIC) that analyzes the covalent structure of a compound to yield a probabilistic measure for distinguishing saccharides and saccharide-derivatives from non-saccharides. CTPIC analysis of the RCSB Ligand Expo (database of small molecules found to bind proteins in the Protein Data Bank) led to a substantial increase in the number of ligands characterized as saccharides. CTPIC analysis of Protein Data Bank identified 7.7% of the proteins as saccharide-binding. CTPIC is freely available as a webservice at (http://ctpic.nmrfam.wisc.edu).

Details

Language :
English
ISSN :
2052-4463
Volume :
7
Issue :
1
Database :
MEDLINE
Journal :
Scientific data
Publication Type :
Academic Journal
Accession number :
32620933
Full Text :
https://doi.org/10.1038/s41597-020-0547-y