1. Solubility-Weighted Index: fast and accurate prediction of protein solubility.
- Author
-
Bhandari, Bikash K, Gardner, Paul P, and Lim, Chun Shen
- Subjects
- *
INTERNET servers , *FORECASTING , *RECOMBINANT proteins , *SOLUBILITY , *PROTEIN domains , *PROTEINS - Abstract
Motivation Recombinant protein production is a widely used technique in the biotechnology and biomedical industries, yet only a quarter of target proteins are soluble and can therefore be purified. Results We have discovered that global structural flexibility, which can be modeled by normalized B -factors, accurately predicts the solubility of 12 216 recombinant proteins expressed in Escherichia coli. We have optimized these B -factors, and derived a new set of values for solubility scoring that further improves prediction accuracy. We call this new predictor the 'Solubility-Weighted Index' (SWI). Importantly, SWI outperforms many existing protein solubility prediction tools. Furthermore, we have developed 'SoDoPE' (Soluble Domain for Protein Expression), a web interface that allows users to choose a protein region of interest for predicting and maximizing both protein expression and solubility. Availability and implementation The SoDoPE web server and source code are freely available at https://tisigner.com/sodope and https://github.com/Gardner-BinfLab/TISIGNER-ReactJS , respectively. The code and data for reproducing our analysis can be found at https://github.com/Gardner-BinfLab/SoDoPE%5fpaper%5f2020. Supplementary information Supplementary data are available at Bioinformatics online. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF