1. Solubility-Weighted Index: fast and accurate prediction of protein solubility
- Author
-
Paul P. Gardner, Chun Shen Lim, and Bikash K. Bhandari
- Subjects
Statistics and Probability ,Web server ,Source code ,AcademicSubjects/SCI01060 ,Computer science ,media_common.quotation_subject ,computer.software_genre ,Biochemistry ,Protein expression ,law.invention ,Set (abstract data type) ,03 medical and health sciences ,0302 clinical medicine ,law ,Code (cryptography) ,Escherichia coli ,Solubility ,Molecular Biology ,030304 developmental biology ,media_common ,Mathematics ,0303 health sciences ,Computers ,A protein ,Proteins ,Original Papers ,Computer Science Applications ,Computational Mathematics ,Computational Theory and Mathematics ,Recombinant protein production ,Recombinant DNA ,Stock price index ,Data mining ,Protein solubility ,Biological system ,computer ,Sequence Analysis ,030217 neurology & neurosurgery ,Software - Abstract
MotivationRecombinant protein production is a widely used technique in the biotechnology and biomedical industries, yet only a quarter of target proteins are soluble and can therefore be purified.ResultsWe have discovered that global structural flexibility, which can be modeled by normalised B-factors, accurately predicts the solubility of 12,216 recombinant proteins expressed in Escherichia coli. We have optimised B-factors, and derived a new set of values for solubility scoring that further improves prediction accuracy. We call this new predictor the ‘Solubility-Weighted Index’ (SWI). Importantly, SWI outperforms many existing protein solubility prediction tools. Furthermore, we have developed ‘SoDoPE’ (Soluble Domain for Protein Expression), a web interface that allows users to choose a protein region of interest for predicting and maximising both protein expression and solubility.AvailabilityThe SoDoPE web server and source code are freely available at https://tisigner.com/sodope and https://github.com/Gardner-BinfLab/TISIGNER-ReactJS, respectively. The code and data for reproducing our analysis can be found at https://github.com/Gardner-BinfLab/SoDoPE_paper2020.
- Published
- 2020