Back to Search
Start Over
Identification of amino acids with sensitive nanoporous MoS2: towards machine learning-based prediction
- Source :
- npj 2D Materials and Applications, Vol 2, Iss 1, Pp 1-9 (2018)
- Publication Year :
- 2018
- Publisher :
- Springer Science and Business Media LLC, 2018.
-
Abstract
- Protein detection plays a key role in determining the single point mutations which can cause a variety of diseases. Nanopore sequencing provides a label-free, single base, fast and long reading platform, which makes it amenable for personalized medicine. A challenge facing nanopore technology is the noise in ionic current. Here, we show that a nanoporous single-layer molybdenum disulfide (MoS2) can detect individual amino acids in a polypeptide chain (16 units) with a high accuracy and distinguishability. Using extensive molecular dynamics simulations (with a total aggregate simulation time of 66 µs) and machine learning techniques, we featurize and cluster the ionic current and residence time of the 20 amino acids and identify the fingerprints of the signals. Using logistic regression, nearest neighbor, and random forest classifiers, the sensor reading is predicted with an accuracy of 72.45, 94.55, and 99.6%, respectively. In addition, using advanced ML classification techniques, we are able to theoretically predict over 2.8 million hypothetical sensor readings’ amino acid types. Molecular dynamics simulations combined with machine learning techniques enable the prediction of MoS2 nanopore sequencing capabilities. A team led by N. R. Aluru at the University of Illinois at Urbana-Champaign used logistic regression, nearest neighbor, and random forest classifiers to develop a machine learning-based platform capable of predicting the sensing capabilities of nanoporous, atomically thin MoS2. The material was shown to be able to identify individual amino acids in polypeptide chains with high accuracy and distinguishability. Twenty amino acids could be detected and categorized in different classes based on current-residence time training data, with an accuracy of up to 99.6%. These results show promise for the development of amino acid detection platforms with atomically thin materials assisted by machine learning.
- Subjects :
- 0301 basic medicine
Computer science
02 engineering and technology
Machine learning
computer.software_genre
k-nearest neighbors algorithm
lcsh:Chemistry
03 medical and health sciences
lcsh:TA401-492
General Materials Science
chemistry.chemical_classification
Nanoporous
business.industry
Mechanical Engineering
General Chemistry
021001 nanoscience & nanotechnology
Condensed Matter Physics
Random forest
Amino acid
Nanopore
Identification (information)
030104 developmental biology
lcsh:QD1-999
chemistry
Mechanics of Materials
lcsh:Materials of engineering and construction. Mechanics of materials
Artificial intelligence
Nanopore sequencing
Noise (video)
0210 nano-technology
business
computer
Subjects
Details
- ISSN :
- 23977132
- Volume :
- 2
- Database :
- OpenAIRE
- Journal :
- npj 2D Materials and Applications
- Accession number :
- edsair.doi.dedup.....e15e9b0ee1f7fa5721b2477bb3acf513
- Full Text :
- https://doi.org/10.1038/s41699-018-0060-8