17 results on '"A. Khudyakov"'
Search Results
2. Primary case inference in viral outbreaks through analysis of intra-host variant population
- Author
-
J. Walker Gussler, David S. Campo, Zoya Dimitrova, Pavel Skums, and Yury Khudyakov
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Investigation of outbreaks to identify the primary case is crucial for the interruption and prevention of transmission of infectious diseases. These individuals may have a higher risk of participating in near future transmission events when compared to the other patients in the outbreak, so directing more transmission prevention resources towards these individuals is a priority. Although the genetic characterization of intra-host viral populations can aid the identification of transmission clusters, it is not trivial to determine the directionality of transmissions during outbreaks, owing to complexity of viral evolution. Here, we present a new computational framework, PYCIVO: primary case inference in viral outbreaks. This framework expands upon our earlier work in development of QUENTIN, which builds a probabilistic disease transmission tree based on simulation of evolution of intra-host hepatitis C virus (HCV) variants between cases involved in direct transmission during an outbreak. PYCIVO improves upon QUENTIN by also adding a custom heterogeneity index and identifying the scenario when the primary case may have not been sampled. Results These approaches were validated using a set of 105 sequence samples from 11 distinct HCV transmission clusters identified during outbreak investigations, in which the primary case was epidemiologically verified. Both models can detect the correct primary case in 9 out of 11 transmission clusters (81.8%). However, while QUENTIN issues erroneous predictions on the remaining 2 transmission clusters, PYCIVO issues a null output for these clusters, giving it an effective prediction accuracy of 100%. To further evaluate accuracy of the inference, we created 10 modified transmission clusters in which the primary case had been removed. In this scenario, PYCIVO was able to correctly identify that there was no primary case in 8/10 (80%) of these modified clusters. This model was validated with HCV; however, this approach may be applicable to other microbial pathogens. Conclusions PYCIVO improves upon QUENTIN by also implementing a custom heterogeneity index which empowers PYCIVO to make the important ‘No primary case’ prediction. One or more samples, possibly including the primary case, may have not been sampled, and this designation is meant to account for these scenarios.
- Published
- 2022
- Full Text
- View/download PDF
3. Convex hulls in hamming space enable efficient search for similarity and clustering of genomic sequences
- Author
-
Campo, David S. and Khudyakov, Yury
- Published
- 2020
- Full Text
- View/download PDF
4. Fast estimation of genetic relatedness between members of heterogeneous populations of closely related genomic variants
- Author
-
Viachaslau Tsyvina, David S. Campo, Seth Sims, Alex Zelikovsky, Yury Khudyakov, and Pavel Skums
- Subjects
Similarity search ,Similarity join ,K-mer ,Filtering ,Edit distance ,Hamming distance ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Many biological analysis tasks require extraction of families of genetically similar sequences from large datasets produced by Next-generation Sequencing (NGS). Such tasks include detection of viral transmissions by analysis of all genetically close pairs of sequences from viral datasets sampled from infected individuals or studying of evolution of viruses or immune repertoires by analysis of network of intra-host viral variants or antibody clonotypes formed by genetically close sequences. The most obvious naïeve algorithms to extract such sequence families are impractical in light of the massive size of modern NGS datasets. Results In this paper, we present fast and scalable k-mer-based framework to perform such sequence similarity queries efficiently, which specifically targets data produced by deep sequencing of heterogeneous populations such as viruses. It shows better filtering quality and time performance when comparing to other tools. The tool is freely available for download at https://github.com/vyacheslav-tsivina/signature-sj Conclusion The proposed tool allows for efficient detection of genetic relatedness between genomic samples produced by deep sequencing of heterogeneous populations. It should be especially useful for analysis of relatedness of genomes of viruses with unevenly distributed variable genomic regions, such as HIV and HCV. For the future we envision, that besides applications in molecular epidemiology the tool can also be adapted to immunosequencing and metagenomics data.
- Published
- 2018
- Full Text
- View/download PDF
5. Automated quality control for a molecular surveillance system
- Author
-
Seth Sims, Atkinson G. Longmire, David S. Campo, Sumathi Ramachandran, Magdalena Medrzycki, Lilia Ganova-Raeva, Yulin Lin, Amanda Sue, Hong Thai, Alexander Zelikovsky, and Yury Khudyakov
- Subjects
HVR1 ,HCV ,Transmission ,Outbreak detection ,Molecular surveillance ,Quality control ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Molecular surveillance and outbreak investigation are important for elimination of hepatitis C virus (HCV) infection in the United States. A web-based system, Global Hepatitis Outbreak and Surveillance Technology (GHOST), has been developed using Illumina MiSeq-based amplicon sequence data derived from the HCV E1/E2-junction genomic region to enable public health institutions to conduct cost-effective and accurate molecular surveillance, outbreak detection and strain characterization. However, as there are many factors that could impact input data quality to which the GHOST system is not completely immune, accuracy of epidemiological inferences generated by GHOST may be affected. Here, we analyze the data submitted to the GHOST system during its pilot phase to assess the nature of the data and to identify common quality concerns that can be detected and corrected automatically. Results The GHOST quality control filters were individually examined, and quality failure rates were measured for all samples, including negative controls. New filters were developed and introduced to detect primer dimers, loss of specimen-specific product, or short products. The genotyping tool was adjusted to improve the accuracy of subtype calls. The identification of “chordless” cycles in a transmission network from data generated with known laboratory-based quality concerns allowed for further improvement of transmission detection by GHOST in surveillance settings. Parameters derived to detect actionable common quality control anomalies were incorporated into the automatic quality control module that rejects data depending on the magnitude of a quality problem, and warns and guides users in performing correctional actions. The guiding responses generated by the system are tailored to the GHOST laboratory protocol. Conclusions Several new quality control problems were identified in MiSeq data submitted to GHOST and used to improve protection of the system from erroneous data and users from erroneous inferences. The GHOST system was upgraded to include identification of causes of erroneous data and recommendation of corrective actions to laboratory users.
- Published
- 2018
- Full Text
- View/download PDF
6. Convex hulls in hamming space enable efficient search for similarity and clustering of genomic sequences
- Author
-
David S. Campo and Yury Khudyakov
- Subjects
Convex hull ,Computer science ,Population ,Hepacivirus ,lcsh:Computer applications to medicine. Medical informatics ,Biochemistry ,Upper and lower bounds ,Clustering ,03 medical and health sciences ,0302 clinical medicine ,Structural Biology ,Cluster Analysis ,Humans ,Centrality ,education ,Hamming space ,Cluster analysis ,Molecular Biology ,Categorical variable ,lcsh:QH301-705.5 ,030304 developmental biology ,0303 health sciences ,education.field_of_study ,business.industry ,Applied Mathematics ,Research ,Pattern recognition ,Hamming distance ,Genomics ,Hamming ,Computer Science Applications ,lcsh:Biology (General) ,Population distance ,lcsh:R858-859.7 ,030211 gastroenterology & hepatology ,Artificial intelligence ,business ,Hamming code ,Algorithms - Abstract
Background In molecular epidemiology, comparison of intra-host viral variants among infected persons is frequently used for tracing transmissions in human population and detecting viral infection outbreaks. Application of Ultra-Deep Sequencing (UDS) immensely increases the sensitivity of transmission detection but brings considerable computational challenges when comparing all pairs of sequences. We developed a new population comparison method based on convex hulls in hamming space. We applied this method to a large set of UDS samples obtained from unrelated cases infected with hepatitis C virus (HCV) and compared its performance with three previously published methods. Results The convex hull in hamming space is a data structure that provides information on: (1) average hamming distance within the set, (2) average hamming distance between two sets; (3) closeness centrality of each sequence; and (4) lower and upper bound of all the pairwise distances among the members of two sets. This filtering strategy rapidly and correctly removes 96.2% of all pairwise HCV sample comparisons, outperforming all previous methods. The convex hull distance (CHD) algorithm showed variable performance depending on sequence heterogeneity of the studied populations in real and simulated datasets, suggesting the possibility of using clustering methods to improve the performance. To address this issue, we developed a new clustering algorithm, k-hulls, that reduces heterogeneity of the convex hull. This efficient algorithm is an extension of the k-means algorithm and can be used with any type of categorical data. It is 6.8-times more accurate than k-mode, a previously developed clustering algorithm for categorical data. Conclusions CHD is a fast and efficient filtering strategy for massively reducing the computational burden of pairwise comparison among large samples of sequences, and thus, aiding the calculation of transmission links among infected individuals using threshold-based methods. In addition, the convex hull efficiently obtains important summary metrics for intra-host viral populations.
- Published
- 2020
7. Efficient error correction for next-generation sequencing of viral amplicons
- Author
-
Skums Pavel, Dimitrova Zoya, Campo David S, Vaughan Gilberto, Rossi Livia, Forbi Joseph C, Yokosawa Jonny, Zelikovsky Alex, and Khudyakov Yury
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. Results In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. Conclusions Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses. The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm
- Published
- 2012
- Full Text
- View/download PDF
8. Automated quality control for a molecular surveillance system
- Author
-
Alexander Zelikovsky, Magdalena Medrzycki, Yulin Lin, Sumathi Ramachandran, Yury Khudyakov, Amanda Sue, Lilia Ganova-Raeva, Hong Thai, Seth Sims, Atkinson G. Longmire, and David S. Campo
- Subjects
0301 basic medicine ,HVR1 ,Genotyping Techniques ,Computer science ,Hepacivirus ,medicine.disease_cause ,computer.software_genre ,Biochemistry ,Disease Outbreaks ,Automation ,0302 clinical medicine ,Structural Biology ,Epidemiology ,030212 general & internal medicine ,lcsh:QH301-705.5 ,media_common ,Applied Mathematics ,Reference Standards ,Hepatitis C ,Computer Science Applications ,Identification (information) ,Molecular surveillance ,Population Surveillance ,HCV ,lcsh:R858-859.7 ,Data mining ,Quality Control ,medicine.medical_specialty ,media_common.quotation_subject ,Hepatitis C virus ,Control (management) ,lcsh:Computer applications to medicine. Medical informatics ,03 medical and health sciences ,Outbreak detection ,medicine ,Humans ,Transmission ,Quality (business) ,Molecular Biology ,Hepatitis ,Protocol (science) ,Public health ,Methodology ,Outbreak ,medicine.disease ,United States ,030104 developmental biology ,lcsh:Biology (General) ,Transmission network ,Data quality ,computer - Abstract
Background Molecular surveillance and outbreak investigation are important for elimination of hepatitis C virus (HCV) infection in the United States. A web-based system, Global Hepatitis Outbreak and Surveillance Technology (GHOST), has been developed using Illumina MiSeq-based amplicon sequence data derived from the HCV E1/E2-junction genomic region to enable public health institutions to conduct cost-effective and accurate molecular surveillance, outbreak detection and strain characterization. However, as there are many factors that could impact input data quality to which the GHOST system is not completely immune, accuracy of epidemiological inferences generated by GHOST may be affected. Here, we analyze the data submitted to the GHOST system during its pilot phase to assess the nature of the data and to identify common quality concerns that can be detected and corrected automatically. Results The GHOST quality control filters were individually examined, and quality failure rates were measured for all samples, including negative controls. New filters were developed and introduced to detect primer dimers, loss of specimen-specific product, or short products. The genotyping tool was adjusted to improve the accuracy of subtype calls. The identification of “chordless” cycles in a transmission network from data generated with known laboratory-based quality concerns allowed for further improvement of transmission detection by GHOST in surveillance settings. Parameters derived to detect actionable common quality control anomalies were incorporated into the automatic quality control module that rejects data depending on the magnitude of a quality problem, and warns and guides users in performing correctional actions. The guiding responses generated by the system are tailored to the GHOST laboratory protocol. Conclusions Several new quality control problems were identified in MiSeq data submitted to GHOST and used to improve protection of the system from erroneous data and users from erroneous inferences. The GHOST system was upgraded to include identification of causes of erroneous data and recommendation of corrective actions to laboratory users.
- Published
- 2018
9. Fast estimation of genetic relatedness between members of heterogeneous populations of closely related genomic variants
- Author
-
Tsyvina, Viachaslau, primary, Campo, David S., additional, Sims, Seth, additional, Zelikovsky, Alex, additional, Khudyakov, Yury, additional, and Skums, Pavel, additional
- Published
- 2018
- Full Text
- View/download PDF
10. Automated quality control for a molecular surveillance system
- Author
-
Sims, Seth, primary, Longmire, Atkinson G., additional, Campo, David S., additional, Ramachandran, Sumathi, additional, Medrzycki, Magdalena, additional, Ganova-Raeva, Lilia, additional, Lin, Yulin, additional, Sue, Amanda, additional, Thai, Hong, additional, Zelikovsky, Alexander, additional, and Khudyakov, Yury, additional
- Published
- 2018
- Full Text
- View/download PDF
11. Computational models of liver fibrosis progression for hepatitis C virus chronic infection
- Author
-
Marina Berenguer, Fernando González-Candelas, F. Xavier López-Labrador, Yury Khudyakov, and James Lara
- Subjects
Adult ,Genetic Markers ,Liver Cirrhosis ,Male ,medicine.medical_treatment ,Hepacivirus ,Hepatitis C virus ,Liver transplantation ,medicine.disease_cause ,Biochemistry ,Liver disease ,chemistry.chemical_compound ,Viral Proteins ,Structural Biology ,Artificial Intelligence ,medicine ,Humans ,Computer Simulation ,NS5B ,Molecular Biology ,Aged ,biology ,Base Sequence ,Genetic heterogeneity ,Applied Mathematics ,Research ,fungi ,virus diseases ,Computational Biology ,Bayes Theorem ,Middle Aged ,medicine.disease ,biology.organism_classification ,Virology ,digestive system diseases ,Computer Science Applications ,Liver Transplantation ,chemistry ,Genetic marker ,Hepatocellular carcinoma ,Immunology ,Disease Progression ,Female - Abstract
Background Chronic infection with hepatitis C virus (HCV) is a risk factor for liver diseases such as fibrosis, cirrhosis and hepatocellular carcinoma. HCV genetic heterogeneity was hypothesized to be associated with severity of liver disease. However, no reliable viral markers predicting disease severity have been identified. Here, we report the utility of sequences from 3 HCV 1b genomic regions, Core, NS3 and NS5b, to identify viral genetic markers associated with fast and slow rate of fibrosis progression (RFP) among patients with and without liver transplantation (n = 42). Methods A correlation-based feature selection (CFS) method was used to detect and identify RFP-relevant viral markers. Machine-learning techniques, linear projection (LP) and Bayesian Networks (BN), were used to assess and identify associations between the HCV sequences and RFP. Results Both clustering of HCV sequences in LP graphs using physicochemical properties of nucleotides and BN analysis using polymorphic sites showed similarities among HCV variants sampled from patients with a similar RFP, while distinct HCV genetic properties were found associated with fast or slow RFP. Several RFP-relevant HCV sites were identified. Computational models parameterized using the identified sites accurately associated HCV strains with RFP in 70/30 split cross-validation (90-95% accuracy) and in validation tests (85-90% accuracy). Validation tests of the models constructed for patients with or without liver transplantation suggest that the RFP-relevant genetic markers identified in the HCV Core, NS3 and NS5b genomic regions may be useful for the prediction of RFP regardless of transplant status of patients. Conclusions The apparent strong genetic association to RFP suggests that HCV genetic heterogeneity has a quantifiable effect on severity of liver disease, thus presenting opportunity for developing genetic assays for measuring virulence of HCV strains in clinical and public health settings.
- Published
- 2014
12. Modeling the functional state of the reverse transcriptase of hepatitis B virus and its application to probing drug-protein interaction
- Author
-
Xu, Xiaojun, primary, Thai, Hong, additional, Kitrinos, Kathryn M., additional, Xia, Guoliang, additional, Gaggar, Anuj, additional, Paulson, Matthew, additional, Ganova-Raeva, Lilia, additional, Khudyakov, Yury, additional, and Lara, James, additional
- Published
- 2016
- Full Text
- View/download PDF
13. Reconstruction of viral population structure from next-generation sequencing data using multicommodity flows
- Author
-
Pavel Skums, Bassam Tork, Yuri Khudyakov, Nicholas Mancuso, Ion I. Mandoiu, Alexander Artyomenko, and Alexander Zelikovsky
- Subjects
Population structure ,Population ,Genome, Viral ,Hepacivirus ,Viral quasispecies ,Computational biology ,Biology ,Biochemistry ,DNA sequencing ,03 medical and health sciences ,Structural Biology ,RNA Viruses ,education ,Molecular Biology ,030304 developmental biology ,Genetics ,0303 health sciences ,education.field_of_study ,Sequence Analysis, RNA ,Methodology Article ,Applied Mathematics ,030302 biochemistry & molecular biology ,Immune escape ,Genetic Variation ,Amplicon ,3. Good health ,Computer Science Applications ,Viral evolution ,DNA microarray ,Algorithms - Abstract
Highly mutable RNA viruses exist in infected hosts as heterogeneous populations of genetically close variants known as quasispecies. Next-generation sequencing (NGS) allows for analysing a large number of viral sequences from infected patients, presenting a novel opportunity for studying the structure of a viral population and understanding virus evolution, drug resistance and immune escape. Accurate reconstruction of genetic composition of intra-host viral populations involves assembling the NGS short reads into whole-genome sequences and estimating frequencies of individual viral variants. Although a few approaches were developed for this task, accurate reconstruction of quasispecies populations remains greatly unresolved. Two new methods, AmpMCF and ShotMCF, for reconstruction of the whole-genome intra-host viral variants and estimation of their frequencies were developed, based on Multicommodity Flows (MCFs). AmpMCF was designed for NGS reads obtained from individual PCR amplicons and ShotMCF for NGS shotgun reads. While AmpMCF, based on covering formulation, identifies a minimal set of quasispecies explaining all observed reads, ShotMCS, based on packing formulation, engages the maximal number of reads to generate the most probable set of quasispecies. Both methods were evaluated on simulated data in comparison to Maximum Bandwidth and ViSpA, previously developed state-of-the-art algorithms for estimating quasispecies spectra from the NGS amplicon and shotgun reads, respectively. Both algorithms were accurate in estimation of quasispecies frequencies, especially from large datasets. The problem of viral population reconstruction from amplicon or shotgun NGS reads was solved using the MCF formulation. The two methods, ShotMCF and AmpMCF, developed here afford accurate reconstruction of the structure of intra-host viral population from NGS reads. The implementations of the algorithms are available at http://alan.cs.gsu.edu/vira.html (AmpMCF) and http://alan.cs.gsu.edu/NGS/?q=content/shotmcf (ShotMCF).
- Published
- 2013
14. Computational models of liver fibrosis progression for hepatitis C virus chronic infection
- Author
-
Lara, James, primary, López-Labrador, F Xavier, additional, González-Candelas, Fernando, additional, Berenguer, Marina, additional, and Khudyakov, Yury E, additional
- Published
- 2014
- Full Text
- View/download PDF
15. Reconstruction of viral population structure from next-generation sequencing data using multicommodity flows
- Author
-
Skums, Pavel, primary, Mancuso, Nicholas, additional, Artyomenko, Alexander, additional, Tork, Bassam, additional, Mandoiu, Ion, additional, Khudyakov, Yury, additional, and Zelikovsky, Alex, additional
- Published
- 2013
- Full Text
- View/download PDF
16. Modeling the functional state of the reverse transcriptase of hepatitis B virus and its application to probing drug-protein interaction.
- Author
-
Xiaojun Xu, Hong Thai, Kitrinos, Kathryn M., Guoliang Xia, Gaggar, Anuj, Paulson, Matthew, Ganova-Raeva, Lilia, Khudyakov, Yury, and Lara, James
- Subjects
HEPATITIS B virus ,REVERSE transcriptase ,PROTEIN-drug interactions ,MOLECULAR dynamics ,POLYMERASES - Abstract
Background: Herein, the predicted atomic structures of five representative sequence variants of the reverse transcriptase protein (RT) of hepatitis B virus (HBV), sampled from patients with rapid or slow response to tenofovir disoproxil fumarate (TDF) treatment, have been examined to identify structural variations between them in order to assess structural and functional properties of HBV-RT variants associated with the differential responses to TDF treatment. Results: We utilized a hybrid computational approach to model the atomistic structures of HBV-RT/DNA-RNA/dATP and HBV-RT/DNA-RNA/TFV-DP (tenofovir diphosphate) complexes with the native hybrid DNA-RNA substrate in place. Multi-nanosecond molecular dynamics (MD) simulations of HBV-RT/DNA-RNA/dATP complexes revealed strong coupling of the natural nucleotide substrate, dATP, to the active site of the RT, and the differential involvement of the two putative magnesium cations (Mg
2+ ) at the active site, whereby one Mg2+ directly bridges the interaction between dATP and HBV-RT and the other serves as a coordinator to maintain an optimal configuration of the active site. Solvated interaction energy (SIE) calculated in MD simulations of HBV-RT/DNA-RNA/TFV-DP complexes indicate no differential binding affinity between TFV-DP and HBV-RT variants identified in patients with slow or rapid response to TDF treatment. Conclusion: The predicted atomic structures accurately represent functional states of HBV-RT. The equivalent interaction between TFV-DP and each examined HBV-RT variants suggests that binding affinity of TFV-DP to HBV-RT is not associated with delayed viral clearance. [ABSTRACT FROM AUTHOR]- Published
- 2016
- Full Text
- View/download PDF
17. Modeling the functional state of the reverse transcriptase of hepatitis B virus and its application to probing drug-protein interaction
- Author
-
Guo-liang Xia, James Lara, Lilia Ganova-Raeva, Kathryn M. Kitrinos, Hong Thai, Xiao-Jun Xu, Matthew Paulson, Yury Khudyakov, and Anuj Gaggar
- Subjects
Models, Molecular ,0301 basic medicine ,Hepatitis B virus ,Stereochemistry ,medicine.disease_cause ,Antiviral Agents ,Biochemistry ,Viral Proteins ,03 medical and health sciences ,chemistry.chemical_compound ,Structural Biology ,Catalytic Domain ,Drug Resistance, Viral ,Reverse transcriptase ,medicine ,Humans ,Drug Interactions ,Magnesium ,Nucleotide ,Tenofovir ,Molecular Biology ,Solvated interaction energy (SIE) ,Ions ,chemistry.chemical_classification ,biology ,Research ,Applied Mathematics ,Active site ,Substrate (chemistry) ,RNA ,virus diseases ,Molecular dynamics (MD) ,RNA-Directed DNA Polymerase ,Interaction energy ,Hybrid structure modeling ,Hepatitis B ,Virology ,Computer Science Applications ,030104 developmental biology ,chemistry ,Drug resistance ,biology.protein ,Reverse Transcriptase Inhibitors ,Thermodynamics ,DNA - Abstract
Background Herein, the predicted atomic structures of five representative sequence variants of the reverse transcriptase protein (RT) of hepatitis B virus (HBV), sampled from patients with rapid or slow response to tenofovir disoproxil fumarate (TDF) treatment, have been examined to identify structural variations between them in order to assess structural and functional properties of HBV-RT variants associated with the differential responses to TDF treatment. Results We utilized a hybrid computational approach to model the atomistic structures of HBV-RT/DNA-RNA/dATP and HBV-RT/DNA-RNA/TFV-DP (tenofovir diphosphate) complexes with the native hybrid DNA-RNA substrate in place. Multi-nanosecond molecular dynamics (MD) simulations of HBV-RT/DNA-RNA/dATP complexes revealed strong coupling of the natural nucleotide substrate, dATP, to the active site of the RT, and the differential involvement of the two putative magnesium cations (Mg2+) at the active site, whereby one Mg2+ directly bridges the interaction between dATP and HBV-RT and the other serves as a coordinator to maintain an optimal configuration of the active site. Solvated interaction energy (SIE) calculated in MD simulations of HBV-RT/DNA-RNA/TFV-DP complexes indicate no differential binding affinity between TFV-DP and HBV-RT variants identified in patients with slow or rapid response to TDF treatment. Conclusion The predicted atomic structures accurately represent functional states of HBV-RT. The equivalent interaction between TFV-DP and each examined HBV-RT variants suggests that binding affinity of TFV-DP to HBV-RT is not associated with delayed viral clearance. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1116-4) contains supplementary material, which is available to authorized users.
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.