27 results on '"Ochem"'
Search Results
2. The openOCHEM consensus model is the best-performing open-source predictive model in the First EUOS/SLAS joint compound solubility challenge
- Author
-
Andrea Hunklinger, Peter Hartog, Martin Šícho, Guillaume Godin, and Igor V. Tetko
- Subjects
Solubility prediction ,Kaggle challenge ,OCHEM ,Consensus ,Descriptor based models ,Representation learning ,Medicine (General) ,R5-920 ,Biotechnology ,TP248.13-248.65 - Abstract
The EUOS/SLAS challenge aimed to facilitate the development of reliable algorithms to predict the aqueous solubility of small molecules using experimental data from 100 K compounds. In total, hundred teams took part in the challenge to predict low, medium and highly soluble compounds as measured by the nephelometry assay. This article describes the winning model, which was developed using the publicly available Online CHEmical database and Modeling environment (OCHEM) available on the website https://ochem.eu/article/27. We describe in detail the assumptions and steps used to select methods, descriptors and strategy which contributed to the winning solution. In particular we show that consensus based on 28 models calculated using descriptor-based and representation learning methods allowed us to obtain the best score, which was higher than those based on individual approaches or consensus models developed using each individual approach. A combination of diverse models allowed us to decrease both bias and variance of individual models and to calculate the highest score. The model based on Transformer CNN contributed the best individual score thus highlighting the power of Natural Language Processing (NLP) methods. The inclusion of information about aleatoric uncertainty would be important to better understand and use the challenge data by the contestants.
- Published
- 2024
- Full Text
- View/download PDF
3. Intelligent consensus predictions of bioconcentration factor of pharmaceuticals using 2D and fragment-based descriptors
- Author
-
Kabiruddin Khan, Vinay Kumar, Erika Colombo, Anna Lombardo, Emilio Benfenati, and Kunal Roy
- Subjects
BCF ,DrugBank ,Ecotoxicity ,ECOSAR ,OCHEM ,Pharmaceuticals ,Environmental sciences ,GE1-350 - Abstract
Bioconcentration factors (BCFs) are markers of chemical substance accumulation in organisms, and they play a significant role in determining the environmental risk of various chemicals. Experiments to obtain BCFs are expensive and time-consuming; therefore, it is better to estimate BCF early in the chemical development process. The current research aims to evaluate the ecotoxicity potential of 122 pharmaceuticals and identify possible important structural attributes using BCF as the determining feature against a group of fish species. We have calculated the theoretical 2D descriptors from the OCHEM platform and SiRMS descriptor calculating software. The regression-based quantitative structure–property relationship (QSPR) modeling was used to identify the chemical features responsible for acute fish bioconcentration. Multiple models with the “intelligent consensus” algorithm were employed for the regression-based approach improving the predictive ability of the models. To ensure the robustness and interpretability of the developed models, rigorous validation was performed employing various statistical internal and external validation metrics. From the developed models, it can be specified that the presence of large lipophilic and electronegative moieties greatly enhances the bioaccumulative potential of pharmaceuticals, whereas the hydrophilic characteristics have shown a negative impact on BCF. Furthermore, the developed models were employed to screen the DrugBank database (https://go.drugbank.com/) for assessing the BCF properties of the entire database. The evidence acquired from the modeled descriptors might be used for aquatic risk assessment in the future, with the added benefit of providing an early caution of their probable negative impact on aquatic ecosystems for regulatory purposes.
- Published
- 2022
- Full Text
- View/download PDF
4. Theoretical and Experimental Studies of Phosphonium Ionic Liquids as Potential Antibacterials of MDR Acinetobacter baumannii.
- Author
-
Metelytsia, Larysa O., Hodyna, Diana M., Semenyuta, Ivan V., Kovalishyn, Vasyl V., Rogalsky, Sergiy P., Derevianko, Kateryna Yu, Brovarets, Volodymyr S., and Tetko, Igor V.
- Subjects
ACINETOBACTER baumannii ,IONIC liquids ,ANTIBACTERIAL agents ,ALKYL bromides - Abstract
A previously developed model to predict antibacterial activity of ionic liquids against a resistant A. baumannii strain was used to assess activity of phosphonium ionic liquids. Their antioxidant potential was additionally evaluated with newly developed models, which were based on public data. The accuracy of the models was rigorously evaluated using cross-validation as well as test set prediction. Six alkyl triphenylphosphonium and alkyl tributylphosphonium bromides with the C
8 , C10 , and C12 alkyl chain length were synthesized and tested in vitro. Experimental studies confirmed their activity against A. baumannii as well as showed pronounced antioxidant properties. These results suggest that phosphonium ionic liquids could be promising lead structures against A. baumannii. [ABSTRACT FROM AUTHOR]- Published
- 2022
- Full Text
- View/download PDF
5. Highly Accurate Filters to Flag Frequent Hitters in AlphaScreen Assays by Suggesting their Mechanism.
- Author
-
Ghosh, Dipan, Koch, Uwe, Hadian, Kamyar, Sattler, Michael, and Tetko, Igor V.
- Subjects
MACHINE learning ,ARTIFICIAL intelligence ,MACHINE performance ,CHEMINFORMATICS - Abstract
AlphaScreen is one of the most widely used assay technologies in drug discovery due to its versatility, dynamic range and sensitivity. However, a presence of false positives and frequent hitters contributes to difficulties with an interpretation of measured HTS data. Although filters do exist to identify frequent hitters for AlphaScreen, they are frequently based on privileged scaffolds. The development of such filters is time consuming and requires deep domain knowledge. Recently, machine learning and artificial intelligence methods are emerging as important tools to advance drug discovery and chemoinformatics, including their application to identification of frequent hitters in screening assays. However, the relative performance and complementarity of the Machine Learning and scaffold‐based techniques has not yet been comprehensively compared. In this study, we analysed filters based on the privileged scaffolds with filters built using machine learning. Our results demonstrate that machine‐learning methods provide more accurate filters for identification of frequent hitters in AlphaScreen assays than scaffold‐based methods and can be easily redeveloped once new data are measured. We present highly accurate models to identify frequent hitters in AlphaScreen assays. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
6. Design of new imidazole derivatives with anti-HCMV activity: QSAR modeling, synthesis and biological testing.
- Author
-
Kovalishyn, Vasyl, Zyabrev, Volodymyr, Kachaeva, Maryna, Ziabrev, Kostiantyn, Keith, Kathy, Harden, Emma, Hartline, Caroll, James, Scott H., and Brovarets, Volodymyr
- Subjects
- *
BIOSYNTHESIS , *IMIDAZOLES , *QSAR models , *HUMAN cytomegalovirus , *CHEMICAL libraries , *CHEMICAL models - Abstract
The problem of designing new antiviral drugs against Human Cytomegalovirus (HCMV) was addressed using the Online Chemical Modeling Environment (OCHEM). Data on compound antiviral activity to human organisms were collected from the literature and uploaded in the OCHEM database. The predictive ability of the regression models was tested through cross-validation, giving coefficient of determination q2 = 0.71–0.76. The validation of the models using an external test set proved that the models can be used to predict the activity of newly designed compounds with reasonable accuracy within the applicability domain (q2 = 0.70–0.74). The models were applied to screen a virtual chemical library of imidazole derivatives, which was designed to have antiviral activity. The six most promising compounds were identified, synthesized and their antiviral activities against HCMV were evaluated in vitro. However, only two of them showed some activity against the HCMV AD169 strain. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
7. Theoretical and Experimental Studies of Phosphonium Ionic Liquids as Potential Antibacterials of MDR Acinetobacter baumannii
- Author
-
Larysa O. Metelytsia, Diana M. Hodyna, Ivan V. Semenyuta, Vasyl V. Kovalishyn, Sergiy P. Rogalsky, Kateryna Yu Derevianko, Volodymyr S. Brovarets, and Igor V. Tetko
- Subjects
phosphonium ionic liquids ,antibacterial ,Acinetobacter baumannii ,antioxidants ,OCHEM ,QSAR ,Therapeutics. Pharmacology ,RM1-950 - Abstract
A previously developed model to predict antibacterial activity of ionic liquids against a resistant A. baumannii strain was used to assess activity of phosphonium ionic liquids. Their antioxidant potential was additionally evaluated with newly developed models, which were based on public data. The accuracy of the models was rigorously evaluated using cross-validation as well as test set prediction. Six alkyl triphenylphosphonium and alkyl tributylphosphonium bromides with the C8, C10, and C12 alkyl chain length were synthesized and tested in vitro. Experimental studies confirmed their activity against A. baumannii as well as showed pronounced antioxidant properties. These results suggest that phosphonium ionic liquids could be promising lead structures against A. baumannii.
- Published
- 2022
- Full Text
- View/download PDF
8. In silico and in vitro studies of a number PILs as new antibacterials against MDR clinical isolate Acinetobacter baumannii.
- Author
-
Trush, Maria M., Kovalishyn, Vasyl, Hodyna, Diana, Golovchenko, Olexandr V., Chumachenko, Svitlana, Tetko, Igor V., Brovarets, Volodymyr S., and Metelytsia, Larysa
- Subjects
- *
ACINETOBACTER baumannii , *IN vitro studies , *ANTIBACTERIAL agents , *CHEMICAL libraries , *DIGITAL libraries , *CHEMICAL models - Abstract
QSAR analysis of a set of previously synthesized phosphonium ionic liquids (PILs) tested against Gram‐negative multidrug‐resistant clinical isolate Acinetobacter baumannii was done using the Online Chemical Modeling Environment (OCHEM). To overcome the problem of overfitting due to descriptor selection, fivefold cross‐validation with variable selection in each step of the model development was applied. The predictive ability of the classification models was tested by cross‐validation, giving balanced accuracies (BA) of 76%–82%. The validation of the models using an external test set proved that the models can be used to predict the activity of newly designed compounds with a reasonable accuracy within the applicability domain (BA = 83%–89%). The models were applied to screen a virtual chemical library with expected activity of compounds against MDR Acinetobacter baumannii. The eighteen most promising compounds were identified, synthesized, and tested. Biological testing of compounds was performed using the disk diffusion method in Mueller‐Hinton agar. All tested molecules demonstrated high anti‐A. baumannii activity and different toxicity levels. The developed classification SAR models are freely available online at http://ochem.eu/article/113921 and could be used by scientists for design of new more effective antibiotics. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
9. Rational design of isonicotinic acid hydrazide derivatives with antitubercular activity: Machine learning, molecular docking, synthesis and biological testing.
- Author
-
Kovalishyn, Vasyl, Grouleff, Julie, Semenyuta, Ivan, Sinenko, Vitaliy O., Slivchuk, Sergiy R., Hodyna, Diana, Brovarets, Volodymyr, Blagodatny, Volodymyr, Poda, Gennady, Tetko, Igor V., and Metelytsia, Larysa
- Subjects
- *
MULTIDRUG-resistant tuberculosis , *ISONIAZID , *ANTITUBERCULAR agents , *MACHINE learning , *MOLECULAR docking , *MYCOBACTERIUM tuberculosis , *THERAPEUTICS - Abstract
The problem of designing new antitubercular drugs against multiple drug‐resistant tuberculosis (MDR‐TB) was addressed using advanced machine learning methods. As there are only few published measurements against MDR‐TB, we collected a large literature data set and developed models against the non‐resistant H37Rv strain. The predictive accuracy of these models had a coefficient of determination q2 = .7–.8 (regression models) and balanced accuracies of about 80% (classification models) with cross‐validation and independent test sets. The models were applied to screen a virtual chemical library, which was designed to have MDR‐TB activity. The seven most promising compounds were identified, synthesized and tested. All of them showed activity against the H37Rv strain, and three molecules demonstrated activity against the MDR‐TB strain. The docking analysis indicated that the discovered molecules could bind enoyl reductase, InhA, which is required in mycobacterial cell wall development. The models are freely available online ( http://ochem.eu/article/103868) and can be used to predict potential anti‐TB activity of new chemicals. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
10. Imidazolium ionic liquids as effective antiseptics and disinfectants against drug resistant S. aureus: In silico and in vitro studies.
- Author
-
Hodyna, Diana, Kovalishyn, Vasyl, Semenyuta, Ivan, Blagodatnyi, Volodymyr, Rogalsky, Sergiy, and Metelytsia, Larisa
- Subjects
- *
IMIDAZOLES , *ANTISEPTICS , *DRUG resistance in bacteria , *IONIC liquids , *DISINFECTION & disinfectants , *STAPHYLOCOCCUS aureus - Abstract
Аbstract This paper describes Quantitative Structure-Activity Relationships (QSAR) studies, molecular docking and in vitro antibacterial activity of several potent imidazolium-based ionic liquids (ILs) against S. aureus ATCC 25923 and its clinical isolate. Small set of 131 ILs was collected from the literature and uploaded in the OCHEM database. QSAR methodologies used Associative Neural Networks and Random Forests (WEKA-RF) methods. The predictive ability of the models was tested through cross-validation, giving cross-validated coefficients q 2 = 0.82–0.87 for regression models and overall prediction accuracies of 80–82.1% for classification models. The proposed QSAR models are freely available online on OCHEM server at https://ochem.eu/article/107364 and can be used for estimation of antibacterial activity of new imidazolium-based ILs. A series of synthesized 1,3-dialkylimidazolium ILs with predicted activity were evaluated in vitro . The high activity of 7 ILs against S. aureus strain and its clinical isolate was measured and thereafter analyzed by the molecular docking to prokaryotic homologue of a eukaryotic tubulin FtsZ. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
11. Modelling the toxicity of a large set of metal and metal oxide nanoparticles using the OCHEM platform.
- Author
-
Kovalishyn, Vasyl, Abramenko, Natalia, Kopernyk, Iryna, Charochkina, Larysa, Metelytsia, Larysa, Tetko, Igor V., Peijnenburg, Willie, and Kustov, Leonid
- Subjects
- *
METALLIC oxides , *NANOPARTICLE toxicity , *QSAR models , *INORGANIC compounds , *CHEMICAL models - Abstract
Inorganic nanomaterials have become one of the new areas of modern knowledge and technology and have already found an increasing number of applications. However, some nanoparticles show toxicity to living organisms, and can potentially have a negative influence on environmental ecosystems. While toxicity can be determined experimentally, such studies are time consuming and costly. Computational toxicology can provide an alternative approach and there is a need to develop methods to reliably assess Quantitative Structure–Property Relationships for nanomaterials (nano-QSPRs). Importantly, development of such models requires careful collection and curation of data. This article overviews freely available nano-QSPR models, which were developed using the Online Chemical Modeling Environment (OCHEM). Multiple data on toxicity of nanoparticles to different living organisms were collected from the literature and uploaded in the OCHEM database. The main characteristics of nanoparticles such as chemical composition of nanoparticles, average particle size, shape, surface charge and information about the biological test species were used as descriptors for developing QSPR models. QSPR methodologies used Random Forests (WEKA-RF), k-Nearest Neighbors and Associative Neural Networks. The predictive ability of the models was tested through cross-validation, giving cross-validated coefficients q 2 = 0.58–0.80 for regression models and balanced accuracies of 65–88% for classification models. These results matched the predictions for the test sets used to develop the models. The proposed nano-QSPR models and uploaded data are freely available online at http://ochem.eu/article/103451 and can be used for estimation of toxicity of new and emerging nanoparticles at the early stages of nanomaterial development. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
12. Modeling of the hERG K+ Channel Blockage Using Online Chemical Database and Modeling Environment (OCHEM).
- Author
-
Li, Xiao, Zhang, Yuan, Li, Huanhuan, and Zhao, Yong
- Subjects
CHEMICAL reactions ,ORGANIC compounds ,DRUG development - Abstract
Human ether-a-go-go related gene (hERG) K+ channel plays an important role in cardiac action potential. Blockage of hERG channel may result in long QT syndrome (LQTS), even cause sudden cardiac death. Many drugs have been withdrawn from the market because of the serious hERG-related cardiotoxicity. Therefore, it is quite essential to estimate the chemical blockage of hERG in the early stage of drug discovery. In this study, a diverse set of 3721 compounds with hERG inhibition data was assembled from literature. Then, we make full use of the Online Chemical Modeling Environment (OCHEM), which supplies rich machine learning methods and descriptor sets, to build a series of classification models for hERG blockage. We also generated two consensus models based on the top-performing individual models. The consensus models performed much better than the individual models both on 5-fold cross validation and external validation. Especially, consensus model II yielded the prediction accuracy of 89.5 % and MCC of 0.670 on external validation. This result indicated that the predictive power of consensus model II should be stronger than most of the previously reported models. The 17 top-performing individual models and the consensus models and the data sets used for model development are available at https://ochem.eu/article/103592. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
13. QSAR modeling studies of a library of Human Tyrosinase inhibitors
- Author
-
Mateus, Cristiano Gabi dos Santos, Abreu, Rui M.V., and Barros, Lillian
- Subjects
abTYR ,QSAR ,hsTYR ,OCHEM ,ZINC15 ,Melanin ,COCONUT ,PYTHON ,Domínio/Área Científica::Engenharia e Tecnologia::Outras Engenharias e Tecnologias ,PyQSAR ,Molecular descriptor - Abstract
Melanogenesis is the chemical process responsible for synthesizing melanin, which occurs in melanocytes, in subcellular lysosome-like organelles called melanosomes. Melanin plays a vital role in protecting the skin from damage caused by ultraviolet rays. However, excess melanin production or abnormal distribution can cause various pigmentation disorders, such as over-tanning, age spots, and melasma. Skin disorders like these, have prompted the development of skin-whitening compounds to reduce melanin content. Furthermore, inhibition of melanin synthesis is considered a valid therapeutic strategy for treating advanced melanotic melanomas Human tyrosinase (hsTYR) is the most important enzyme involved in the melanogenesis process, as it catalyzes, at least, its first two steps. Tyrosinase from the white button mushroom Agaricus bisporus (abTYR) has been widely available at low cost from commercial sources for several decades, whereas hsTYR is still expensive and difficult to produce. The importance of discovering more and better hsTYR inhibitors has been widely discussed, as when tested against hsTYR, several abTYR inhibitors provide disappointing results, including some of the most extensively used depigmenting compounds now used in dermocosmetics. An in silico methodology that can be used to predict compound bioactivities is QSAR (quantitative structure-activity relationship) modelling. A QSAR model tries to find correlations between a biological activity of interest and molecular descriptors calculated from the compound structure. In this work, a QSAR model was developed to predict hsTYR inhibition activity using the PYTHON computer language and its PyQSAR package. To develop a QSAR model, a library of 196 known hsTYR inhibitors was gathered, and compounds were divided into 6 groups according to their scaffold structure. A total of 33 QSAR models were prepared using different combinations of the defined groups and different pools of molecular descriptors. QSAR model 32 was selected for further use as it presented good statistical robustness and had the highest number of compounds, 41 in total. Of the 28,933 molecular descriptors calculated by the OCHEM platform for the 41 compounds used, PyQSAR selected 4 to be used in the model: C-026; DISSM2C; MaxdssC; WHALES90_Rem. The statistical data obtained after the validation of the QSAR model by cross-validation was excellent, namely the determination coefficient (R2CV=0.9147), the value of the square root of the mean error (RMSE CV=0.1878) and the mean value of the score of the multiple linear regression method (Q2CV=0.8922). This QSAR model originates a mathematical equation that allows the prediction of hsTYR inhibition activity by new compounds with similar structures. A library of natural compounds, with a structure similar to those used to develop QSAR model 32, was created using the COCONUT database of natural compounds. A total of 1,628 natural compounds were gathered, their molecular descriptors were calculated, and the QSAR model 32 equation was applied. The results are displayed on a website and can be viewed by accessing the URL http://esa.ipb.pt/qsar/. The ZINC15 database was used to determine which of the compounds in the developed natural compound library would be available for purchase after predicting the hsTYR inhibitory activity of each compound in the library. A total of 18 different compounds were bought from different companies. To evaluate these compounds experimental ability to inhibit hsTYR and thus validate QSAR model 32, the compounds will be tested against this enzyme. If those compounds activity is confirmed, they may be used in cosmeceutical applications. A melanogénese é o processo químico responsável pela síntese da melanina, que ocorre nos melanócitos, em organelos subcelulares semelhantes aos lisossomas chamados melanossomas. A melanina desempenha um papel vital na proteção da pele dos danos causados pelos raios ultravioleta. No entanto, a produção excessiva de melanina ou distribuição anormal pode causar vários distúrbios de pigmentação, como bronzeamento excessivo, manchas senis e melasma. Distúrbios de pele como estes levaram ao desenvolvimento de compostos de clareamento da pele para reduzir o conteúdo de melanina. Além disso, a inibição da síntese de melanina é considerada uma estratégia terapêutica válida para o tratamento de melanomas melanóticos avançados A tirosinase humana (hsTYR) é a enzima mais importante envolvida no processo de melanogénese, pois catalisa, pelo menos, as suas duas primeiras etapas. A tirosinase do cogumelo branco Agaricus bisporus (abTYR) está amplamente disponível a baixo custo em fontes comerciais há várias décadas, enquanto a hsTYR ainda é cara e difícil de produzir. A importância de descobrir mais e melhores inibidores de hsTYR tem sido amplamente discutida, pois quando testados contra hsTYR, vários inibidores de abTYR fornecem resultados dececionantes, incluindo alguns dos compostos despigmentantes mais usados atualmente em dermocosméticos. Uma metodologia in silico que pode ser usada para prever bioatividades compostas é a modelação QSAR (quantitative structure-activity relationship). Um modelo QSAR tenta encontrar correlações entre uma atividade biológica de interesse e descritores moleculares calculados a partir da estrutura do composto. Neste trabalho, um modelo QSAR foi desenvolvido para prever a atividade de inibição de hsTYR usando a linguagem de computador PYTHON e seu pacote PyQSAR. Para desenvolver um modelo QSAR, uma biblioteca de 196 inibidores hsTYR conhecidos foi reunida e os compostos foram divididos em 6 grupos de acordo com sua estrutura de base. Um total de 33 modelos QSAR foram preparados usando diferentes combinações dos grupos definidos e diferentes pools de descritores moleculares. O modelo QSAR 32 foi selecionado para uso posterior por apresentar boa robustez estatística e possuir o maior número de compostos, 41 no total. Dos 28 933 descritores moleculares calculados pela plataforma OCHEM para os 41 compostos utilizados, o PyQSAR selecionou 4 para serem utilizados no modelo: C-026; DISSM2C; MaxdssC; WHALES90_Rem. Os dados estatísticos obtidos após a validação do modelo QSAR por validação cruzada foram excelentes, nomeadamente o coeficiente de correlação (R2CV=0,9147), o valor da raiz quadrada do erro médio (RMSE CV=0,1878) e o valor médio da pontuação do método de regressão linear múltipla (Q2CV=0,8922). Este modelo QSAR origina uma equação matemática que permite prever a atividade de inibição de hsTYR por novos compostos com estruturas semelhantes. Uma biblioteca de compostos naturais, com uma estrutura similar àquelas usadas para desenvolver o modelo QSAR 32, foi criada usando o banco de dados de compostos naturais COCONUT. Um total de 1 628 compostos naturais foram recolhidos, os seus descritores moleculares calculados e a equação do modelo QSAR 32 foi aplicada. Os resultados são apresentados num website criado por nós e podem ser visualizados acedendo ao URL http://esa.ipb.pt/qsar/. O banco de dados ZINC15 foi usado para determinar quais compostos na biblioteca de compostos naturais desenvolvidos estariam disponíveis para compra após prever a atividade inibitória de hsTYR de cada composto na biblioteca. Um total de 18 compostos diferentes foram comprados de diferentes empresas. Para avaliar a capacidade experimental destes compostos em inibir a hsTYR e assim validar o modelo QSAR 32, os compostos serão testados contra esta enzima. Caso a atividade desses compostos seja confirmada, eles poderão ser utilizados em aplicações cosmecêuticas.
- Published
- 2022
14. Quinoline Hydrazone Derivatives as New Antibacterials against Multidrug Resistant Strains.
- Author
-
Hodyna D, Kovalishyn V, Romanenko Y, Semenyuta I, Blagodatny V, Kachaeva M, Brazhko O, and Metelytsia L
- Subjects
- Hydrazones pharmacology, Molecular Docking Simulation, Anti-Bacterial Agents pharmacology, Anti-Bacterial Agents chemistry, Microbial Sensitivity Tests, Structure-Activity Relationship, Methicillin-Resistant Staphylococcus aureus, Quinolines pharmacology, Quinolines chemistry
- Abstract
To develop novel antimicrobial agents a series of 2(4)-hydrazone derivatives of quinoline were designed, synthesized and tested. QSAR models of the antibacterial activity of quinoline derivatives were developed by the OCHEM web platform using different machine learning methods. A virtual set of quinoline derivatives was verified with a previously published classification model of anti-E. coli activity and screened using the regression model of anti-S. aureus activity. Selected and synthesized 2(4)-hydrazone derivatives of quinoline exhibited antibacterial activity against the standard and antibiotic-resistant S. aureus and E. coli strains in the range from 15 to 30 mm by the diameter of growth inhibition zones. Molecular docking showed the complex formation of the studied compounds into the catalytic domain of dihydrofolate reductase with an estimated binding affinity from -8.4 to -9.4 kcal/mol., (© 2023 Wiley-VHCA AG, Zurich, Switzerland.)
- Published
- 2023
- Full Text
- View/download PDF
15. Predictive modeling of antibacterial activity of ionic liquids by machine learning methods.
- Author
-
Makarov, D.M., Fadeeva, Yu.A., Safonova, E.A., and Shmukler, L.E.
- Subjects
- *
IONIC liquids , *ANTIBACTERIAL agents , *MACHINE learning , *QSAR models , *PREDICTION models , *STAPHYLOCOCCUS aureus - Abstract
Structural variation and different bioactivity of ionic liquids (ILs) make them highly promising for the development of novel biocides. Application of computational methods to the evaluation of potential antibacterial activity of chemical compounds is a useful, time- and cost-saving tool replacing numerous experimental syntheses. In the present study, quantitative structure–activity relationship (QSAR) modeling is applied to develop models (based on more than 800 data points) aiming to predict the minimal inhibitory concentration (MIC) of ILs against three types of human pathogens – Staphylococcus aureus , Escherichia coli and Pseudomonas aeruginosa. The random forest model with the AlvaDesc descriptors in general demonstrates the best performance for all the three types of bacteria and is suggested as a final model. To interpret the final model and determine the most significant descriptors, a SHapley Additive exPlanation (SHAP) method was applied. Six amino acid ILs, which were synthesized for the first time, and five halogenide ionic liquids purchased, all based on 1-alkyl-3methylimidozolium cations with different alkyl chain lengths, C 10 , C 12 and C 14 , are tested in vitro and used to validate the developed QSAR models. The data sets and developed model are available free of charge at http://ochem.eu/article/147386. [Display omitted] • QSAR model to predict the MIC values of ILs against three bacteria was developed. • We compiled the largest dataset (>800) on the MIC values of ILs against 3 bacteria. • IL properties which mostly governed the MIC value were determined with SHAP method. • The final model was validated with 11 ILs with 1-alkyl-3-methylimidazolium cations. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
16. Machine learning models for phase transition and decomposition temperature of ionic liquids.
- Author
-
Makarov, Dmitriy M., Fadeeva, Yuliya A., Shmukler, Liudmila E., and Tetko, Igor V.
- Subjects
- *
PHASE transitions , *TRANSITION temperature , *GLASS transition temperature , *IONIC liquids , *MELTING points - Abstract
[Display omitted] • QSPR modeling of glass transition, melting and decomposition temperatures of ionic liquids was performed. • Rigorous component validation protocol provided better agreement of statistical parameters for the training and test sets. • Explanation of the models using statistical analysis of functional groups and Molecular Matched Pairs was provided. • Public freely-accessible models to predict phase transition and decomposition temperatures of ILs were contributed. The working temperature range of Ionic Liquids (IL) is determined by their liquid state range, where the IL's melting point/glass transition and decomposition temperatures define the lower and upper limits of the range, respectively. Computational prediction of the structure of new ILs with required properties, e.g. which can exist in a liquid state at room temperature and are stable up to high temperatures, is a much less time-consuming and less expensive approach than stepwise synthesis and experimental examination of all presupposed ILs. Therefore, in the present work the quantitative structure–property relationship (QSPR) models were developed to predict the glass transition temperature (T g), melting point (T m), and decomposition temperatures (T d) of ILs. We showed that a use of component validation protocol provided a better agreement of statistical parameters for the training and test sets. The performance of various modeling algorithms and descriptor sets was discussed and compared and advantages of descriptor-less as well as multi-task modeling were shown. An explanation of the models using statistical analysis of functional groups and Molecular Matched Pairs were provided for the mixtures for the first time. The experimental data and models, which are the first publicly available models for prediction of transition and decomposition temperatures of ILs, are publicly available online at http://ochem.eu/article/140250. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
17. Can machine learning methods accurately predict the molar absorption coefficient of different classes of dyes?
- Author
-
Ksenofontov, Alexander A., Lukanov, Michail M., and Bocharov, Pavel S.
- Subjects
- *
ABSORPTION coefficients , *MACHINE learning , *RANDOM forest algorithms - Abstract
[Display omitted] • A machine learning model for molar absorption coefficient prediction was created. • The model was trained on data more 20,000 unique dye molecules. • The model is freely available as https://ochem.eu/article/145413. In this article, we provide a convenient tool for all researchers to predict the value of the molar absorption coefficient for a wide number of dyes without any computer costs. The new model is based on RFR method (ALogPS, OEstate + Fragmentor + QNPR) and is able to predict the molar absorption coefficient with an accuracy (5-fold cross-validation RMSE) of 0.26 log unit. This accuracy was achieved due to the fact that the model was trained on data for more than 20,000 unique dye molecules. To our knowledge, this is the first model for predicting the molar absorption coefficient trained on such a large and diverse set of dyes. The model is available at https://ochem.eu/article/145413. We hope that the new model will allow researchers to predict dyes with practically significant spectral characteristics and verify existing experimental data. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
18. Prediction-driven matched molecular pairs to interpret QSARs and aid the molecular optimization process.
- Author
-
Sushko, Yurii, Novotarskyi, Sergii, Körner, Robert, Vogt, Joachim, Abdelaziz, Ahmed, and Tetko, Igor V.
- Subjects
- *
QSAR models , *PHARMACEUTICAL chemistry , *TOXICOLOGY of water pollution , *DRUG design , *CHEMICAL amplification - Abstract
Background QSAR is an established and powerful method for cheap in silico assessment of physicochemical properties and biological activities of chemical compounds. However, QSAR models are rather complex mathematical constructs that cannot easily be interpreted. Medicinal chemists would benefit from practical guidance regarding which molecules to synthesize. Another possible approach is analysis of pairs of very similar molecules, so-called matched molecular pairs (MMPs). Such an approach allows identification of molecular transformations that affect particular activities (e.g. toxicity). In contrast to QSAR, chemical interpretation of these transformations is straightforward. Furthermore, such transformations can give medicinal chemists useful hints for the hit-to-lead optimization process. Results The current study suggests a combination of QSAR and MMP approaches by finding MMP transformations based on QSAR predictions for large chemical datasets. The study shows that such an approach, referred to as prediction-driven MMP analysis, is a useful tool for medicinal chemists, allowing identification of large numbers of "interesting" transformations that can be used to drive the molecular optimization process. All the methodological developments have been implemented as software products available online as part of OCHEM (http://ochem.eu/). Conclusions The prediction-driven MMPs methodology was exemplified by two use cases: modelling of aquatic toxicity and CYP3A4 inhibition. This approach helped us to interpret QSAR models and allowed identification of a number of "significant" molecular transformations that affect the desired properties. This can facilitate drug design as a part of molecular optimization process. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
19. Meso-carbazole substituted porphyrin complexes: Synthesis and spectral properties according to experiment, DFT calculations and the prediction by machine learning methods.
- Author
-
Bichan, N.G., Ovchenkova, E.N., Ksenofontov, A.A., Mozgova, V.A., Gruzdev, M.S., Chervonova, U.V., Shelaev, I.V., and Lomova, T.N.
- Subjects
- *
MACHINE learning , *CARBAZOLE , *PORPHYRINS , *ZINC porphyrins , *MASS spectrometry , *OPTICAL properties , *FORECASTING - Abstract
In this work, two porphyrins bearing [3,6-di-tert-butyl-carbazol-9-yl-benzoyloxy)]- (1) and [3,6-bis(3′,6′-di(tert -butyl)-9′H-carbazol)-9H-carbazolbenzoyloxy]phenyl (2) groups and their zinc (1Zn, 2Zn)/cobalt (1Co, 2Co) complexes were synthesized and studied by experimental and theoretical methods. The spectral parameters (UV–vis absorption/femtosecond transient absorption/fluorescence, IR, 1H NMR, mass spectra) of the compounds were observed. Their structure was also examined by the DFT method. The comparative study of the UV–vis spectra by the DFT/TDDFT calculation, and by the prediction of the Soret band maximum using machine learning methods, namely the consensus models based on the data of over 10000 porphyrin free bases and their complexes with metals was performed. The absorption maximum wavelength (Soret band) of porphyrins predicted with machine learning methods showed better agreement with the experimental data compared to the DFT/TDDFT calculation. The final consensus model is freely available at https://ochem.eu/article/145340 and can be used by the other researchers to obtain new functionalized porphyrins with desired optical properties. • New meso-carbazole substituted porphyrin complexes were synthesized. • They were characterized by UV–visible, infrared, fluorescence, transient absorption spectroscopy, and mass spectrometry. • Prediction of the Soret band maximum by DFT calculations and machine learning methods. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
20. Benchmarking machine learning methods for modeling physical properties of ionic liquids.
- Author
-
Baskin, Igor, Epshtein, Alon, and Ein-Eli, Yair
- Subjects
- *
MACHINE learning , *IONIC liquids , *PATTERN matching , *NATURAL language processing , *MELTING points , *REFRACTIVE index , *SURFACE tension - Abstract
• A large-scale comparison of numerous QSPR models for six temperature-dependent properties of ionic liquids was performed. • None of the machine learning methods and molecular representations lead to the best models in all cases. • Only systematic enumeration reveals the best combinations of machine learning methods and molecular representations. • Transformer neural networks tend to provide better predictions of the properties of ionic liquids than other methods. • The properties of ionic liquids at different temperatures can be predicted using multi-task learning. The great importance of the ability to quantitatively predict the properties of ionic liquids (ILs) using quantitative structure–property relationships (QSPR) models necessitates the understanding of which modern machine learning (ML) methods in combination with which types of molecular representations are preferable to use for this purpose. To address this problem, a large-scale benchmarking study of QSPR models built by combining three traditional ML methods and neural networks with seven different architectures with five types of molecular representations (in the form of either numerical molecular descriptors or SMILES text strings) to predict six important physical properties of ILs (density, electrical conductance, melting point, refractive index, surface tension, and viscosity) was carried out. The datasets include from 407 to 1204 diverse ILs composed of various organic and inorganic ions. QSPR models for predicting the properties of ILs at eight different temperatures were built using multi-task learning. The best combinations of ML methods and molecular representations were identified for each of the properties. A unified ranking system was introduced to rank and prioritize different ML methods and molecular representations. It was shown in this study that on average: (i) nonlinear ML methods perform much better than linear ones, (ii) neural networks perform better than traditional ML methods, (iii) Transformers, which are actively used in natural language processing (NLP), perform better than other types of neural networks due to the advanced ability to analyze chemical structures of ILs encoded into SMILES text strings. A special "component-wise" cross-validation scheme was applied to assess how much the predictive performance deteriorates for the ILs composed of cations and anions that are not present in the dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
21. In silico and in vitro studies of a number PILs as new antibacterials against MDR clinical isolate Acinetobacter baumannii
- Author
-
S. A. Chumachenko, Vasyl Kovalishyn, Diana Hodyna, Igor V. Tetko, Volodymyr Brovarets, Larysa Metelytsia, Maria M. Trush, and Olexandr V. Golovchenko
- Subjects
Acinetobacter baumannii ,Quantitative structure–activity relationship ,Computer science ,In silico ,Drug Evaluation, Preclinical ,Ionic Liquids ,Quantitative Structure-Activity Relationship ,Computational biology ,Microbial Sensitivity Tests ,Overfitting ,01 natural sciences ,Biochemistry ,Chemical library ,Machine Learning ,chemistry.chemical_compound ,Organophosphorus Compounds ,Crustacea ,Drug Resistance, Multiple, Bacterial ,Drug Discovery ,Animals ,Humans ,Computer Simulation ,Agar diffusion test ,Pharmacology ,biology ,010405 organic chemistry ,Acinetobacter Baumanii ,Antibacterial Activity ,Ochem ,Phosphonium Ionic Liquids ,Organic Chemistry ,biology.organism_classification ,0104 chemical sciences ,Anti-Bacterial Agents ,010404 medicinal & biomolecular chemistry ,chemistry ,Test set ,Molecular Medicine ,Databases, Chemical ,Applicability domain - Abstract
QSAR analysis of a set of previously synthesized phosphonium ionic liquids (PILs) tested against Gram-negative multidrug-resistant clinical isolate Acinetobacter baumannii was done using the Online Chemical Modeling Environment (OCHEM). To overcome the problem of overfitting due to descriptor selection, fivefold cross-validation with variable selection in each step of the model development was applied. The predictive ability of the classification models was tested by cross-validation, giving balanced accuracies (BA) of 76%-82%. The validation of the models using an external test set proved that the models can be used to predict the activity of newly designed compounds with a reasonable accuracy within the applicability domain (BA = 83%-89%). The models were applied to screen a virtual chemical library with expected activity of compounds against MDR Acinetobacter baumannii. The eighteen most promising compounds were identified, synthesized, and tested. Biological testing of compounds was performed using the disk diffusion method in Mueller-Hinton agar. All tested molecules demonstrated high anti-A. baumannii activity and different toxicity levels. The developed classification SAR models are freely available online at http://ochem.eu/article/113921 and could be used by scientists for design of new more effective antibiotics.
- Published
- 2020
22. Deep neural network model for highly accurate prediction of BODIPYs absorption.
- Author
-
Ksenofontov, Alexander A., Lukanov, Michail M., Bocharov, Pavel S., Berezin, Michail B., and Tetko, Igor V.
- Subjects
- *
ARTIFICIAL neural networks , *FLUORESCENT dyes , *INTERMOLECULAR interactions , *FORECASTING , *QSAR models - Abstract
[Display omitted] • QSPR model to accurately predict the BODIPYs absorption wavelength was developed; • Largest database of 6000-plus fluorescent dyes was analyzed; • Excellent results for retrospective and prospective validation of BODIPYs; • The model is freely available as https://ochem.eu/article/134921. A possibility to accurately predict the absorption maximum wavelength of BODIPYs was investigated. We found that previously reported models had a low accuracy (40–57 nm) to predict BODIPYs due to the limited dataset sizes and/or number of BODIPYs (few hundreds). New models developed in this study were based on data of 6000-plus fluorescent dyes (including 4000-plus BODIPYs) and the deep neural network architecture. The high prediction accuracy (five-fold cross-validation room mean squared error (RMSE) of 18.4 nm) was obtained using a consensus model, which was more accurate than individual models. This model provided the excellent accuracy (RMSE of 8 nm) for molecules previously synthesized in our laboratory as well as for prospective validation of three new BODIPYs. We found that solvent properties did not significantly influence the model accuracy since only few BODIPYs exhibited solvatochromism. The analysis of large prediction errors suggested that compounds able to have intermolecular interactions with solvent or salts were likely to be incorrectly predicted. The consensus model is freely available at https://ochem.eu/article/134921 and can help the other researchers to accelerate design of new dyes with desired properties. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
23. Beware of proper validation of models for ionic Liquids!
- Author
-
Makarov, D.M., Fadeeva, Yu.A., Shmukler, L.E., and Tetko, I.V.
- Subjects
- *
IONIC liquids , *MODEL validation , *CONVOLUTIONAL neural networks , *MELTING points , *NATURAL language processing , *MACHINE learning - Abstract
• The traditional validation of ionic liquids models based on random training/test split may overestimate accuracy of models. • A rigorous mixtures validation protocol provides better agreement of predicted and calculated performances. • Natural Language Processing (NLP) Transformer-CNN model overperforms traditional descriptor-based machine learning approach. • Public freely-accessible models to predict melting point of ionic liquids are contributed. The melting point (MP) of an ionic liquid (IL) is one of the key physical properties as it determines the lower limit of the IL working temperature range. In this work, we analysed the recently published studies to predict MP of ILs. While we were able to reproduce the statistical parameters reported by the authors, we found that the performance of the models with new test set data was much lower than the reported statistical values. The discrepancy was due to the validation protocol (random split of the initial set into training/test subsets) that did not allow correct estimation of how contributions of individual ions affect the model performance. Using a more rigorous validation protocol we reached good agreement between the training and test set statistical parameters. We strongly suggest using this protocol for proper validation of models for other properties of ILs to avoid reporting overoptimistic statistical parameters. We also showed that the Transformer Convolutional Neural Network, which was based on the representation of molecules as text (SMILES), proposed a model with significantly higher prediction accuracy as compared to those developed using descriptors that were used in the previous studies. The RMSE of this model is 44 °C and the model is applicable to any type of ILs. The data and developed models are publicly available online at http://ochem.eu/article/135195. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
24. Structure-Activity Relationship Modeling and Experimental Validation of the Imidazolium and Pyridinium Based Ionic Liquids as Potential Antibacterials of MDR Acinetobacter baumannii and Staphylococcus aureus.
- Author
-
Semenyuta, Ivan V., Trush, Maria M., Kovalishyn, Vasyl V., Rogalsky, Sergiy P., Hodyna, Diana M., Karpov, Pavel, Xia, Zhonghua, Tetko, Igor V., and Metelytsia, Larisa O.
- Subjects
- *
STAPHYLOCOCCUS aureus , *ACINETOBACTER baumannii , *STRUCTURE-activity relationships , *IONIC liquids , *MODEL validation , *ANTIBACTERIAL agents - Abstract
Online Chemical Modeling Environment (OCHEM) was used for QSAR analysis of a set of ionic liquids (ILs) tested against multi-drug resistant (MDR) clinical isolate Acinetobacter baumannii and Staphylococcus aureus strains. The predictive accuracy of regression models has coefficient of determination q2 = 0.66 − 0.79 with cross-validation and independent test sets. The models were used to screen a virtual chemical library of ILs, which was designed with targeted activity against MDR Acinetobacter baumannii and Staphylococcus aureus strains. Seven most promising ILs were selected, synthesized, and tested. Three ILs showed high activity against both these MDR clinical isolates. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
25. Design, synthesize and antiurease activity of novel thiazole derivatives: Machine learning, molecular docking and biological investigation.
- Author
-
Mermer, Arif
- Subjects
- *
THIAZOLES , *MOLECULAR docking , *MACHINE learning , *THIAZOLE derivatives , *BINDING sites , *BIOMOLECULES - Abstract
• Antiurease activity of newly designed and synthesized thiazole-2-imine derivatives. • Docking studies were performed for all compounds and interaction modes with enzyme active sites were determined. • Machine learning method was performed to choose the most active compounds. Machine learning is one of the methods used in the design of new molecules with different biological properties and has become a trend in recent years. Since there are many published studies on urease enzyme inhibition, we accumulated a huge literatüre data set and improved a model for antiurease activity. The balanced accuracy of the selected compounds (classification models) were about 78% and the predictive accuracy of them possessed a coefficient of determination q 2 = 0.2-0.7 (regression models) with cross-validation and independent test sets. Thanks to the chemical library created with the machine learning method, a comparison of the predictive and experimental results of the compounds that previously synthesized by us and investigated urease inhibition was made. Compounds observed to be experimentally active were found to be active with the machine learning method. The models are freely avaible online (http://ochem.eu) and can be used to predict potantial antiurease activity of novel compounds. The activity potentials of compounds 4a-d were further evaluated via molecular docking studies with AutoDock4 and AutoDock Vina softwares. Urease enzyme Image, graphical abstract [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
26. Matched molecular pair analysis on large melting point datasets: A big data perspective
- Author
-
Hongming Chen, Igor V. Tetko, and Michael Withnall
- Subjects
Big Data ,Datasets as Topic ,Quantitative Structure-Activity Relationship ,Nanotechnology ,01 natural sciences ,Biochemistry ,Set (abstract data type) ,melting points ,matched molecular pairs ,Molecular descriptor ,OCHEM ,Drug Discovery ,Range (statistics) ,Molecule ,Transition Temperature ,General Solubility Equation ,Matched Molecular Pairs ,Melting Points ,Ochem ,Statistical physics ,General Pharmacology, Toxicology and Pharmaceutics ,Pharmacology ,Molecular Structure ,Full Paper ,010405 organic chemistry ,Chemistry ,Organic Chemistry ,Hydrogen Bonding ,"Marie Sklodowska-Curie Actions" ,general solubility equation ,Full Papers ,Chemical space ,0104 chemical sciences ,010404 medicinal & biomolecular chemistry ,Models, Chemical ,Solubility ,Line (geometry) ,Melting point ,Molecular Medicine ,Matched molecular pair analysis ,Databases, Chemical - Abstract
A matched molecular pair (MMP) analysis was used to examine the change in melting point (MP) between pairs of similar molecules in a set of ∼275k compounds. We found many cases in which the change in MP (ΔMP) of compounds correlates with changes in functional groups. In line with the results of a previous study, correlations between ΔMP and simple molecular descriptors, such as the number of hydrogen bond donors, were identified. In using a larger dataset, covering a wider chemical space and range of melting points, we observed that this method remains stable and scales well with larger datasets. This MMP-based method could find use as a simple privacy-preserving technique to analyze large proprietary databases and share findings between participating research groups.
- Published
- 2017
27. Matched Molecular Pair Analysis on Large Melting Point Datasets: A Big Data Perspective.
- Author
-
Withnall M, Chen H, and Tetko IV
- Subjects
- Databases, Chemical, Hydrogen Bonding, Molecular Structure, Quantitative Structure-Activity Relationship, Solubility, Big Data, Datasets as Topic, Models, Chemical, Transition Temperature
- Abstract
A matched molecular pair (MMP) analysis was used to examine the change in melting point (MP) between pairs of similar molecules in a set of ∼275k compounds. We found many cases in which the change in MP (ΔMP) of compounds correlates with changes in functional groups. In line with the results of a previous study, correlations between ΔMP and simple molecular descriptors, such as the number of hydrogen bond donors, were identified. In using a larger dataset, covering a wider chemical space and range of melting points, we observed that this method remains stable and scales well with larger datasets. This MMP-based method could find use as a simple privacy-preserving technique to analyze large proprietary databases and share findings between participating research groups., (© 2017 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA.)
- Published
- 2018
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.