10 results on '"Dmitry Grin"'
Search Results
2. Enhancements in Functionality of the Interactive Visual Explorer for ATLAS Computing Metadata
- Author
-
Maria Grigorieva, Alexei Klimentov, Aleksandr Alekseev, T.P. Galkin, Dmitry Grin, S. Padolski, Tatiana Korchuganova, Mikhail Titov, and Alexey Artamonov
- Subjects
Visual analytics ,Information retrieval ,010308 nuclear & particles physics ,Group method of data handling ,Physics ,QC1-999 ,01 natural sciences ,Visualization ,Metadata ,Interactivity ,Interactive visual analysis ,0103 physical sciences ,010306 general physics ,Cluster analysis ,Level of detail ,Particle Physics - Experiment - Abstract
The development of the Interactive Visual Explorer (InVEx), a visual analytics tool for the computing metadata of the ATLAS experiment at LHC, includes research of various approaches for data handling both on server and client sides. InVEx is implemented as a web-based application which aims at the enhancing of analytical and visualization capabilities of the existing monitoring tools and facilitates the process of data analysis with the interactivity and human supervision. The current work is focused on the architecture enhancements of the InVEx application. First, we will describe the user-manageable data preparation stage for cluster analysis. Then, the Level-of-Detail approach for the interactive visual analysis will be presented. It starts with the low detailing, when all data records are grouped (by clustering algorithms or by categories) and aggregated. We provide users with means to look deeply into this data, incrementally increasing the level of detail. Finally, we demonstrate the development of data storage backend for InVEx, which is adapted for the Level-of-Detail method to keep all stages of data derivation sequence.
- Published
- 2020
3. Clustering error messages produced by distributed computing infrastructure during the processing of high energy physics data
- Author
-
Dmitry Grin and Maria Grigorieva
- Subjects
Physics ,Nuclear and High Energy Physics ,Large Hadron Collider ,Word embedding ,business.industry ,Atlas (topology) ,Data management ,Distributed computing ,020206 networking & telecommunications ,Astronomy and Astrophysics ,Scientific experiment ,02 engineering and technology ,Atomic and Molecular Physics, and Optics ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Word2vec ,Distributed computing infrastructure ,business ,Cluster analysis - Abstract
Large-scale distributed computing infrastructures ensure the operation and maintenance of scientific experiments at the LHC: more than 160 computing centers all over the world execute tens of millions of computing jobs per day. ATLAS — the largest experiment at the LHC — creates an enormous flow of data which has to be recorded and analyzed by a complex heterogeneous and distributed computing environment. Statistically, about 10–12% of computing jobs end with a failure: network faults, service failures, authorization failures, and other error conditions trigger error messages which provide detailed information about the issue, which can be used for diagnosis and proactive fault handling. However, this analysis is complicated by the sheer scale of textual log data, and often exacerbated by the lack of a well-defined structure: human experts have to interpret the detected messages and create parsing rules manually, which is time-consuming and does not allow identifying previously unknown error conditions without further human intervention. This paper is dedicated to the description of a pipeline of methods for the unsupervised clustering of multi-source error messages. The pipeline is data-driven, based on machine learning algorithms, and executed fully automatically, allowing categorizing error messages according to textual patterns and meaning.
- Published
- 2021
- Full Text
- View/download PDF
4. Serovar-level identification of bacterial foodborne pathogens from full-length 16S rRNA gene sequencing
- Author
-
Dmitry Grinevich, Lyndy Harden, Siddhartha Thakur, and Benjamin Callahan
- Subjects
serotyping ,food-borne pathogens ,Salmonella ,Escherichia coli ,16S rRNA ,long-read sequencing ,Microbiology ,QR1-502 - Abstract
ABSTRACTThe resolution of variation within species is critical for interpreting and acting on many microbial measurements. In the key foodborne pathogens Salmonella and Escherichia coli, the primary subspecies classification scheme used is serotyping: differentiating variants within these species by surface antigen profiles. Serotype prediction from whole-genome sequencing (WGS) of isolates is now seen as comparable or preferable to traditional laboratory methods where WGS is available. However, laboratory and WGS methods depend on an isolation step that is time-consuming and incompletely represents the sample when multiple strains are present. Community sequencing approaches that skip the isolation step are, therefore, of interest for pathogen surveillance. Here, we evaluated the viability of amplicon sequencing of the full-length 16S rRNA gene for serotyping Salmonella enterica and E. coli. We developed a novel algorithm for serotype prediction, implemented as an R package (Seroplacer), which takes as input full-length 16S rRNA gene sequences and outputs serovar predictions after phylogenetic placement into a reference phylogeny. We achieved over 89% accuracy in predicting Salmonella serotypes on in silico test data and identified key pathogenic serovars of Salmonella and E. coli in isolate and environmental test samples. Although serotype prediction from 16S rRNA gene sequences is not as accurate as serotype prediction from WGS of isolates, the potential to identify dangerous serovars directly from amplicon sequencing of environmental samples is intriguing for pathogen surveillance. The capabilities developed here are also broadly relevant to other applications where intraspecies variation and direct sequencing from environmental samples could be valuable.IMPORTANCEIn order to prevent and stop outbreaks of foodborne pathogens, it is important that we can detect when pathogenic bacteria are present in a food or food-associated site and identify connections between specific pathogenic bacteria present in different samples. In this work, we develop a new computational technology that allows the important foodborne pathogens Escherichia coli and Salmonella enterica to be serotyped (a subspecies level classification) from sequencing of a single-marker gene, and the 16S rRNA gene often used to surveil bacterial communities. Our results suggest current limitations to serotyping from 16S rRNA gene sequencing alone but set the stage for further progress that we consider likely given the rapid advance in the long-read sequencing technologies and genomic databases our work leverages. If this research direction succeeds, it could enable better detection of foodborne pathogens before they reach the public and speed the resolution of foodborne pathogen outbreaks.
- Published
- 2024
- Full Text
- View/download PDF
5. Experimental strategies to improve drug-target identification in mass spectrometry-based thermal stability assays
- Author
-
Clifford G. Phaneuf, Konstantin Aizikov, Dmitry Grinfeld, Arne Kreutzmann, Daniel Mourad, Oliver Lange, Daniel Dai, Bailin Zhang, Alexei Belenky, Alexander A. Makarov, and Alexander R. Ivanov
- Subjects
Chemistry ,QD1-999 - Abstract
Mass spectrometry-based thermal stability assays (MS-TSA) are a promising approach to characterize protein-ligand interaction, and several strategies have been recently developed to improve their performance. Here, the authors combine three recent strategies to qualitatively and quantitatively improve aspects of MS-TSA.
- Published
- 2023
- Full Text
- View/download PDF
6. Ultra-accurate microbial amplicon sequencing with synthetic long reads
- Author
-
Benjamin J. Callahan, Dmitry Grinevich, Siddhartha Thakur, Michael A. Balamotis, and Tuval Ben Yehezkel
- Subjects
Synthetic long reads ,Amplicon sequencing ,Metagenomics ,Long-read sequencing ,Microbial ecology ,QR100-130 - Abstract
Abstract Background Out of the many pathogenic bacterial species that are known, only a fraction are readily identifiable directly from a complex microbial community using standard next generation DNA sequencing. Long-read sequencing offers the potential to identify a wider range of species and to differentiate between strains within a species, but attaining sufficient accuracy in complex metagenomes remains a challenge. Methods Here, we describe and analytically validate LoopSeq, a commercially available synthetic long-read (SLR) sequencing technology that generates highly accurate long reads from standard short reads. Results LoopSeq reads are sufficiently long and accurate to identify microbial genes and species directly from complex samples. LoopSeq perfectly recovered the full diversity of 16S rRNA genes from known strains in a synthetic microbial community. Full-length LoopSeq reads had a per-base error rate of 0.005%, which exceeds the accuracy reported for other long-read sequencing technologies. 18S-ITS and genomic sequencing of fungal and bacterial isolates confirmed that LoopSeq sequencing maintains that accuracy for reads up to 6 kb in length. LoopSeq full-length 16S rRNA reads could accurately classify organisms down to the species level in rinsate from retail meat samples, and could differentiate strains within species identified by the CDC as potential foodborne pathogens. Conclusions The order-of-magnitude improvement in length and accuracy over standard Illumina amplicon sequencing achieved with LoopSeq enables accurate species-level and strain identification from complex- to low-biomass microbiome samples. The ability to generate accurate and long microbiome sequencing reads using standard short read sequencers will accelerate the building of quality microbial sequence databases and removes a significant hurdle on the path to precision microbial genomics. Video abstract.
- Published
- 2021
- Full Text
- View/download PDF
7. Iterative quantum amplitude estimation
- Author
-
Dmitry Grinko, Julien Gacon, Christa Zoufal, and Stefan Woerner
- Subjects
Physics ,QC1-999 ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
Abstract We introduce a variant of Quantum Amplitude Estimation (QAE), called Iterative QAE (IQAE), which does not rely on Quantum Phase Estimation (QPE) but is only based on Grover’s Algorithm, which reduces the required number of qubits and gates. We provide a rigorous analysis of IQAE and prove that it achieves a quadratic speedup up to a double-logarithmic factor compared to classical Monte Carlo simulation with provably small constant overhead. Furthermore, we show with an empirical study that our algorithm outperforms other known QAE variants without QPE, some even by orders of magnitude, i.e., our algorithm requires significantly fewer samples to achieve the same estimation accuracy and confidence level.
- Published
- 2021
- Full Text
- View/download PDF
8. Modelling the Generalized Multi-objective Vehicle Routing Problem Based on Costs
- Author
-
Viktor Kubil, Vasily Mokhov, and Dmitry Grinchenkov
- Subjects
multi-objective optimization ,mathematical model ,vehicle routing problem ,combinatorial optimization ,graph theory ,Electronic computers. Computer science ,QA75.5-76.95 ,Technology - Abstract
The following article addresses a complex combinatorial optimization and integer-programming problem, referred to as the vehicle routing problem, which is typically related to the field of transportation logistics. The aim of the research is to combine a set of objective functions, number of common generalizations and extensions of the problem, arising in distributed services or goods supply. For this purpose, literature on the subject has been analysed, leading to the mathematical modelling method being applied. At the current moment such complicated variants of the problem present high importance for research because of both practical applications and high complexity. The paper proposes a new generalized multi-objective vehicle routing problem with multiple depots and heterogeneous vehicles fleet with regard to various factors affecting costs. The problem statement is presented as a mixed integer linear program. Objectives scalarization approach is proposed in order to reduce decision-maker participation. Shortcomings of the single-criterion formulation and negative effects of replacing the criteria with constraints are shown. The results provide initial data for solving a large number of transportation problems that are reduced to the vehicle routing problem. In particular, the application of the ant colony optimization as a method for solving the problem is discussed.
- Published
- 2018
- Full Text
- View/download PDF
9. NEW DOSAGE FORM OF L-ARGININE BASED ON ITS COMPLEX WITH CELLULOSE ACETATE SULFATE AND ACTIVATED CARBON
- Author
-
Ekaterina Shakhno, Tatyana Savitskaya, Tatyana Pokrovskaya, Vladimir Yakushev, and Dmitry Grinshpan
- Subjects
L-arginine ,L-NAME ,nitric oxide deficiency ,water ,Therapeutics. Pharmacology ,RM1-950 - Abstract
Introduction: The oral route of L-arginine intake, despite its great comfort, is less effective in comparison with infusions. The development of new pharmaceutical forms for oral preparation is of undoubted interest. Objectives: To obtain a new dosage form of L-arginine by complexation with sodium salt of cellulose sulfate acetate (Na-CSA) followed by immobilization on activated carbon, and to evaluate its endothelial, cardioprotective and antihypertensive activity. Methods: UV-, visible spectroscopy, FTIR spectroscopy, viscometry, molecular mechanics, animal experiments with male Wistar albino rats in the modeling of L-NAME-induced endothelium dysfunction. Results: The physicochemical parameters of complexation, the adsorption activity of activated carbons of various origins with respect to individual L-arginine and its complex with Na-CSA were investigated. The greatest amount of adsorption and its smaller difference between L-arginine and the complex were taken into account when choosing activated carbon as a carrier (AUT-MI: GL-Arginine=320.2±0.1 mg/g, Gcomplex=340.2±0.1 mg/g; OU-A: GL-Arginine=210.1±0.1 mg/g, Gcomplex=235.1±0.1 mg/g; TH-90G: GL-Arginine=190.1±0.1 mg/g, Gcomplex=190.2±0.1 mg/g). Release of L-arginine as a result of its desorption into the model media of the human body was higher (96.3±0.5 wt.%) for the alkaline medium of intestines than for the acidic medium of stomach (10.0±0.1 wt.%). In animal experiments, it was shown that a combined preparation of L-arginine complex immobilized at doses of 30 mg/kg, 70 mg/kg and 200 mg/kg exhibits a pronounced antihypertensive, endothelioprotecive and cardioprotective activity at a dose of 200 mg/kg with L-NAME-induced nitric oxide deficiency (p
- Published
- 2017
- Full Text
- View/download PDF
10. USE OF L-ARGININE IMMOBILISED ON ACTIVATED CARBON FOR PHARMACOLOGICAL CORRECTION OF ENDOTHELIAL DISFUNCTION
- Author
-
Ekaterina Shakhno, Tatyana Savitskaya, Tatyana Pokrovskaya, Vladimir Yakushev, Mikhail Pokrovskii, and Dmitry Grinshpan
- Subjects
L-arginine ,L-NAME ,water-soluble cellulose deriva ,Therapeutics. Pharmacology ,RM1-950 - Abstract
For the first time a complex of L-arginine sodium salt of sulfate of cellulose acetate on activated carbon. To investigate the processes of sorption-desorption of L-arginine in a model environment. In animal experiments it was shown that granulirovannye the form of a combined preparation in dosages of 30 mg/kg, 70 mg/kg and 200 mg/kg exhibits a pronounced anti-hypertensive, endothelioprotecive and cardioprotective activity in a dosage of 200 mg/kg.
- Published
- 2016
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.