5 results on '"Bahk K"'
Search Results
2. SigAlign: an alignment algorithm guided by explicit similarity criteria.
- Author
-
Bahk K and Sung J
- Subjects
- Humans, Computational Biology methods, Sequence Analysis, DNA methods, Algorithms, Sequence Alignment methods, Software
- Abstract
In biological sequence alignment, prevailing heuristic aligners achieve high-throughput by several approximation techniques, but at the cost of sacrificing the clarity of output criteria and creating complex parameter spaces. To surmount these challenges, we introduce 'SigAlign', a novel alignment algorithm that employs two explicit cutoffs for the results: minimum length and maximum penalty per length, alongside three affine gap penalties. Comparative analyses of SigAlign against leading database search tools (BLASTn, MMseqs2) and read mappers (BWA-MEM, bowtie2, HISAT2, minimap2) highlight its performance in read mapping and database searches. Our research demonstrates that SigAlign not only provides high sensitivity with a non-heuristic approach, but also surpasses the throughput of existing heuristic aligners, particularly for high-accuracy reads or genomes with few repetitive regions. As an open-source library, SigAlign is poised to become a foundational component to provide a transparent and customizable alignment process to new analytical algorithms, tools and pipelines in bioinformatics., (© The Author(s) 2024. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2024
- Full Text
- View/download PDF
3. Pan-lineage Mycobacterium tuberculosis reference genome for enhanced molecular diagnosis.
- Author
-
Bahk K, Sung J, Seki M, Kim K, Kim J, Choi H, Whang J, and Mitarai S
- Subjects
- Whole Genome Sequencing methods, Humans, Microbial Sensitivity Tests, Tuberculosis microbiology, Tuberculosis diagnosis, Mycobacterium tuberculosis genetics, Genome, Bacterial
- Abstract
In Mycobacterium tuberculosis (MTB) control, whole genome sequencing-based molecular drug susceptibility testing (molDST-WGS) has emerged as a pivotal tool. However, the current reliance on a single-strain reference limits molDST-WGS's true potential. To address this, we introduce a new pan-lineage reference genome, 'MtbRf'. We assembled 'unmapped' reads from 3,614 MTB genomes (751 L1; 881 L2; 1,700 L3; and 282 L4) into 35 shared, annotated contigs (54 coding sequences [CDSs]). We constructed MtbRf through: (1) searching for contig homologues among genome database that precipitate results uniquely within Mycobacteria genus; (2) comparing genomes with H37Rv ('lift-over') to define 18 insertions; and (3) filling gaps in H37Rv with insertions. MtbRf adds 1.18% sequences to H37rv, salvaging >60% of previously unmapped reads. Transcriptomics confirmed gene expression of new CDSs. The new variants provided a moderate DST predictive value (AUROC 0.60-0.75). MtbRf thus unveils previously hidden genomic information and lays the foundation for lineage-specific molDST-WGS., (© The Author(s) 2024. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.)
- Published
- 2024
- Full Text
- View/download PDF
4. A High Quality Asian Genome Assembly Identifies Features of Common Missing Regions.
- Author
-
Kim J, Sung J, Han K, Lee W, Mun S, Lee J, Bahk K, Yang I, Bae YK, Kim C, Kim JI, and Seo JS
- Subjects
- High-Throughput Nucleotide Sequencing, Humans, Polymerase Chain Reaction, Repetitive Sequences, Nucleic Acid, Asian People genetics, Genome, Human
- Abstract
The current human reference genome (GRCh38), with its superior quality, has contributed significantly to genome analysis. However, GRCh38 may still underrepresent the ethnic genome, specifically for Asians, though exactly what we are missing is still elusive. Here, we juxtaposed GRCh38 with a high-contiguity genome assembly of one Korean (AK1) to show that a part of AK1 genome is missing in GRCh38 and that the missing regions harbored ~1390 putative coding elements. Furthermore, we found that multiple populations shared some certain parts in the missing genome when we analyzed the "unmapped" (to GRCh38) reads of fourteen individuals (five East-Asians, four Europeans, and five Africans), amounting to ~5.3 Mb (~0.2% of AK1) of the total genomic regions. The recovered AK1 regions from the "unmapped reads", which were the estimated missing regions that did not exist in GRCh38, harbored candidate coding elements. We verified that most of the common (shared by ≥7 individuals) missing regions exist in human and chimpanzee DNA. Moreover, we further identified the occurrence mechanism and ethnic heterogeneity as well as the presence of the common missing regions. This study illuminates a potential advantage of using a pangenome reference and brings up the need for further investigations on the various features of regions globally missed in GRCh38.
- Published
- 2020
- Full Text
- View/download PDF
5. Quantitative 19F NMR study of trifluorothymidine metabolism in rat brain.
- Author
-
Pouremad R, Bahk KD, Shen YJ, Knop RH, and Wyrwicz AM
- Subjects
- Animals, Antimetabolites pharmacokinetics, Biological Transport, Blood-Brain Barrier, Brain enzymology, Liver enzymology, Liver metabolism, Magnetic Resonance Spectroscopy, Pyrimidines metabolism, Rats, Rats, Sprague-Dawley, Thymidine Phosphorylase metabolism, Trifluridine pharmacokinetics, Antimetabolites metabolism, Brain metabolism, Trifluridine metabolism
- Abstract
Metabolism of trifluorothymidine (TFT) and its transport across the blood-brain barrier (BBB) has been measured quantitatively in rats by fluorine-19 nuclear magnetic resonance spectroscopy ((19)F NMR). It is demonstrated that TFT crosses the BBB in micromolar quantities and is metabolized in brain tissue primarily to its free base trifluoromethyluracil (TFMU) by the enzyme thymidine phosphorylase (TP). It is further proposed that the rate of TFMU production can be used as a measure of cerebral TP. The glycols of both TFMU, and to a lesser degree TFT, are generated via an oxidative route. In contrast, the major pathway for hepatic metabolism of this compound is through reduction of the nitrogen base moiety and generation of 5-6-dihydro species followed by ring degradation. Thus, in addition to TFMU as well as the dihydroxy (glycol)-, and the dihydro-species of both TFT and TFMU, alpha-trifluoromethyl-beta-ureidopropionic acid (F(3)MUPA) and alpha-trifluoromethyl-beta-alanine (F(3)MBA) were detected in liver extracts. The total metabolite levels in liver were 2-5 times higher than in the brain. Low levels of fluoride ion were detected in all the extracts from brain and liver, as well as blood and urine. This study characterizes TFT as a potential chemotherapeutic agent for use against brain tumors., (Copyright 1999 John Wiley & Sons, Ltd.)
- Published
- 1999
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.