1. The Thai reference exome (T‐REx) variant database
- Author
-
Chalurmpon Srichomthong, Keswadee Lapphra, Surakameth Mahasirimongkol, Thantrira Porntaveetus, Sissades Tongsima, Vorasuk Shotelersuk, Wanna Chetruengchai, Wichittra Tassaneeyakul, Sujiraporn Pakchuen, Kanya Suphapeetiporn, Verayuth Praphanphoj, Pattarapong Makarawate, Rujipat Wasitthankasem, Adjima Assawapitaksakul, Athiphat Khuninthong, Duangdao Wichadakul, Nusara Satproedprai, Pongsakorn Wangkumhang, Prapaporn Pisitkun, Vorthunju Nakhonsri, Piranit Nik Kantaputra, Alisa Wilantho, Chureerat Phokaew, Philip J. Shaw, Jittima Piriyapongsa, and Chumpol Ngamphiw
- Subjects
DNA Copy Number Variations ,Population genetics ,Genomics ,Biology ,computer.software_genre ,Southeast asian ,Polymorphism, Single Nucleotide ,Genomic Medicine ,Databases, Genetic ,Exome Sequencing ,Genetics ,Humans ,Exome ,Genetic Predisposition to Disease ,Copy-number variation ,Genetic Association Studies ,Genetics (clinical) ,Exome sequencing ,Autosome ,Database ,Computational Biology ,Genetic Variation ,Chromosome ,Molecular Sequence Annotation ,Thailand ,Genetics, Population ,computer - Abstract
To maximize the potential of genomics in medicine, it is essential to establish databases of genomic variants for ethno-geographic groups that can be used for filtering and prioritizing candidate pathogenic variants. Populations with non-European ancestry are poorly represented among current genomic variant databases. Here, we report the first high-density survey of genomic variants for the Thai population, the Thai Reference Exome (T-REx) variant database. T-REx comprises exome sequencing data of 1092 unrelated Thai individuals. The targeted exome regions common among four capture platforms cover 30.04 Mbp on autosomes and chromosome X. 345 681 short variants (18.27% of which are novel) and 34 907 copy number variations were found. Principal component analysis on 38 469 single nucleotide variants present worldwide showed that the Thai population is most genetically similar to East and Southeast Asian populations. Moreover, unsupervised clustering revealed six Thai subpopulations consistent with the evidence of gene flow from neighboring populations. The prevalence of common pathogenic variants in T-REx was investigated in detail, which revealed subpopulation-specific patterns, in particular variants associated with erythrocyte disorders such as the HbE variant in HBB and the Viangchan variant in G6PD. T-REx serves as a pivotal addition to the current databases for genomic medicine.
- Published
- 2021
- Full Text
- View/download PDF