14 results on '"Matthew T. Hickenbotham"'
Search Results
2. Extending assembly of short DNA sequences to handle error.
- Author
-
William R. Jeck, Josephine A. Reinhardt, David A. Baltrus, Matthew T. Hickenbotham, Vincent J. Magrini, Elaine R. Mardis, Jeffery L. Dangl, and Corbin D. Jones
- Published
- 2007
- Full Text
- View/download PDF
3. Flipping NextGen: using biological systems to characterize NextGen sequencing technologies.
- Author
-
Jarret Glasscock, Ryan Richt, and Matthew T. Hickenbotham
- Published
- 2009
4. DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome
- Author
-
Bob Fulton, Scott Abbott, Lisa Cook, Tracie L. Miner, Yu Zhao, Jacqueline E. Payton, David E. Larson, Rhonda E. Ries, Mark A. Watson, Scott M. Smith, Vincent Magrini, John F. DiPersio, Peter Westervelt, Brian H. Dunford-Shore, Sean McGrath, Jennifer Ivanovich, Michael D. McLellan, Matthew J. Walter, Nathan Sander, Timothy J. Ley, Craig Pohl, Matthew T. Hickenbotham, Jarret Glasscock, Daniel C. Koboldt, Amy Hawkins, Devin P. Locke, Rakesh Nagarajan, David Gordon, Li Ding, William D. Shannon, Elaine R. Mardis, Michael H. Tomasson, LaDeana W. Hillier, Ken Chen, Daniel C. Link, Richard K. Wilson, Patrick Minx, John R. Osborne, Sharon Heath, Rachel Abbott, Lucinda Fulton, Joshua J. Conyers, Asif T. Chinwalla, David J. Dooling, Timothy A. Graubert, Todd Wylie, Jack Baty, and Xiaoqi Shi
- Subjects
Myeloid ,Genomics ,Biology ,medicine.disease_cause ,Polymorphism, Single Nucleotide ,Genome ,Article ,DNA sequencing ,Recurrence ,medicine ,Humans ,Sequence Deletion ,Skin ,Genetics ,Multidisciplinary ,Genome, Human ,Gene Expression Profiling ,Sequence Analysis, DNA ,Gene Expression Regulation, Neoplastic ,Gene expression profiling ,Leukemia, Myeloid, Acute ,Mutagenesis, Insertional ,Haematopoiesis ,medicine.anatomical_structure ,Case-Control Studies ,Mutation ,Disease Progression ,Human genome ,Carcinogenesis - Abstract
Lay Summary Acute myeloid leukemia is a highly malignant hematopoietic tumor that affects about 13,000 adults yearly in the United States. The treatment of this disease has changed little in the past two decades, since most of the genetic events that initiate the disease remain undiscovered. Whole genome sequencing is now possible at a reasonable cost and timeframe to utilize this approach for unbiased discovery of tumor-specific somatic mutations that alter the protein-coding genes. Here we show the results obtained by sequencing a typical acute myeloid leukemia genome and its matched normal counterpart, obtained from the patient’s skin. We discovered 10 genes with acquired mutations; two were previously described mutations thought to contribute to tumor progression, and 8 were novel mutations present in virtually all tumor cells at presentation and relapse, whose function is not yet known. Our study establishes whole genome sequencing as an unbiased method for discovering initiating mutations in cancer genomes, and for identifying novel genes that may respond to targeted therapies. We used massively parallel sequencing technology to sequence the genomic DNA of tumor and normal skin cells obtained from a patient with a typical presentation of FAB M1 Acute Myeloid Leukemia (AML) with normal cytogenetics. 32.7-fold ‘haploid’ coverage (98 billion bases) was obtained for the tumor genome, and 13.9-fold coverage (41.8 billion bases) was obtained for the normal sample. Of 2,647,695 well-supported Single Nucleotide Variants (SNVs) found in the tumor genome, 2,588,486 (97.7%) also were detected in the patient’s skin genome, limiting the number of variants that required further study. For the purposes of this initial study, we restricted our downstream analysis to the coding sequences of annotated genes: we found only eight heterozygous, non-synonymous somatic SNVs in the entire genome. All were novel, including mutations in protocadherin/cadherin family members (CDH24 and PCLKC), G-protein coupled receptors (GPR123 and EBI2), a protein phosphatase (PTPRT), a potential guanine nucleotide exchange factor (KNDC1), a peptide/drug transporter (SLC15A1), and a glutamate receptor gene (GRINL1B). We also detected previously described, recurrent somatic insertions in the FLT3 and NPM1 genes. Based on deep readcount data, we determined that all of these mutations (except FLT3) were present in nearly all tumor cells at presentation, and again at relapse 11 months later, suggesting that the patient had a single dominant clone containing all of the mutations. These results demonstrate the power of whole genome sequencing to discover novel cancer-associated mutations.
- Published
- 2008
- Full Text
- View/download PDF
5. The DNA sequence of human chromosome 7
- Author
-
Robert Baertsch, Karen A. Phelps, Lisa Cook, Joelle Kalicki, Michelle O'Laughlin, Kerry L. Bubb, David Torrents, Kristine M. Wylie, Andrew Vanbrunt, Mark E. Schaller, Dan Layman, Kelsi Scott, LaDeana W. Hillier, Marco A. Marra, Caryn Wagner-McPherson, Cindy Strong, Phil Latreille, Hui Sun, Maynard V. Olson, Holland Bradshaw-Cordum, Amanda Abbott, Robert S. Fulton, Nicolas Berkowicz, Richard Harkins, Asif T. Chinwalla, Rajinder Kaul, William E. Nash, Chad Tomlinson, Susan M. Rock, Patricia Wohldmann, Paul Flicek, Elaine R. Mardis, Catrina Strowmatt, James M. Eldred, Betty Lamar, Christopher K. Raymond, Michael C. Wendl, Lauren Bielicki, Shawn Leonard, John Douglas Mcpherson, Christine Nguyen, Jennifer Murray, Michael C. Becker, Lucinda Fulton, Amber Isak, Will Gillett, Matt Cordes, James B. Clendenning, Kymberlie H. Pepin, Mandeep Sekhon, Eric Haugen, Feiyu Du, Theresa Rohlfing, Kimberly D. Delehaunty, Nancy Miller, Amy Kozlowicz-Reilly, Eric D. Green, W. James Kent, Tamberlyn Bieri, Peer Bork, Richard K. Wilson, Patrick Minx, John Spieth, Evan E. Eichler, Shawn Iadanoto, Terrence S. Furey, Matthew E. Portnoy, Shunfang Hou, R. James, Warren Gish, Brian Schultz, Doug Johnson, Philip Ozersky, Jennifer Edwards, Stephanie L. Chissoe, Jeffrey A. Bailey, Tracie L. Miner, Jason Maas, Andrea Holmes, Sandra W. Clifton, Sara Jaeger, Tina Graves, Ruth Levy, Joseph A. Bedell, Ginger A. Fewell, Mikita Suyama, Shiaw-Pyng Yang, Sean R. Eddy, Rebecca S. Walker, Aye-Mon Tin-Wollam, Hui Du, Evan Keibler, Matthew T. Hickenbotham, Sara Dauphin-Kohlberg, Robert H. Waterston, Yang Zhou, Stephanie Andrews, Johar Ali, John W. Wallis, Michael R. Brent, Rachel Maupin, Donald Williams, Elizabeth Simms, Laura Courtney, Anthony R. Harris, Jeffrey Woessner, and Joanne O. Nelson
- Subjects
Williams Syndrome ,Chromosome 7 (human) ,Genetics ,RNA, Untranslated ,Multidisciplinary ,Base Sequence ,Sequence analysis ,Pseudogene ,Molecular Sequence Data ,Proteins ,Sequence Analysis, DNA ,Genome project ,Biology ,Physical Chromosome Mapping ,Conserved sequence ,Sequence-tagged site ,Mice ,Sequence logo ,Species Specificity ,Gene Duplication ,Consensus sequence ,Animals ,Humans ,Chromosomes, Human, Pair 7 ,Pseudogenes - Abstract
Human chromosome 7 has historically received prominent attention in the human genetics community, primarily related to the search for the cystic fibrosis gene and the frequent cytogenetic changes associated with various forms of cancer. Here we present more than 153 million base pairs representing 99.4% of the euchromatic sequence of chromosome 7, the first metacentric chromosome completed so far. The sequence has excellent concordance with previously established physical and genetic maps, and it exhibits an unusual amount of segmentally duplicated sequence (8.2%), with marked differences between the two arms. Our initial analyses have identified 1,150 protein-coding genes, 605 of which have been confirmed by complementary DNA sequences, and an additional 941 pseudogenes. Of genes confirmed by transcript sequences, some are polymorphic for mutations that disrupt the reading frame.
- Published
- 2003
- Full Text
- View/download PDF
6. Next-generation transcriptome sequencing of the premenopausal breast epithelium using specimens from a normal human breast tissue bank
- Author
-
Theresa Mathieson, Bradley A. Hancock, Jarret Glasscock, Yunlong Liu, Matthew T. Hickenbotham, Rebecca W. Doerge, Diane K. Doxey, Candice A.M. Sauder, Milan Radovich, Rachel J. Blosser, Sunil Badve, Faye Zheng, Jin Zhu, Ivanesa Pardo, Mi Ran Choi, Anna Maria Storniolo, Heather A. Lillemoe, Dadrie Baptiste, Rutuja Atale, and Susan E. Clare
- Subjects
Adult ,Aging ,medicine.medical_specialty ,1.1 Normal biological development and functioning ,media_common.quotation_subject ,Mammary gland ,Oncology and Carcinogenesis ,Physiology ,Tissue Banks ,Luteal phase ,Biology ,Luteal Phase ,Epithelium ,Transcriptome ,Breast cancer ,Clinical Research ,Underpinning research ,Internal medicine ,Breast Cancer ,Follicular phase ,Genetics ,medicine ,Humans ,Gene Regulatory Networks ,Breast ,Oncology & Carcinogenesis ,Susan G. Komen for the Cure Tissue Bank at the IU Simon Cancer Center ,Menstrual cycle ,Cancer ,media_common ,Medicine(all) ,Reverse Transcriptase Polymerase Chain Reaction ,Contraception/Reproduction ,Prevention ,Human Genome ,High-Throughput Nucleotide Sequencing ,Middle Aged ,medicine.disease ,Endocrinology ,medicine.anatomical_structure ,Follicular Phase ,Premenopause ,Linear Models ,Female ,Algorithms - Abstract
Introduction Our efforts to prevent and treat breast cancer are significantly impeded by a lack of knowledge of the biology and developmental genetics of the normal mammary gland. In order to provide the specimens that will facilitate such an understanding, The Susan G. Komen for the Cure Tissue Bank at the IU Simon Cancer Center (KTB) was established. The KTB is, to our knowledge, the only biorepository in the world prospectively established to collect normal, healthy breast tissue from volunteer donors. As a first initiative toward a molecular understanding of the biology and developmental genetics of the normal mammary gland, the effect of the menstrual cycle and hormonal contraceptives on DNA expression in the normal breast epithelium was examined. Methods Using normal breast tissue from 20 premenopausal donors to KTB, the changes in the mRNA of the normal breast epithelium as a function of phase of the menstrual cycle and hormonal contraception were assayed using next-generation whole transcriptome sequencing (RNA-Seq). Results In total, 255 genes representing 1.4% of all genes were deemed to have statistically significant differential expression between the two phases of the menstrual cycle. The overwhelming majority (221; 87%) of the genes have higher expression during the luteal phase. These data provide important insights into the processes occurring during each phase of the menstrual cycle. There was only a single gene significantly differentially expressed when comparing the epithelium of women using hormonal contraception to those in the luteal phase. Conclusions We have taken advantage of a unique research resource, the KTB, to complete the first-ever next-generation transcriptome sequencing of the epithelial compartment of 20 normal human breast specimens. This work has produced a comprehensive catalog of the differences in the expression of protein-coding genes as a function of the phase of the menstrual cycle. These data constitute the beginning of a reference data set of the normal mammary gland, which can be consulted for comparison with data developed from malignant specimens, or to mine the effects of the hormonal flux that occurs during the menstrual cycle.
- Published
- 2014
7. Characterizing the heterogeneity of triple-negative breast cancers using microdissected normal ductal epithelium and RNA-sequencing
- Author
-
Mircea Ivan, Bradley A. Hancock, Onur Sakarya, Eric E. Hilligoss, Fiona Hyland, Connie Rufenbarger, Rutuja Atale, Anna Maria Storniolo, Ivanesa Pardo, Jill E. Henry, Susan E. Clare, Candice A.M. Sauder, Jin Zhu, Nawal Kassem, Matthew T. Hickenbotham, Heather A. Lillemoe, Milan Radovich, Diane K. Doxey, Yunlong Liu, Bryan P. Schneider, Jeffrey P. Solzak, Sunil Badve, George W. Sledge, Theresa Mathieson, Jarret Glasscock, Rachel J. Blosser, and Mi Ran Choi
- Subjects
Cancer Research ,Pathology ,Transcription, Genetic ,Gene regulatory network ,Triple Negative Breast Neoplasms ,Transcriptome ,2.1 Biological and endogenous factors ,Cluster Analysis ,Gene Regulatory Networks ,Aetiology ,Microdissection ,Triple-negative breast cancer ,Cancer ,Ductal epithelium ,screening and diagnosis ,Forkhead Transcription Factors ,Mammary Glands ,Gene Expression Regulation, Neoplastic ,Detection ,medicine.anatomical_structure ,Oncology ,Female ,Sequence Analysis ,Transcription ,Adjacent normal ,Human ,medicine.medical_specialty ,Clinical Sciences ,Oncology and Carcinogenesis ,Biology ,Article ,Breast cancer ,Genetic ,Clinical Research ,Breast Cancer ,medicine ,Genetics ,Humans ,Oncology & Carcinogenesis ,Mammary Glands, Human ,Neoplastic ,Sequence Analysis, RNA ,Gene Expression Profiling ,Human Genome ,Forkhead Box Protein M1 ,TCGA ,medicine.disease ,Epithelium ,Normal breast ,4.1 Discovery and preclinical testing of markers and technologies ,Gene expression profiling ,Good Health and Well Being ,Gene Expression Regulation ,Case-Control Studies ,Mutation ,Cancer research ,RNA ,RNA-seq - Abstract
Triple-negative breast cancers (TNBCs) are a heterogeneous set of tumors defined by an absence of actionable therapeutic targets (ER, PR, and HER-2). Microdissected normal ductal epithelium from healthy volunteers represents a novel comparator to reveal insights into TNBC heterogeneity and to inform drug development. Using RNA-sequencing data from our institution and The Cancer Genome Atlas (TCGA) we compared the transcriptomes of 94 TNBCs, 20 microdissected normal breast tissues from healthy volunteers from the Susan G. Komen for the Cure Tissue Bank, and 10 histologically normal tissues adjacent to tumor. Pathway analysis comparing TNBCs to optimized normal controls of microdissected normal epithelium versus classic controls composed of adjacent normal tissue revealed distinct molecular signatures. Differential gene expression of TNBC compared with normal comparators demonstrated important findings for TNBC-specific clinical trials testing targeted agents; lack of over-expression for negative studies and over-expression in studies with drug activity. Next, by comparing each individual TNBC to the set of microdissected normals, we demonstrate that TNBC heterogeneity is attributable to transcriptional chaos, is associated with non-silent DNA mutational load, and explains transcriptional heterogeneity in addition to known molecular subtypes. Finally, chaos analysis identified 146 core genes dysregulated in >90% of TNBCs revealing an over-expressed central network. In conclusion, use of microdissected normal ductal epithelium from healthy volunteers enables an optimized approach for studying TNBC and uncovers biological heterogeneity mediated by transcriptional chaos.
- Published
- 2013
8. The value of avian genomics to the conservation of wildlife
- Author
-
Leona G. Chemnick, Tanya Renner, Matthew T. Hickenbotham, William S. Modi, Christie A. Otten, Sean McGrath, Marisa L. Korody, Jarret Glasscock, Sugandha Dandekar, Yang Da, Emily M Stremel Mork, Eric D. Green, Oliver A. Ryder, Marlys L. Houck, Vincent Magrini, Michael N Romanov, Elaine R. Mardis, Kenneth C. Jones, Elaina M. Tuttle, and Jeanette C. Papp
- Subjects
0106 biological sciences ,Chromosomes, Artificial, Bacterial ,Conservation of Natural Resources ,Genetic Linkage ,Population ,Genomics ,010603 evolutionary biology ,01 natural sciences ,Genome ,Evolution, Molecular ,03 medical and health sciences ,biology.animal ,Genetic variation ,Genetics ,Animals ,education ,QH426 ,030304 developmental biology ,Gene Library ,0303 health sciences ,education.field_of_study ,Sparrow ,biology ,Raptors ,Genetic Variation ,Sequence Analysis, DNA ,15. Life on land ,Genetics, Population ,Proceedings ,Genetic marker ,Evolutionary biology ,Karyotyping ,Microsatellite ,Identification (biology) ,Female ,Sparrows ,Biotechnology ,Microsatellite Repeats - Abstract
Background: Genomic studies in non-domestic avian models, such as the California condor and white-throated sparrow, can lead to more comprehensive conservation plans and provide clues for understanding mechanisms affecting genetic variation, adaptation and evolution. \ud \ud Developing genomic tools and resources including genomic libraries and a genetic map of the California condor is a prerequisite for identification of candidate loci for a heritable embryonic lethal condition. The white-throated sparrow exhibits a stable genetic polymorphism (i.e. chromosomal rearrangements) associated with variation in morphology, physiology, and behavior (e.g., aggression, social behavior, sexual behavior, parental care). In this paper we outline the utility of these species as well as report on recent advances in the study of their genomes. \ud \ud Results: Genotyping of the condor resource population at 17 microsatellite loci provided a better assessment of the current population's genetic variation. Specific New World vulture repeats were found in the condor genome. Using condor BAC library and clones, chicken-condor comparative maps were generated. A condor fibroblast cell line transcriptome was characterized using the 454 sequencing technology. \ud \ud Our karyotypic analyses of the sparrow in combination with other studies indicate that the rearrangements in both chromosomes 2(m) and 3(a) are complex and likely involve multiple inversions, interchromosomal linkage, and pleiotropy. At least a portion of the rearrangement in chromosome 2(m) existed in the common ancestor of the four North American species of Zonotrichia, but not in the one South American species, and that the 2(m) form, originally thought to be the derived condition, might actually be the ancestral one. \ud \ud Conclusion: Mining and characterization of candidate loci in the California condor using molecular genetic and genomic techniques as well as linkage and comparative genomic mapping will eventually enable the identification of carriers of the chondrodystrophy allele, resulting in improved genetic management of this disease. \ud \ud In the white-throated sparrow, genomic studies, combined with ecological data, will help elucidate the basis of genic selection in a natural population. Morphs of the sparrow provide us with a unique opportunity to study intraspecific genomic differences, which have resulted from two separate yet linked evolutionary trajectories. Such results can transform our understanding of evolutionary and conservation biology.
- Published
- 2009
9. Whole-genome sequencing and variant discovery in C. elegans
- Author
-
Jarret Glasscock, Richard K. Wilson, Weichun Huang, Michael Strömberg, Paul Fox, Sacha N. Sander, Tim Schedl, Todd Wylie, Ryan Richt, Eric F. Tsung, Derek Barnett, LaDeana W. Hillier, Ginger A. Fewell, Donald A. Stewart, David J. Dooling, Gabor T. Marth, Aaron R. Quinlan, Matthew T. Hickenbotham, Elaine R. Mardis, and Vincent Magrini
- Subjects
Whole genome sequencing ,Genetics ,Massive parallel sequencing ,Base Sequence ,DNA Mutational Analysis ,Molecular Sequence Data ,Sequence assembly ,Chromosome Mapping ,Genetic Variation ,Hybrid genome assembly ,Cell Biology ,Sequence Analysis, DNA ,Biology ,Biochemistry ,Genome ,Polymorphism, Single Nucleotide ,Deep sequencing ,DNA sequencing ,Animals ,Caenorhabditis elegans ,Molecular Biology ,Biotechnology ,Reference genome - Abstract
Massively parallel sequencing instruments enable rapid and inexpensive DNA sequence data production. Because these instruments are new, their data require characterization with respect to accuracy and utility. To address this, we sequenced a Caernohabditis elegans N2 Bristol strain isolate using the Solexa Sequence Analyzer, and compared the reads to the reference genome to characterize the data and to evaluate coverage and representation. Massively parallel sequencing facilitates strain-to-reference comparison for genome-wide sequence variant discovery. Owing to the short-read-length sequences produced, we developed a revised approach to determine the regions of the genome to which short reads could be uniquely mapped. We then aligned Solexa reads from C. elegans strain CB4858 to the reference, and screened for single-nucleotide polymorphisms (SNPs) and small indels. This study demonstrates the utility of massively parallel short read sequencing for whole genome resequencing and for accurate discovery of genome-wide polymorphisms.
- Published
- 2007
10. Analysis of the prostate cancer cell line LNCaP transcriptome using a sequencing-by-synthesis approach
- Author
-
Anne Go, Allen Delaney, Matthew T. Hickenbotham, Asim Siddiqui, Marco A. Marra, René L. Warren, Tammy L Romanuik, Marianne D. Sadar, Martin Hirst, Matthew N. Bainbridge, Thomas Zeng, Steven J.M. Jones, Elaine R. Mardis, Vincent Magrini, and Malachi Griffith
- Subjects
Male ,DNA, Complementary ,Neoplasms, Hormone-Dependent ,Transcription, Genetic ,lcsh:QH426-470 ,lcsh:Biotechnology ,Biology ,Adenocarcinoma ,Polymorphism, Single Nucleotide ,Transcriptome ,03 medical and health sciences ,0302 clinical medicine ,Cell Line, Tumor ,Sequence Homology, Nucleic Acid ,lcsh:TP248.13-248.65 ,LNCaP ,Genetics ,Chromosomes, Human ,Humans ,RNA, Messenger ,RNA, Neoplasm ,Gene ,030304 developmental biology ,Expressed Sequence Tags ,0303 health sciences ,Expressed sequence tag ,Massive parallel sequencing ,Methodology Article ,Gene Expression Profiling ,Alternative splicing ,Chromosome Mapping ,Prostatic Neoplasms ,Exons ,Sequence Analysis, DNA ,Metribolone ,Gene expression profiling ,Gene Expression Regulation, Neoplastic ,Alternative Splicing ,lcsh:Genetics ,030220 oncology & carcinogenesis ,Androgens ,DNA microarray ,Sequence Alignment ,Biotechnology - Abstract
Background High throughput sequencing-by-synthesis is an emerging technology that allows the rapid production of millions of bases of data. Although the sequence reads are short, they can readily be used for re-sequencing. By re-sequencing the mRNA products of a cell, one may rapidly discover polymorphisms and splice variants particular to that cell. Results We present the utility of massively parallel sequencing by synthesis for profiling the transcriptome of a human prostate cancer cell-line, LNCaP, that has been treated with the synthetic androgen, R1881. Through the generation of approximately 20 megabases (MB) of EST data, we detect transcription from over 10,000 gene loci, 25 previously undescribed alternative splicing events involving known exons, and over 1,500 high quality single nucleotide discrepancies with the reference human sequence. Further, we map nearly 10,000 ESTs to positions on the genome where no transcription is currently predicted to occur. We also characterize various obstacles with using sequencing by synthesis for transcriptome analysis and propose solutions to these problems. Conclusion The use of high-throughput sequencing-by-synthesis methods for transcript profiling allows the specific and sensitive detection of many of a cell's transcripts, and also allows the discovery of high quality base discrepancies, and alternative splice variants. Thus, this technology may provide an effective means of understanding various disease states, discovering novel targets for disease treatment, and discovery of novel transcripts.
- Published
- 2006
11. Generation and annotation of the DNA sequences of human chromosomes 2 and 4
- Author
-
Xinwei She, Sara Dauphin-Kohlberg, Robert H. Waterston, Patricia Wohldmann, Joelle Kalicki, Ivan Ovcharenko, Shawn Leonard, Scott Kruchowski, Ernest Goyea, William Haakenson, Scott Abbott, Catrina Fronick, Aniko Sabo, Maria Cedroni, Edward A. Belter, Hui Du, Asif T. Chinwalla, Kelly Mead, Michael C. Wendl, Rachel Maupin, Amber Isak, David R. Cox, Tamberlyn Bieri, LaDeana W. Hillier, Lee Trani, Neenu Grewal, Aye Mon Tin-Wollam, Rick Meyer, Lachlan G. Oddy, Lucinda Fulton, Li Ding, Richard M. Myers, Neha Shah, Colin Kremitzki, Tracie L. Miner, Holland Bradshaw-Cordum, Tina Graves, Glendoria Elliott, Matthew T. Hickenbotham, Wesley C. Warren, Mandeep Sekhon, Anu Desai, Jason Carter, Thomas Erb, Webb Miller, Prashant R. Sinha, Richard K. Wilson, Krista Haglund, John Spieth, Craig Pohl, Lisa Cook, John Douglas Mcpherson, Patrick Minx, Susan M. Rock, Ginger A. Fewell, Thomas A. Jones, Michael C. Becker, Yoram Shotland, Chunyan Wang, Jason Waligorski, Kyung Kim, Nicolas Berkowicz, Amy Kozlowicz-Reilly, Johar Ali, Xiaoqiu Huang, Shiaw Pyng Yang, Sean R. Eddy, Mikita Suyama, Francesca D. Ciccarelli, Martin Yoakum, John W. Wallis, Kristine M. Wylie, Maxim Radionenko, Donald Williams, Richard Harkins, Terrence S. Furey, Jacqueline E. Snider, Michael D. McLellan, Andrea Holmes, Shunfang Hou, Johanna Thompson, Andrew Van Brunt, Hui Sun, Teresa Davidson, Rekha Meyer, Feiyu Du, Jennifer Randall-Maher, Chad Tomlinson, Jon R. Armstrong, Robert S. Fulton, William E. Nash, Philip Ozersky, Marc Cotton, Sandra W. Clifton, Elisa Izaurralde, Lauren Caruso, Joanne O. Nelson, James M. Eldred, Sharhonda Swearengen-Shahid, Marco A. Marra, Caryn Wagner-McPherson, Cindy Strong, David Torrents, Phil Latreille, Peer Bork, Elaine R. Mardis, Andrew Levy, Evan E. Eichler, Laura Courtney, Anthony R. Harris, Jeremy Schmutz, Christine Nguyen, Dan Layman, James Taylor, Matt Cordes, Joseph T. Strong, Kimberly D. Delehaunty, Kymberlie H. Pepin, Scott Martinka, Tony Gaige, Charlene Pearman, and John R. Osborne
- Subjects
Primates ,RNA, Untranslated ,Centromere ,Molecular Sequence Data ,Biology ,Euchromatin ,Chromosome 16 ,Chromosome 19 ,Gene Duplication ,Animals ,Humans ,RNA, Messenger ,Conserved Sequence ,Genetics ,Expressed Sequence Tags ,Recombination, Genetic ,Base Composition ,Multidisciplinary ,Polymorphism, Genetic ,Base Sequence ,Genetic Variation ,Proteins ,Genomics ,Sequence Analysis, DNA ,Physical Chromosome Mapping ,Chromosome 17 (human) ,Chromosome 4 ,Chromosome 3 ,Chromosomes, Human, Pair 2 ,CpG Islands ,Chromosome 20 ,Chromosomes, Human, Pair 4 ,Chromosome 21 ,Chromosome 22 ,Pseudogenes - Abstract
Human chromosome 2 is unique to the human lineage in being the product of a head-to-head fusion of two intermediate-sized ancestral chromosomes. Chromosome 4 has received attention primarily related to the search for the Huntington's disease gene, but also for genes associated with Wolf-Hirschhorn syndrome, polycystic kidney disease and a form of muscular dystrophy. Here we present approximately 237 million base pairs of sequence for chromosome 2, and 186 million base pairs for chromosome 4, representing more than 99.6% of their euchromatic sequences. Our initial analyses have identified 1,346 protein-coding genes and 1,239 pseudogenes on chromosome 2, and 796 protein-coding genes and 778 pseudogenes on chromosome 4. Extensive analyses confirm the underlying construction of the sequence, and expand our understanding of the structure and evolution of mammalian chromosomes, including gene deserts, segmental duplications and highly variant regions.
- Published
- 2004
12. Viral discovery and sequence recovery using DNA microarrays
- Author
-
Vincent Magrini, Thomas G. Ksiazek, Yu-Tsueng Liu, Don Ganem, Matthew T. Hickenbotham, Michael Springer, Dean D. Erdman, Richard K. Wilson, David Wang, Anatoly Urisman, Elaine R. Mardis, Joseph L. DeRisi, James M. Eldred, J. Phillipe Latreille, and Herbert Virgin
- Subjects
Genes, Viral ,viruses ,medicine.disease_cause ,Severe Acute Respiratory Syndrome ,Communicable Diseases, Emerging ,Genome ,Polymerase Chain Reaction ,Medical and Health Sciences ,law.invention ,law ,2.2 Factors relating to the physical environment ,Viral ,Biology (General) ,Aetiology ,Lung ,Polymerase chain reaction ,Coronavirus ,Oligonucleotide Array Sequence Analysis ,Emerging ,General Neuroscience ,Nucleic Acid Hybridization ,Biological Sciences ,SARS Virus ,Infectious Diseases ,Genetic Techniques ,Severe acute respiratory syndrome-related coronavirus ,GenBank ,Viruses ,Pneumonia & Influenza ,DNA microarray ,General Agricultural and Biological Sciences ,Infection ,Sequence Analysis ,Research Article ,Biotechnology ,QH301-705.5 ,Sequence analysis ,Molecular Sequence Data ,Genome, Viral ,Biology ,Genetics/Genomics/Gene Therapy ,Communicable Diseases ,General Biochemistry, Genetics and Molecular Biology ,Virus ,Vaccine Related ,Virology ,Biodefense ,medicine ,Genetics ,Humans ,General Immunology and Microbiology ,Base Sequence ,Agricultural and Veterinary Sciences ,Prevention ,Human Genome ,Sequence Analysis, DNA ,DNA ,Pneumonia ,Emerging Infectious Diseases ,Good Health and Well Being ,Genes ,Novel virus ,Developmental Biology - Abstract
Because of the constant threat posed by emerging infectious diseases and the limitations of existing approaches used to identify new pathogens, there is a great demand for new technological methods for viral discovery. We describe herein a DNA microarray-based platform for novel virus identification and characterization. Central to this approach was a DNA microarray designed to detect a wide range of known viruses as well as novel members of existing viral families; this microarray contained the most highly conserved 70mer sequences from every fully sequenced reference viral genome in GenBank. During an outbreak of severe acute respiratory syndrome (SARS) in March 2003, hybridization to this microarray revealed the presence of a previously uncharacterized coronavirus in a viral isolate cultivated from a SARS patient. To further characterize this new virus, approximately 1 kb of the unknown virus genome was cloned by physically recovering viral sequences hybridized to individual array elements. Sequencing of these fragments confirmed that the virus was indeed a new member of the coronavirus family. This combination of array hybridization followed by direct viral sequence recovery should prove to be a general strategy for the rapid identification and characterization of novel viruses and emerging infectious disease., Know your enemy. Description of the ‘virus gene chip’ that helped to classify the SARS virus as a novel coronavirus
- Published
- 2003
13. Abstract 2216: Next-generation whole transcriptome sequencing of triple-negative breast tumors and normal tissues
- Author
-
Nawal Kassem, James S. Elliott, Anna Maria Storniolo, George W. Sledge, Jarret Glasscock, Matthew T. Hickenbotham, M Radovich, Eric E. Hilligoss, Heather A. Lillemoe, Yunlong Liu, JE Henry, Ryan Richt, Bradley A. Hancock, Susan E. Clare, Bryan P. Schneider, Connie Rufenbarger, I Pardo, Jie Sun, and T Mathieson
- Subjects
Genetics ,Transcriptome ,Whole genome sequencing ,Cancer Research ,Oncology ,Alternative splicing ,RefSeq ,Human genome ,Genomics ,Biology ,Genome ,Gene - Abstract
Background: Triple-negative breast cancer (TNBC) disproportionally affects pre-menopausal women and women of African-American descent, and has been plagued by the absence of targeted therapies leading to poor survival. The paucity of therapeutic targets in TNBC impels us to utilize new technologies that can determine novel targets on a global scale. Using next-generation sequencing, we embarked on a study to analyze the whole transcriptomes of TNBC tumors compared to normal breast tissues in order to comprehensively identify novel targets by analyzing all full length transcripts expressed in these tissues. Methods: Normal breast tissues from healthy pre-menopausal volunteers with no history of disease were procured from the Susan G. Komen for the Cure® Tissue Bank at the IU Simon Cancer Center. To eliminate bias from stromal tissue, epithelial cells were laser capture microdissected and RNA extracted from captured cells. cDNA libraries from 10 TNBC tumors and 10 normal breast tissues were subsequently sequenced on an ABI SOLiD3 sequencer using a 50bp fragment run. For gene expression, mapping of reads to the human genome was performed using the ABI Whole Transcriptome Pipeline and outputs were imported into Partek Genomics Suite for analysis. To analyze for gene fusions, reads were mapped to the genome using the SOLiD Analysis Pipeline Tool, followed by an alignment to Refseq to map reads crossing exon-exon junctions. A composite transcriptome was formed from areas of the genome with significant expression (17% of the genome sequence) and served as a concise search space for identifying fusions. Reads not mapping to the genome or to RefSeq (a rich source of fusion reads) were then mapped to the composite transcriptome using BLAT to facilitate a highly accurate split-read alignment. Using a custom developed pipeline, reads that spanned transcribed regions from two different chromosomes, or to loci farther than 200kb apart on the same chromosome, were considered as candidate fusions. Results/Discussion: Sequencing of the 10 TNBC tumors and 10 normal samples produced 1.1 billion reads equaling 58.15GB of data. Mapping of the reads to the genome revealed 1.6 million transcribed regions (exons) of significant expression. A preliminary analysis of gene expression shows 55.2% of the transcribed loci to have significant differential expression between tumor and normal. Network-node, non-coding RNA, and statistical analyses are currently ongoing. In a further interim analysis, we bioinformatically identified several interchromosomal fusions that were present in a majority of the tumors but were absent in the normal samples. RT-PCR validation of these candidate fusions in a larger validation cohort of TNBC tumors and normal breast tissues is ongoing. A multitude of additional analyses including but not limited to: novel transcripts, alternative splicing, and presence of viral genes are also planned. Citation Format: {Authors}. {Abstract title} [abstract]. In: Proceedings of the 101st Annual Meeting of the American Association for Cancer Research; 2010 Apr 17-21; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2010;70(8 Suppl):Abstract nr 2216.
- Published
- 2010
- Full Text
- View/download PDF
14. Flipping NextGen: using biological systems to characterize NextGen sequencing technologies
- Author
-
Matthew T. Hickenbotham, Jarret Glasscock, and Ryan Richt
- Subjects
Contig ,Computer science ,Applied Mathematics ,Genomics ,Computational biology ,computer.software_genre ,Genome ,Biochemistry ,Computer Science Applications ,Transcriptome ,Structural Biology ,Meeting Abstract ,Data mining ,DNA microarray ,computer ,Molecular Biology ,Sequence (medicine) - Abstract
Finally, de-novo sequencing (i.e. without a reference) would require a minimum of 1/2 of the sequence length to be unique in order to allow sufficient contig extension in the assembly process. For example, 40–50 bp reads are necessary for de-novo characterization of these systems uniquely defined by 20–25 bp reads. As of 2009, short read NextGen sequencing technologies have moved to 50 bp and beyond, ushering in what is expected to be the start of a revolution in genomics. Conclusion These results establish a lower bound on sequence length (x) required to sufficiently conduct re-sequencing, transcriptome, and de-novo sequencing projects. The asymptotic nature of the results also provides a guide for what percentage of the total space (y) we might expect to define in genomes/transcriptomes of similar size and complexity. from UT-ORNL-KBRIN Bioinformatics Summit 2009 Pikeville, TN, USA. 20–22 March 2009
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.