12 results on '"nucleic acid database"'
Search Results
2. DNA Simulation Benchmarks as Revealed by X-Ray Structures
- Author
-
Olson, Wilma K., Colasanti, Andrew V., Li, Yun, Ge, Wei, Zheng, Guohui, Zhurkin, Victor B., Šponer, J., editor, and Lankaš, F., editor
- Published
- 2006
- Full Text
- View/download PDF
3. Characterization of Gel Separated Proteins
- Author
-
Bergman, A.-CH., Oppermann, M., Oppermann, U., Jörnvall, H., Bergman, T., Kamp, Roza Maria, editor, Kyriakidis, Dimitris, editor, and Choli-Papadopoulou, Theodora, editor
- Published
- 2000
- Full Text
- View/download PDF
4. RNA on the Web
- Author
-
Szymański, M., Clark, B. F. C., Barciszewski, J., Barciszewski, Jan, editor, and Clark, Brian F. C., editor
- Published
- 1999
- Full Text
- View/download PDF
5. Characterization of new Schistosoma mansoni microsatellite loci in sequences obtained from public DNA databases and microsatellite enriched genomic libraries
- Author
-
NB Rodrigues, PT LoVerde, AJ Romanha, and G Oliveira
- Subjects
Schistosoma mansoni ,microsatellite repeats ,polymorphism ,DNA sequence analysis ,nucleic acid database ,computational biology ,Microbiology ,QR1-502 ,Infectious and parasitic diseases ,RC109-216 - Abstract
In the last decade microsatellites have become one of the most useful genetic markers used in a large number of organisms due to their abundance and high level of polymorphism. Microsatellites have been used for individual identification, paternity tests, forensic studies and population genetics. Data on microsatellite abundance comes preferentially from microsatellite enriched libraries and DNA sequence databases. We have conducted a search in GenBank of more than 16,000 Schistosoma mansoni ESTs and 42,000 BAC sequences. In addition, we obtained 300 sequences from CA and AT microsatellite enriched genomic libraries. The sequences were searched for simple repeats using the RepeatMasker software. Of 16,022 ESTs, we detected 481 (3%) sequences that contained 622 microsatellites (434 perfect, 164 imperfect and 24 compounds). Of the 481 ESTs, 194 were grouped in 63 clusters containing 2 to 15 ESTs per cluster. Polymorphisms were observed in 16 clusters. The 287 remaining ESTs were orphan sequences. Of the 42,017 BAC end sequences, 1,598 (3.8%) contained microsatellites (2,335 perfect, 287 imperfect and 79 compounds). The 1,598 BAC end sequences 80 were grouped into 17 clusters containing 3 to 17 BAC end sequences per cluster. Microsatellites were present in 67 out of 300 sequences from microsatellite enriched libraries (55 perfect, 38 imperfect and 15 compounds). From all of the observed loci 55 were selected for having the longest perfect repeats and flanking regions that allowed the design of primers for PCR amplification. Additionally we describe two new polymorphic microsatellite loci.
- Published
- 2002
- Full Text
- View/download PDF
6. Bioinformatics database infrastructure for biotechnology research
- Author
-
Whitfield, Eleanor J., Pruess, Manuela, and Apweiler, Rolf
- Subjects
- *
BIOINFORMATICS , *COMPUTERS in medicine , *INFORMATION storage & retrieval systems , *GENOMICS - Abstract
Abstract: Many databases are available that provide valuable data resources for the biotechnological researcher. According to their core data, they can be divided into different types. Some databases provide primary data, like all published nucleotide sequences, others deal with protein sequences. In addition to these two basic types of databases, a huge number of more specialized resources are available, like databases about protein structures, protein identification, special features of genes and/or proteins, or certain organisms. Furthermore, some resources offer integrated views on different types of data, allowing the user to do easy customised queries over large datasets and to compare different types of data. [Copyright &y& Elsevier]
- Published
- 2006
- Full Text
- View/download PDF
7. The Protein Identification Resource (PIR): An On-Line Computer System for the Characterization of Proteins Based on Comparisons with Previously Characterized Protein Sequences
- Author
-
George, David G., Barker, Winona C., Hunt, Lois T., and L’Italien, James J., editor
- Published
- 1987
- Full Text
- View/download PDF
8. Overlap and diversity in antimicrobial peptide databases: Compiling a non-redundant set of sequences
- Author
-
Longendri Aguilera-Mendoza, Jun Liu, Roberto Tellez-Ibarra, Jesús Salgado, Stephen J. Barigye, Monica T. Llorente-Quesada, and Yovani Marrero-Ponce
- Subjects
Statistics and Probability ,Similarity (geometry) ,Computer science ,Sequence analysis ,Antimicrobial peptides ,Peptaibol ,Peptide ,computer.software_genre ,Procedures ,Biochemistry ,Set (abstract data type) ,chemistry.chemical_compound ,Protein methods ,Sequence Analysis, Protein ,Redundancy (engineering) ,Humans ,Databases, Protein ,Molecular Biology ,Antimicrobial cationic peptides ,chemistry.chemical_classification ,Sequence ,Antimicrobial cationic peptide ,Database ,Sequence database ,Computer Science Applications ,Algorithm ,Computational Mathematics ,Chemistry ,Protein database ,Computational Theory and Mathematics ,chemistry ,Data mining ,Nucleic acid database ,Databases, Nucleic Acid ,computer ,Software ,Algorithms ,Human - Abstract
Motivation: The large variety of antimicrobial peptide (AMP) databases developed to date are characterized by a substantial overlap of data and similarity of sequences. Our goals are to analyze the levels of redundancy for all available AMP databases and use this information to build a new non-redundant sequence database. For this purpose, a new software tool is introduced. Results: A comparative study of 25 AMP databases reveals the overlap and diversity among them and the internal diversity within each database. The overlap analysis shows that only one database (Peptaibol) contains exclusive data, not present in any other, whereas all sequences in the LAMP_Patent database are included in CAMP_Patent. However, the majority of databases have their own set of unique sequences, as well as some overlap with other databases. The complete set of non-duplicate sequences comprises 16 990 cases, which is almost half of the total number of reported peptides. On the other hand, the diversity analysis identifies the most and least diverse databases and proves that all databases exhibit some level of redundancy. Finally, we present a new parallel-free software, named Dover Analyzer, developed to compute the overlap and diversity between any number of databases and compile a set of non-redundant sequences. These results are useful for selecting or building a suitable representative set of AMPs, according to specific needs. Availability and implementation: The regularly updated non-redundant sequence databases and the Dover Analyzer software to perform custom analysis are available at http://mobiosd-hub.com/doveranalyzer/. Contact: ymarrero77@yahoo.es Supplementary information: Supplementary data are available at Bioinformatics online.
- Published
- 2015
9. Molecular analysis of Fanconi anemia: the experience of the Bone Marrow Failure Study Group of the Italian Association of Pediatric Onco-Hematology
- Author
-
Fabio Corsolini, Roberta Bottega, Federico Verzegnassi, Adriana Borriello, Elena Nicchia, Silverio Perrotta, Simona Cavani, Marta Pillon, Paola Grammatico, Johanna Svahn, Fulvio Della Ragione, Walter Barberi, Chiara Greco, Anna Locasciulli, Maria Criscuolo, Enrico Cappelli, Ugo Ramenghi, Piero Farruggia, Gabriella Casazza, Daniela Longoni, Fabio Tucci, Chiara Cugno, Daniela De Rocco, Cristina Mecucci, Anna Savoia, Helmut Hanenberg, Carlo Dufour, De Rocco, D, Bottega, R, Cappelli, E, Cavani, S, Criscuolo, M, Nicchia, E, Corsolini, F, Greco, C, Borriello, Adriana, Svahn, J, Pillon, M, Mecucci, C, Casazza, G, Verzegnassi, F, Cugno, C, Locasciulli, A, Farruggia, P, Longoni, D, Ramenghi, U, Barberi, W, Tucci, F, Perrotta, Silverio, Grammatico, P, Hanenberg, H, DELLA RAGIONE, Fulvio, Dufour, C, Savoia, A, Bone Marrow Failure Study Group of the Italian Association of Pediatric Onco, Hematology, DE ROCCO, Daniela, Bottega, Roberta, Nicchia, Elena, Borriello, A, Perrotta, S, Della Ragione, F, and Savoia, Anna
- Subjects
Candidate gene ,gene amplification ,genotype ,cytogenetics and molecular genetics ,genetic analysis ,Bioinformatics ,Western blotting ,hematopoietic stem cell ,bone marrow failure ,fanconi anemia ,Cohort Studies ,genetic heterogeneity ,single nucleotide polymorphism ,FANCG ,Fanconi anemia ,hemic and lymphatic diseases ,Genotype ,genetics ,gene mutation ,DNA extraction ,Genetics ,biology ,pathogenesis ,Fanconi anemia group A protein ,Articles ,bioinformatics ,cell line ,genetic screening ,Hematology ,cohort analysis ,Fanconi Anemia Complementation Group Proteins ,founder effect ,Italy ,nucleic acid database ,Errata Corrige ,Databases, Nucleic Acid ,amino acid substitution ,Fanconi anemia group C protein ,Fanconi anemia group D2 protein ,Fanconi anemia group G protein ,Fanconi anemia proteinarticle ,bone marrow depression ,controlled study ,gene sequence ,human ,human cell ,missense mutation ,molecular diagnosis ,molecular genetics ,protein analysis ,mosaicism ,mutation ,congenital, hereditary, and neonatal diseases and abnormalities ,Biology ,Polymorphism, Single Nucleotide ,FANCD2 ,medicine ,Humans ,Genetic heterogeneity ,Computational Biology ,nutritional and metabolic diseases ,medicine.disease ,FANCA ,FANCB - Abstract
Fanconi anemia is an inherited disease characterized by congenital malformations, pancytopenia, cancer predisposition, and sensitivity to cross-linking agents. The molecular diagnosis of Fanconi anemia is relatively complex for several aspects including genetic heterogeneity with mutations in at least 16 different genes. In this paper, we report the mutations identified in 100 unrelated probands enrolled into the National Network of the Italian Association of Pediatric Hematoly and Oncology. In approximately half of these cases, mutational screening was carried out after retroviral complementation analyses or protein analysis. In the other half, the analysis was performed on the most frequently mutated genes or using a next generation sequencing approach. We identified 108 distinct variants of the FANCA, FANCG, FANCC, FANCD2, and FANCB genes in 85, 9, 3, 2, and 1 families, respectively. Despite the relatively high number of private mutations, 45 of which are novel Fanconi anemia alleles, 26% of the FANCA alleles are due to 5 distinct mutations. Most of the mutations are large genomic deletions and nonsense or frameshift mutations, although we identified a series of missense mutations, whose pathogenetic role was not always certain. The molecular diagnosis of Fanconi anemia is still a tiered procedure that requires identifying candidate genes to avoid useless sequencing. Introduction of next generation sequencing strategies will greatly improve the diagnostic process, allowing a rapid analysis of all the genes.
- Published
- 2014
10. DNA variant databases improve test accuracy and phenotype prediction in Alport syndrome
- Author
-
Savige, J., Ars, E., Cotton, R. G. H., Crockett, D., Dagher, H., Constantinou-Deltas, Constantinos D., Ding, J., Flinter, F., Pont-Kingdon, G., Smaoui, N., Torra, R., Storey, H., and Constantinou-Deltas, Constantinos D. [0000-0001-5549-9169]
- Subjects
fibrocystin ,Inherited renal disease ,Genetic testing ,genotype phenotype correlation ,COL4A5 gene ,Nephritis, Hereditary ,osteogenesis imperfecta ,computer.software_genre ,urologic and male genital diseases ,human variome project ,nephritis ,genetic database ,genetic variability ,pathogenicity ,genetics ,gene mutation ,cell interaction ,Genetics ,Database ,medicine.diagnostic_test ,genetic screening ,bioinformatics ,Phenotype ,X chromosome linked disorder ,female genital diseases and pregnancy complications ,unclassified drug ,COL4A5 protein, human ,priority journal ,Nephrology ,disease severity ,nucleic acid database ,focal glomerulosclerosis ,Databases, Nucleic Acid ,Collagen Type IV ,DNA database ,congenital, hereditary, and neonatal diseases and abnormalities ,Gene variant ,phenotype ,Human Variome Project ,MEDLINE ,review ,Biology ,collagen type 4 ,medicine ,otorhinolaryngologic diseases ,genomics ,Humans ,stop codon ,human ,Alport syndrome ,gene ,missense mutation ,DNA ,medicine.disease ,amino acid sequence ,kidney failure ,collagen type 4A5 ,Pediatrics, Perinatology and Child Health ,Leiden Open Variation Database ,X chromosome inactivation ,computer ,Reference genome - Abstract
X-linked Alport syndrome is a form of progressive renal failure caused by pathogenic variants in the COL4A5 gene. More than 700 variants have been described and a further 400 are estimated to be known to individual laboratories but are unpublished. The major genetic testing laboratories for X-linked Alport syndrome worldwide have established a Web-based database for published and unpublished COL4A5 variants ( https://grenada.lumc.nl/LOVD2/COL4A/home.php?select_db=COL4A5 ). This conforms with the recommendations of the Human Variome Project: it uses the Leiden Open Variation Database (LOVD) format, describes variants according to the human reference sequence with standardized nomenclature, indicates likely pathogenicity and associated clinical features, and credits the submitting laboratory. The database includes non-pathogenic and recurrent variants, and is linked to another COL4A5 mutation database and relevant bioinformatics sites. Access is free. Increasing the number of COL4A5 variants in the public domain helps patients, diagnostic laboratories, clinicians, and researchers. The database improves the accuracy and efficiency of genetic testing because its variants are already categorized for pathogenicity. The description of further COL4A5 variants and clinical associations will improve our ability to predict phenotype and our understanding of collagen IV biochemistry. The database for X-linked Alport syndrome represents a model for databases in other inherited renal diseases.
- Published
- 2014
11. Rapid match-searching for gene silencing assessment
- Author
-
Horn, Mark, Waterhouse, Peter, Horn, Mark, and Waterhouse, Peter
- Abstract
Motivation: Gene silencing, also called RNA interference, requires reliable assessment of silencer impacts. A critical task is to find matches between silencer oligomers and sites in the genome, in accordance with one-to-many matching rules (G-U matching, with provision for mismatches). Fast search algorithms are required to support silencer impact assessments in procedures for designing effective silencer sequences.Results: The article presents a matching algorithm and data structures specialized for matching searches, including a kernel procedure that addresses a Boolean version of the database task called the skyline search. Besides exact matches, the algorithm is extended to allow for the location-specific mismatches applicable in plants. Computational tests show that the algorithm is significantly faster than suffix-tree alternatives. © The Author 2010. Published by Oxford University Press. All rights reserved.
- Published
- 2010
12. Bioinformatics issues for automating the annotation of genomic sequences
- Author
-
Carter, K., Oka, A., Tamiya, G., Bellgard, M. I., Carter, K., Oka, A., Tamiya, G., and Bellgard, M. I.
- Abstract
Free to read at publisher's site. The rapid explosion in the amount of biological data being generated worldwide is surpassing efforts to manage analysis of the data. As part of an ongoing project to automate and manage bioinformatics analysis, the authors have designed and implemented a simple automated annotation system, which is described in this paper. The system is applied to existing GenBank/DDBJ/EMBL entries and compared with existing annotations to illustrate not only potential errors but also that they are generally not up-to-date, as a result of new versions of analysis tools and updates of genomic repositories. We highlight the important Bioinformatics issues of storage and management of information to ensure data and results are kept up-to-date in light of new information becoming available. Surprisingly, from just four database entries, a significant number of new features were found. We describe the results as well as identify important issues that need to be addressed in order to automate the re-analysis/re-annotation of genomic sequences within a reasonable timeframe.
- Published
- 2001
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.