Author: "Ivan Borozan" / Journal: bmc bioinformatics - Searchworks@Jio Institute Digital Library Search Results

1. MAID : An effect size based model for microarray data integration across laboratories and platforms

Author: Aled M. Edwards, Michael A. Katze, Limin Chen, Ian D. McGilvray, Jenny Heathcote, Ivan Borozan, Bryan W. Paeper, and Zhaolei Zhang
Subjects: Quality Control, Computational biology, Biology, lcsh:Computer applications to medicine. Medical informatics, computer.software_genre, Biochemistry, Statistical power, 03 medical and health sciences, 0302 clinical medicine, Meta-Analysis as Topic, Artificial Intelligence, Structural Biology, Databases, Genetic, Humans, Microarray databases, lcsh:QH301-705.5, Molecular Biology, Oligonucleotide Array Sequence Analysis, 030304 developmental biology, Genetics, 0303 health sciences, Microarray analysis techniques, Methodology Article, Data Collection, Gene Expression Profiling, Applied Mathematics, Replicate, Hepatitis C, Chronic, Reference Standards, Data Compression, Computer Science Applications, Systems Integration, Gene expression profiling, lcsh:Biology (General), Liver, Sample Size, 030220 oncology & carcinogenesis, Gene chip analysis, lcsh:R858-859.7, DNA microarray, computer, Data integration
Abstract: Background Gene expression profiling has the potential to unravel molecular mechanisms behind gene regulation and identify gene targets for therapeutic interventions. As microarray technology matures, the number of microarray studies has increased, resulting in many different datasets available for any given disease. The increase in sensitivity and reliability of measurements of gene expression changes can be improved through a systematic integration of different microarray datasets that address the same or similar biological questions. Results Traditional effect size models can not be used to integrate array data that directly compare treatment to control samples expressed as log ratios of gene expressions. Here we extend the traditional effect size model to integrate as many array datasets as possible. The extended effect size model (MAID) can integrate any array datatype generated with either single or two channel arrays using either direct or indirect designs across different laboratories and platforms. The model uses two standardized indices, the standard effect size score for experiments with two groups of data, and a new standardized index that measures the difference in gene expression between treatment and control groups for one sample data with replicate arrays. The statistical significance of treatment effect across studies for each gene is determined by appropriate permutation methods depending on the type of data integrated. We apply our method to three different expression datasets from two different laboratories generated using three different array platforms and two different experimental designs. Our results indicate that the proposed integration model produces an increase in statistical power for identifying differentially expressed genes when integrating data across experiments and when compared to other integration models. We also show that genes found to be significant using our data integration method are of direct biological relevance to the three experiments integrated. Conclusion High-throughput genomics data provide a rich and complex source of information that could play a key role in deciphering intricate molecular networks behind disease. Here we propose an extension of the traditional effect size model to allow the integration of as many array experiments as possible with the aim of increasing the statistical power for identifying differentially expressed genes.
Full Text: View/download PDF

2. CaPSID: A bioinformatics platform for computational pathogen sequence identification in human genomes and transcriptomes

Author: Fabrice Sircoulomb, Vincent Ferretti, Philippe Laflamme, Stuart Watt, Philip E. Branton, Paola Blanchette, Robert Rottapel, Paul M. Krzyzanowski, Ivan Borozan, and Shane Wilson
Subjects: Sequence analysis, Computational biology, Genome browser, Biology, lcsh:Computer applications to medicine. Medical informatics, Bioinformatics, Sensitivity and Specificity, Genome, Biochemistry, DNA sequencing, Transcriptome, 03 medical and health sciences, Structural Biology, Cell Line, Tumor, Databases, Genetic, Humans, Computer Simulation, lcsh:QH301-705.5, Molecular Biology, 030304 developmental biology, Ovarian Neoplasms, Genetics, Internet, 0303 health sciences, Genome, Human, Applied Mathematics, 030302 biochemistry & molecular biology, Computational Biology, 3. Good health, Computer Science Applications, lcsh:Biology (General), Capsid, lcsh:R858-859.7, Female, Human genome, DNA microarray, Oncogenic Viruses, Algorithms, Software
Abstract: Background It is now well established that nearly 20% of human cancers are caused by infectious agents, and the list of human oncogenic pathogens will grow in the future for a variety of cancer types. Whole tumor transcriptome and genome sequencing by next-generation sequencing technologies presents an unparalleled opportunity for pathogen detection and discovery in human tissues but requires development of new genome-wide bioinformatics tools. Results Here we present CaPSID (Computational Pathogen Sequence IDentification), a comprehensive bioinformatics platform for identifying, querying and visualizing both exogenous and endogenous pathogen nucleotide sequences in tumor genomes and transcriptomes. CaPSID includes a scalable, high performance database for data storage and a web application that integrates the genome browser JBrowse. CaPSID also provides useful metrics for sequence analysis of pre-aligned BAM files, such as gene and genome coverage, and is optimized to run efficiently on multiprocessor computers with low memory usage. Conclusions To demonstrate the usefulness and efficiency of CaPSID, we carried out a comprehensive analysis of both a simulated dataset and transcriptome samples from ovarian cancer. CaPSID correctly identified all of the human and pathogen sequences in the simulated dataset, while in the ovarian dataset CaPSID’s predictions were successfully validated in vitro.
Full Text: View/download PDF

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

2 results on '"Ivan Borozan"'

1. MAID : An effect size based model for microarray data integration across laboratories and platforms

2. CaPSID: A bioinformatics platform for computational pathogen sequence identification in human genomes and transcriptomes

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Database

2 results on '"Ivan Borozan"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources