Author: "Asa Ben-Hur" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Asa Ben-Hur"' showing total 121 results

Start Over Author "Asa Ben-Hur"

121 results on '"Asa Ben-Hur"'

101. A user's guide to support vector machines

Author: Asa, Ben-Hur and Jason, Weston
Subjects: Models, Statistical, Databases, Factual, Nonlinear Dynamics, Artificial Intelligence, Linear Models, Normal Distribution, Computational Biology, Data Mining, Algorithms, Software
Abstract: The Support Vector Machine (SVM) is a widely used classifier in bioinformatics. Obtaining the best results with SVMs requires an understanding of their workings and the various ways a user can influence their accuracy. We provide the user with a basic understanding of the theory behind SVMs and focus on their use in practice. We describe the effect of the SVM parameters on the resulting classifier, how to select good values for those parameters, data normalization, factors that affect training time, and software for training SVMs.
Published: 2010

102. Genome-wide analysis of alternative splicing in Chlamydomonas reinhardtii

Author: Alicia Link, Adam Labadorf, Mark F. Rogers, Asa Ben-Hur, Anireddy S. N. Reddy, and Julie Thomas
Subjects: 0106 biological sciences, lcsh:QH426-470, lcsh:Biotechnology, Chlamydomonas reinhardtii, Genomics, 01 natural sciences, 03 medical and health sciences, Exon, DNA, Algal, lcsh:TP248.13-248.65, Genetics, Gene, 030304 developmental biology, 2. Zero hunger, Expressed Sequence Tags, 0303 health sciences, Base Composition, biology, Alternative splicing, Chlamydomonas, Intron, Computational Biology, Exons, Sequence Analysis, DNA, biology.organism_classification, Introns, Alternative Splicing, lcsh:Genetics, RNA splicing, Software, 010606 plant biology & botany, Biotechnology, Research Article
Abstract: Background Genome-wide computational analysis of alternative splicing (AS) in several flowering plants has revealed that pre-mRNAs from about 30% of genes undergo AS. Chlamydomonas, a simple unicellular green alga, is part of the lineage that includes land plants. However, it diverged from land plants about one billion years ago. Hence, it serves as a good model system to study alternative splicing in early photosynthetic eukaryotes, to obtain insights into the evolution of this process in plants, and to compare splicing in simple unicellular photosynthetic and non-photosynthetic eukaryotes. We performed a global analysis of alternative splicing in Chlamydomonas reinhardtii using its recently completed genome sequence and all available ESTs and cDNAs. Results Our analysis of AS using BLAT and a modified version of the Sircah tool revealed AS of 498 transcriptional units with 611 events, representing about 3% of the total number of genes. As in land plants, intron retention is the most prevalent form of AS. Retained introns and skipped exons tend to be shorter than their counterparts in constitutively spliced genes. The splice site signals in all types of AS events are weaker than those in constitutively spliced genes. Furthermore, in alternatively spliced genes, the prevalent splice form has a stronger splice site signal than the non-prevalent form. Analysis of constitutively spliced introns revealed an over-abundance of motifs with simple repetitive elements in comparison to introns involved in intron retention. In almost all cases, AS results in a truncated ORF, leading to a coding sequence that is around 50% shorter than the prevalent splice form. Using RT-PCR we verified AS of two genes and show that they produce more isoforms than indicated by EST data. All cDNA/EST alignments and splice graphs are provided in a website at http://combi.cs.colostate.edu/as/chlamy. Conclusions The extent of AS in Chlamydomonas that we observed is much smaller than observed in land plants, but is much higher than in simple unicellular heterotrophic eukaryotes. The percentage of different alternative splicing events is similar to flowering plants. Prevalence of constitutive and alternative splicing in Chlamydomonas, together with its simplicity, many available public resources, and well developed genetic and molecular tools for this organism make it an excellent model system to elucidate the mechanisms involved in regulated splicing in photosynthetic eukaryotes.
Published: 2010

103. A Promiscuous Prion: Efficient Induction of [URE3] Prion Formation by Heterologous Prion Domains

Author: Michael Hamilton, Eric D. Ross, Blake R. McCarty, Carley D. Ross, and Asa Ben-Hur
Subjects: Amyloid, Saccharomyces cerevisiae Proteins, Prions, Saccharomyces cerevisiae, Blotting, Western, Plasma protein binding, Investigations, medicine.disease_cause, Genetics, medicine, Binding site, chemistry.chemical_classification, Mutation, Glutathione Peroxidase, Binding Sites, biology, Ure2, biology.organism_classification, Fungal prion, Amino acid, Phenotype, Biochemistry, chemistry, Peptide Termination Factors, Protein Binding
Abstract: The [URE3] and [PSI+] prions are the infections amyloid forms of the Saccharomyces cerevisiae proteins Ure2p and Sup35p, respectively. Randomizing the order of the amino acids in the Ure2 and Sup35 prion domains while retaining amino acid composition does not block prion formation, indicating that amino acid composition, not primary sequence, is the predominant feature driving [URE3] and [PSI+] formation. Here we show that Ure2p promiscuously interacts with various compositionally similar proteins to influence [URE3] levels. Overexpression of scrambled Ure2p prion domains efficiently increases de novo formation of wild-type [URE3] in vivo. In vitro, amyloid aggregates of the scrambled prion domains efficiently seed wild-type Ure2p amyloid formation, suggesting that the wild-type and scrambled prion domains can directly interact to seed prion formation. To test whether interactions between Ure2p and naturally occurring yeast proteins could similarly affect [URE3] formation, we identified yeast proteins with domains that are compositionally similar to the Ure2p prion domain. Remarkably, all but one of these domains were also able to efficiently increase [URE3] formation. These results suggest that a wide variety of proteins could potentially affect [URE3] formation.
Published: 2009

104. A User’s Guide to Support Vector Machines

Author: Jason Weston and Asa Ben-Hur
Subjects: Computer Science::Machine Learning, Structured support vector machine, business.industry, Computer science, Machine learning, computer.software_genre, Support vector machine, Statistics::Machine Learning, ComputingMethodologies_PATTERNRECOGNITION, Kernel method, Computer Science::Sound, Least squares support vector machine, Margin classifier, Sequential minimal optimization, Artificial intelligence, business, Classifier (UML), computer
Abstract: The Support Vector Machine (SVM) is a widely used classifier in bioinformatics. Obtaining the best results with SVMs requires an understanding of their workings and the various ways a user can influence their accuracy. We provide the user with a basic understanding of the theory behind SVMs and focus on their use in practice. We describe the effect of the SVM parameters on the resulting classifier, how to select good values for those parameters, data normalization, factors that affect training time, and software for training SVMs.
Published: 2009
Full Text: View/download PDF

105. The use of gene ontology evidence codes in preventing classifier assessment bias

Author: Asa Ben-Hur and Mark F. Rogers
Subjects: Statistics and Probability, Computer science, Biological database, computer.software_genre, Machine learning, Biochemistry, Software, Databases, Protein, Molecular Biology, Gene, business.industry, Gene ontology, A protein, Computational Biology, Proteins, Biological process, Computer Science Applications, Computational Mathematics, Computational Theory and Mathematics, Genes, Artificial intelligence, Data mining, business, Classifier (UML), computer
Abstract: Motivation: The biological community's reliance on computational annotations of protein function makes correct assessment of function prediction methods an issue of great importance. The fact that a large fraction of the annotations in current biological databases are based on computational methods can lead to bias in estimating the accuracy of function prediction methods. This can happen since predicting an annotation that was derived computationally in the first place is likely easier than predicting annotations that were derived experimentally, leading to over-optimistic classifier performance estimates. Results: We illustrate this phenomenon in a set of controlled experiments using a nearest neighbor classifier that uses PSI-BLAST similarity scores. Our results demonstrate that the source of Gene Ontology (GO) annotations used to assess a protein function predictor can have a highly significant influence on classifier accuracy: the average accuracy over four species and over GO terms in the biological process namespace increased from 0.72 to 0.87 when the classifier was given access to annotations that are assigned evidence codes that indicate a possible computational source, instead of experimentally determined annotations. Slightly smaller increases were observed in the other namespaces. In these comparisons the total number of annotations and their distribution across GO terms were kept the same. Conclusion: In conclusion, taking into account GO evidence codes is required for reporting accuracy statistics that do not overestimate a model's performance, and is of particular importance for a fair comparison of classifiers that rely on different information sources. Contact: rogersma@cs.colostate.edu Supplementary information: Supplementary data are available at Bioinformatics online.
Published: 2009

106. Design and Analysis of the NIPS2003 Challenge

Author: Asa Ben Hur, Steve R. Gunn, Gideon Dror, and Isabelle Guyon
Subjects: business.industry, Computer science, Feature selection, Machine learning, computer.software_genre, Random forest, Support vector machine, Set (abstract data type), Kernel method, Ranking, Test set, Benchmark (computing), Artificial intelligence, business, computer
Abstract: We organized in 2003 a benchmark of feature selection methods, whose results are summarized and analyzed in this chapter. The top ranking entrants of the competition describe their methods and results in more detail in the following chapters. We provided participants with five datasets from different application domains and called for classification results using a minimal number of features. Participants were asked to make on-line submissions on two test sets: a validation set and a “final” test set, with performance on the validation set being presented immedi to the participant and performance on the final test set presented at the end of the competition. The competition took place over a period of 13 weeks and attracted 78 research groups. In total 1863 entries were made on the validation sets during the development period and 135 entries on all test sets for the final competition. The winners used a combination of Bayesian neural networks with ARD priors and Dirichlet diffusion trees. Other top entries used a variety of methods for feature selection, which combined filters and/or wrapper or embedded methods using Random Forests, kernel methods, neural networks as classification engine. The classification engines most often used after feature selection are regularized kernel methods, including SVMs. The results of the benchmark (including the predictions made by the participants and the features they selected) and the scoring software are publicly available. The benchmark is available at http://www.nipsfsc.ecs.soton.ac.uk/ for post-challenge submissions to stimulate further research.
Published: 2008
Full Text: View/download PDF

107. Sequence Motifs: Highly Predictive Features of Protein Function

Author: Asa Ben-Hur and Douglas L. Brutlag
Subjects: Support vector machine, Discriminative model, Computer science, Consensus sequence, Feature selection, Protein function prediction, Computational biology, Structural Classification of Proteins database, Sequence motif, Function (biology)
Abstract: Protein function prediction, i.e. classification of proteins according to their biological function, is an important task in bioinformatics. In this chapter, we illustrate that the presence of sequence motifs — elements that are conserved across different proteins — are highly discriminative features for predicting the function of a protein. This is in agreement with the biological thinking that considers motifs to be the building blocks of protein sequences. We focus on proteins annotated as enzymes, and show that despite the fact that motif composition is a very high dimensional representation of a sequence, that most classes of enzymes can be classified using a handful of motifs, yielding accurate and interpretable classifiers. The enzyme data falls into a large number of classes; we find that the one-against-the-rest multi-class method works better than the one-against-one method on this data.
Published: 2008
Full Text: View/download PDF

108. Integrating Information for Protein Function Prediction

Author: William Stafford Noble and Asa Ben-Hur
Subjects: Mathematical optimization, Kernel (image processing), Protein function prediction, Algorithm, Mathematics, Convolution
Published: 2008
Full Text: View/download PDF

109. A structural alignment kernel for protein structures

Author: William Stafford Noble, Jean-Philippe Vert, Asa Ben-Hur, Jian Qiu, Martial Hue, Department of Genome Sciences [Seattle] (GS), University of Washington [Seattle], Centre de Bioinformatique (CBIO), MINES ParisTech - École nationale supérieure des mines de Paris, and Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)
Subjects: Statistics and Probability, Computer science, MESH: Sequence Analysis, Protein, MESH: Sequence Homology, Amino Acid, Structural alignment, Molecular Sequence Data, MESH: Sequence Alignment, MESH: Algorithms, MESH: Amino Acid Sequence, Machine learning, computer.software_genre, Biochemistry, Pattern Recognition, Automated, 03 medical and health sciences, Kernel (linear algebra), String kernel, Artificial Intelligence, Sequence Analysis, Protein, MESH: Artificial Intelligence, MESH: Pattern Recognition, Automated, MESH: Proteins, Amino Acid Sequence, Molecular Biology, 030304 developmental biology, 0303 health sciences, MESH: Molecular Sequence Data, Sequence Homology, Amino Acid, business.industry, 030302 biochemistry & molecular biology, Proteins, [SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM], Computer Science Applications, Support vector machine, Computational Mathematics, Kernel method, Computational Theory and Mathematics, Radial basis function kernel, Artificial intelligence, Tree kernel, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], business, Classifier (UML), computer, Sequence Alignment, Algorithms
Abstract: Motivation: This work aims to develop computational methods to annotate protein structures in an automated fashion. We employ a support vector machine (SVM) classifier to map from a given class of structures to their corresponding structural (SCOP) or functional (Gene Ontology) annotation. In particular, we build upon recent work describing various kernels for protein structures, where a kernel is a similarity function that the classifier uses to compare pairs of structures.Results: We describe a kernel that is derived in a straightforward fashion from an existing structural alignment program, MAMMOTH. We find in our benchmark experiments that this kernel significantly out-performs a variety of other kernels, including several previously described kernels. Furthermore, in both benchmarks, classifying structures using MAMMOTH alone does not work as well as using an SVM with the MAMMOTH kernel.Availability: http://noble.gs.washington.edu/proj/3dkernelContact: noble@gs.washington.edu
Published: 2007
Full Text: View/download PDF

110. Detecting Stable Clusters Using Principal Component Analysis

Author: Isabelle Guyon and Asa Ben-Hur
Subjects: business.industry, Principal component analysis, Unsupervised learning, Pattern recognition, Artificial intelligence, Similarity measure, Cluster analysis, business, External Data Representation, Class variable, Data type, Hierarchical clustering
Abstract: Clustering is one of the most commonly used tools in the analysis of gene expression data (1, 2) . The usage in grouping genes is based on the premise that co-expression is a result of co-regulation. It is thus a preliminary step in extracting gene networks and inference of gene function (3, 4) . Clustering of experiments can be used to discover novel phenotypic aspects of cells and tissues (3, 5, 6) , including sensitivity to drugs (7) , and can also detect artifacts of experimental conditions (8) . Clustering and its applications in biology are presented in greater detail in the chapter by Zhao and Karypis (see also (9) ). While we focus on gene expression data in this chapter, the methodology presented here is applicable for other types of data as well. Clustering is a form of unsupervised learning, i.e. no information on the class variable is assumed, and the objective is to find the “natural” groups in the data. However, most clustering algorithms generate a clustering even if the data has no inherent cluster structure, so external validation tools are required. Given a set of partitions of the data into an increasing number of clusters (e.g. by a hierarchical clustering algorithm, or k-means), such a validation tool will tell the user the number of clusters in the data (if any). Many methods have been proposed in the literature to address this problem (10–15) . Recent studies have shown the advantages of sampling-based methods (12, 14) . These methods are based on the idea that when a partition has captured the structure in the data, this partition should be stable with respect to perturbation of the data. Bittner et al. (16) used a similar approach to validate clusters representing gene expression of melanoma patients. The emergence of cluster structure depends on several choices: data representation and normalization, the choice of a similarity measure and clustering algorithm. In this chapter we extend the stability-based validation of cluster structure, and propose stability as a figure of merit that is useful for comparing clustering solutions, thus helping in making these choices. We use this framework to demonstrate the ability of Principal Component Analysis (PCA) to extract features relevant to the cluster structure. We use stability as a tool for simultaneously choosing the number of principal components and the number of clusters; we compare the performance of different similarity measures and normalization schemes. The approach is demonstrated through a case study of yeast gene expression data from Eisen et al. (1) . For yeast, a functional classification of a large number of genes is known, and we use this classification for validating the results produced by clustering. A method for comparing clustering solutions specifically applicable to gene expression data was introduced in (17) . However, it cannot be used to choose the number of clusters, and is not directly applicable in choosing the number of principal components. The results of clustering are easily corrupted by the addition of noise: even a few
Published: 2003
Full Text: View/download PDF

111. CREME: a framework for identifying cis-regulatory modules in human-mouse conserved segments

Author: Asa Ben-Hur, Ivan Ovcharenko, Richard M. Karp, and Roded Sharan
Subjects: Statistics and Probability, Sequence analysis, Response element, Biology, Regulatory Sequences, Nucleic Acid, Biochemistry, Mice, User-Computer Interface, Animals, Cluster Analysis, Molecular Biology, Gene, Transcription factor, Conserved Sequence, Cis-regulatory module, Genetics, Gene Expression Profiling, Cell Cycle, Promoter, Sequence Analysis, DNA, Computer Science Applications, Computational Mathematics, Oxidative Stress, Computational Theory and Mathematics, Gene Expression Regulation, Regulatory sequence, Sequence Alignment, Function (biology), Algorithms, Software, Transcription Factors
Abstract: Motivation: The binding of transcription factors to specific regulatory sequence elements is a primary mechanism for controlling gene transcription. Recent findings suggest a modular organization of binding sites for transcription factors that cooperate in the regulation of genes. In this work we establish a framework for finding recurrent cis-regulatory modules in the promoters of a selected set of genes and scoring their statistical significance. Results: Proceeding from a database of identified binding site motifs and their genomic locations we seek motifs whose frequency in the selected promoters is different than in a background promoter set. We present several statistical tests designed for this purpose. We provide a hashing algorithm for detecting combinations of these motifs that co-occur in clusters within the selected promoters. The significance of such co-occurrences is evaluated using novel statistical scores. Our methods are combined in CREME, a suite of software which includes a browser for viewing the pattern of occurrence of selected cis-regulatory modules. We applied our methodology to find modules within human-mouse conserved promoter segments, focusing on cell cycle regulated genes and stress response related genes. To validate the biological significance of the identified modules we tested whether the associated genes tended to be co-expressed or share similar function. In the cell cycle set five of the seven identified sets of genes were coherently expressed. On the stress response data four of the six detected sets fell predominantly into well-defined functional sub-categories. Availability: http://icsi.berkeley.edu/~roded/creme.html Contact: roded@icsi.berkeley.edu. Keywords: Cis-regulatory module, transcription factor binding site, motif cluster, statistical test. *To whom correspondence should be addressed. †These authors contributed equally to this work.
Published: 2003

112. On probabilistic analog automata

Author: Alexander Roitershtein, Asa Ben-Hur, and Hava T. Siegelmann
Subjects: FOS: Computer and information sciences, TheoryofComputation_COMPUTATIONBYABSTRACTDEVICES, General Computer Science, Generalization, Computer science, Computation, Other Computer Science (cs.OH), Theoretical Computer Science, Reduction (complexity), Regular language, Computer Science - Other Computer Science, Markov operators, Quantum finite automata, Probabilistic automata, Definite languages, Probabilistic logic, Noisy computational systems, F.1.1, F.1.2, Regular languages, Nonlinear Sciences::Cellular Automata and Lattice Gases, Automaton, Algebra, Probabilistic automaton, Automata theory, Probabilistic computation, Computer Science::Formal Languages and Automata Theory, Computer Science(all)
Abstract: We consider probabilistic automata on a general state space and study their computational power. The model is based on the concept of language recognition by probabilistic automata due to Rabin (Inform. Control 3 (1963) 230) and models of analog computation in a noisy environment suggested by Maass and Orponen (Neural Comput. 10 (1998) 1071), and Maass and Sontag (Neural Comput. 11 (1999) 771). Our main result is a generalization of Rabin's reduction theorem that implies that under very mild conditions, the computational power of such automata is limited to regular languages.
Published: 2003

113. Detecting stable clusters using principal component analysis

Author: Asa, Ben-Hur and Isabelle, Guyon
Subjects: Principal Component Analysis, Gene Expression Profiling, Yeasts, Cluster Analysis, Algorithms, Oligonucleotide Array Sequence Analysis
Published: 2003

114. A support vector clustering method

Author: Vladimir Vapnik, Hava T. Siegelmann, Asa Ben-Hur, and David Horn
Subjects: Clustering high-dimensional data, Kernel method, Data stream clustering, Fuzzy clustering, CURE data clustering algorithm, business.industry, Correlation clustering, FLAME clustering, Pattern recognition, Artificial intelligence, business, Cluster analysis, Mathematics
Abstract: We present a novel kernel method for data clustering using a description of the data by support vectors. The kernel reflects a projection of the data points from data space to a high dimensional feature space. Cluster boundaries are defined as spheres in feature space, which represent complex geometric shapes in data space. We utilize this geometric representation of the data to construct a simple clustering algorithm.
Published: 2002
Full Text: View/download PDF

115. Macroscopic Molecular Computation with Gene Networks

Author: Asa Ben-Hur and Hava T. Siegelmann
Subjects: Very-large-scale integration, Turing machine, symbols.namesake, Theoretical computer science, Neuromorphic engineering, Artificial neural network, Computer science, Process (engineering), Computation, Gene regulatory network, symbols, Quantum computer
Abstract: In recent years scientists have been looking for new paradigms for constructing computational devices. These include quantum computation, DNA computation, neural networks, neuromorphic engineering and other analog VLSI devices. Since the 60’s genetic regulatory systems are thought of as “circuits” or “networks” of interacting components. The genetic material is the “program” that guides protein production in a cell. Protein levels determine the evolution of the network at subsequent times, and thus serve as its “memory”. This analogy between computing and the process of gene expression was pointed out in various papers. Bray suggests that protein based circuits are the device by which unicellular organisms react to their environment, instead of a nervous system. However, until recently this was only a useful metaphor for describing gene networks. Recent papers describe the successful fabrication of synthetic networks, i.e. programming of a gene network. Furthermore, it was shown both theoretically and experimentally that chemical reactions can be used to implement Boolean logic and neural networks.
Published: 2001
Full Text: View/download PDF

116. Computational complexity for continuous-time dynamics

Author: Shmuel Fishman, Asa Ben-Hur, and Hava T. Siegelmann
Subjects: Discrete system, Turing machine, symbols.namesake, Theoretical computer science, DTIME, Discrete time and continuous time, Computer science, Model of computation, Computation, Theory of computation, symbols, Symbolic computation
Abstract: Dissipative flows model a large variety of physical systems. In this Letter the evolution of such systems is interpreted as a process of computation; the attractor of the dynamics represents the output. A framework for an algorithmic analysis of dissipative flows is presented, enabling the comparison of the performance of discrete and continuous time analog computation models. A simple algorithm for finding the maximum of n numbers is analyzed, and shown to be highly efficient. The notion of tractable (polynomial) computation in the Turing model is conjectured to correspond to computation with tractable (analytically solvable) dynamical systems having polynomial complexity. The computation of a digital computer, and its mathematical abstraction, the Turing machine is described by a map on a discrete configuration space. In recent years scientists have developed new approaches to computation, some of them based on continuous time analog systems. The most promising are neuromorphic systems [1], models of human memory [2], and experimentally realizable quantum computers [3]. Although continuous time systems are widespread in experimental realizations, no theory exists for their algorithmic analysis. The standard theory of computation and computational complexity [4] deals with computation in discrete time and in a discrete configuration space, and is inadequate for the description of such systems. This Letter describes an attempt to fill this gap. Our model of a computer is based on dissipa
Published: 2000
Full Text: View/download PDF

117. Universality in Sandpile Models

Author: Asa Ben-Hur and Ofer Biham
Subjects: Nonlinear Sciences::Adaptation and Self-Organizing Systems, Statistical Mechanics (cond-mat.stat-mech), Abelian sandpile model, Condensed Matter::Statistical Mechanics, FOS: Physical sciences, Statistical physics, Renormalization group, Nonlinear Sciences::Cellular Automata and Lattice Gases, Condensed Matter - Statistical Mechanics, Mathematics, Universality (dynamical systems), Mathematical physics
Abstract: A new classification of sandpile models into universality classes is presented. On the basis of extensive numerical simulations, in which we measure an extended set of exponents, the Manna two state model [S. S. Manna, J. Phys. A 24, L363 (1991)] is found to belong to a universality class of random neighbor models which is distinct from the universality class of the original model of Bak, Tang and Wiesenfeld [P. Bak, C. Tang and K. Wiensenfeld, Phys. Rev. Lett. 59, 381 (1987)]. Directed models are found to belong to a universality class which includes the directed model introduced and solved by Dhar, Comment: 13 pages of text, RevTeX, additional 3 figures in 5 PS files
Published: 1998
Full Text: View/download PDF

118. Kernel methods for predicting protein–protein interactions.

Author: Asa Ben-Hur and William Stafford Noble
Published: 2005
Full Text: View/download PDF

119. Computation in gene networks.

Author: Asa Ben-Hur, S. and Siegelmann, Hava T.
Subjects: GENETIC regulation, CELLULAR control mechanisms, PHYSIOLOGICAL control systems, MOLECULAR genetics, BIOSYNTHESIS, GENETICS
Abstract: Genetic regulatory networks have the complex task of controlling all aspects of life. Using a model of gene expression by piecewise linear differential equations we show that this process can be considered as a process of computation. This is demonstrated by showing that this model can simulate memory bounded Turing machines. The simulation is robust with respect to perturbations of the system, an important property for both analog computers and biological systems. Robustness is achieved using a condition that ensures that the model equations, that are generally chaotic, follow a predictable dynamics. © 2004 American Institute of Physics. [ABSTRACT FROM AUTHOR]
Published: 2004
Full Text: View/download PDF

120. SpliceGrapher: detecting patterns of alternative splicing from RNA-Seq data in the context of gene models and EST data

Author: Julie Thomas, Mark F. Rogers, Asa Ben-Hur, and Anireddy S. N. Reddy
Subjects: 0106 biological sciences, Arabidopsis, Method, Context (language use), RNA-Seq, Computational biology, Biology, 01 natural sciences, Genome, 03 medical and health sciences, Humans, splice, natural sciences, Gene, 030304 developmental biology, Expressed Sequence Tags, Genetics, 0303 health sciences, Expressed sequence tag, Genome, Human, Alternative splicing, Computational Biology, food and beverages, Alternative Splicing, Human genome, RNA Splice Sites, Databases, Nucleic Acid, Algorithms, Software, 010606 plant biology & botany
Abstract: We propose a method for predicting splice graphs that enhances curated gene models using evidence from RNA-Seq and EST alignments. Results obtained using RNA-Seq experiments in Arabidopsis thaliana show that predictions made by our SpliceGrapher method are more consistent with current gene models than predictions made by TAU and Cufflinks. Furthermore, analysis of plant and human data indicates that the machine learning approach used by SpliceGrapher is useful for discriminating between real and spurious splice sites, and can improve the reliability of detection of alternative splicing. SpliceGrapher is available for download at http://SpliceGrapher.sf.net.
Full Text: View/download PDF

121. InSite: a computational method for identifying protein-protein interaction binding sites on a proteome-wide scale

Author: Qian-Ru Li, Asa Ben-Hur, Haidong Wang, Eran Segal, Marc Vidal, and Daphne Koller
Subjects: Proteomics, Proteome, Method, Saccharomyces cerevisiae, Biology, Protein–protein interaction, 03 medical and health sciences, Data sequences, Two-Hybrid System Techniques, Protein Interaction Mapping, Humans, Binding site, Databases, Protein, 030304 developmental biology, Genetics, Electronic Data Processing, 0303 health sciences, Polymorphism, Genetic, Gene ontology, Gene Expression Profiling, 030302 biochemistry & molecular biology, Protein pair, Computational Biology, computer.file_format, Protein Data Bank, Human genetics, computer, Algorithms, Software, Protein Binding
Abstract: InSite is a computational method that integrates high-throughput protein and sequence data to infer the specific binding regions of interacting protein pairs., We propose InSite, a computational method that integrates high-throughput protein and sequence data to infer the specific binding regions of interacting protein pairs. We compared our predictions with binding sites in Protein Data Bank and found significantly more binding events occur at sites we predicted. Several regions containing disease-causing mutations or cancer polymorphisms in human are predicted to be binding for protein pairs related to the disease, which suggests novel mechanistic hypotheses for several diseases.
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

121 results on '"Asa Ben-Hur"'

101. A user's guide to support vector machines

102. Genome-wide analysis of alternative splicing in Chlamydomonas reinhardtii

103. A Promiscuous Prion: Efficient Induction of [URE3] Prion Formation by Heterologous Prion Domains

104. A User’s Guide to Support Vector Machines

105. The use of gene ontology evidence codes in preventing classifier assessment bias

106. Design and Analysis of the NIPS2003 Challenge

107. Sequence Motifs: Highly Predictive Features of Protein Function

108. Integrating Information for Protein Function Prediction

109. A structural alignment kernel for protein structures

110. Detecting Stable Clusters Using Principal Component Analysis

111. CREME: a framework for identifying cis-regulatory modules in human-mouse conserved segments

112. On probabilistic analog automata

113. Detecting stable clusters using principal component analysis

114. A support vector clustering method

115. Macroscopic Molecular Computation with Gene Networks

116. Computational complexity for continuous-time dynamics

117. Universality in Sandpile Models

118. Kernel methods for predicting protein–protein interactions.

119. Computation in gene networks.

120. SpliceGrapher: detecting patterns of alternative splicing from RNA-Seq data in the context of gene models and EST data

121. InSite: a computational method for identifying protein-protein interaction binding sites on a proteome-wide scale

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

121 results on '"Asa Ben-Hur"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources