54 results on '"Håvard Rue"'
Search Results
2. Parallelized integrated nested Laplace approximations for fast Bayesian inference
- Author
-
Lisa Gaedke-Merzhäuser, Janet van Niekerk, Olaf Schenk, and Håvard Rue
- Subjects
FOS: Computer and information sciences ,Statistics and Probability ,Computer Science - Distributed, Parallel, and Cluster Computing ,Computational Theory and Mathematics ,Distributed, Parallel, and Cluster Computing (cs.DC) ,Statistics, Probability and Uncertainty ,Statistics - Computation ,Computation (stat.CO) ,Theoretical Computer Science - Abstract
There is a growing demand for performing larger-scale Bayesian inference tasks, arising from greater data availability and higher-dimensional model parameter spaces. In this work we present parallelization strategies for the methodology of integrated nested Laplace approximations (INLA), a popular framework for performing approximate Bayesian inference on the class of Latent Gaussian models. Our approach makes use of nested OpenMP parallelism, a parallel line search procedure using robust regression in INLA's optimization phase and the state-of-the-art sparse linear solver PARDISO. We leverage mutually independent function evaluations in the algorithm as well as advanced sparse linear algebra techniques. This way we can flexibly utilize the power of today's multi-core architectures. We demonstrate the performance of our new parallelization scheme on a number of different real-world applications. The introduction of parallelism leads to speedups of a factor 10 and more for all larger models. Our work is already integrated in the current version of the open-source R-INLA package, making its improved performance conveniently available to all users., 18 pages, 7 figures
- Published
- 2022
3. A Spliced Gamma-Generalized Pareto Model for Short-Term Extreme Wind Speed Probabilistic Forecasting
- Author
-
Håvard Rue, Daniela Castro-Camilo, and Raphaël Huser
- Subjects
0106 biological sciences ,Statistics and Probability ,Mathematical optimization ,Computer science ,Astrophysics::High Energy Astrophysical Phenomena ,010603 evolutionary biology ,01 natural sciences ,Wind speed ,010104 statistics & probability ,Generalized Pareto distribution ,0101 mathematics ,Extreme value theory ,Physics::Atmospheric and Oceanic Physics ,General Environmental Science ,Wind power ,business.industry ,Applied Mathematics ,Probabilistic logic ,Pareto principle ,Agricultural and Biological Sciences (miscellaneous) ,Renewable energy ,Physics::Space Physics ,Probabilistic forecasting ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,business - Abstract
Renewable sources of energy such as wind power have become a sustainable alternative to fossil fuel-based energy. However, the uncertainty and fluctuation of the wind speed derived from its intermittent nature bring a great threat to the wind power production stability, and to the wind turbines themselves. Lately, much work has been done on developing models to forecast average wind speed values, yet surprisingly little has focused on proposing models to accurately forecast extreme wind speeds, which can damage the turbines. In this work, we develop a flexible spliced Gamma-Generalized Pareto model to forecast extreme and non-extreme wind speeds simultaneously. Our model belongs to the class of latent Gaussian models, for which inference is conveniently performed based on the integrated nested Laplace approximation method. Considering a flexible additive regression structure, we propose two models for the latent linear predictor to capture the spatio-temporal dynamics of wind speeds. Our models are fast to fit and can describe both the bulk and the tail of the wind speed distribution while producing short-term extreme and non-extreme wind speed probabilistic forecasts. Supplementary materials accompanying this paper appear online.
- Published
- 2019
4. A Hierarchical Spatiotemporal Statistical Model Motivated by Glaciology
- Author
-
Finnur Pálsson, Guðfinna Aðalgeirsdóttir, Birgir Hrafnkelsson, Giri Gopalan, Håvard Rue, Christopher K. Wikle, and Alexander H. Jarosch
- Subjects
0106 biological sciences ,Statistics and Probability ,Emulation ,Computer science ,Applied Mathematics ,Physical system ,Inference ,Statistical model ,Solver ,010603 evolutionary biology ,01 natural sciences ,Agricultural and Biological Sciences (miscellaneous) ,010104 statistics & probability ,Linear algebra ,0101 mathematics ,Statistics, Probability and Uncertainty ,Scenario testing ,Uncertainty quantification ,General Agricultural and Biological Sciences ,Algorithm ,General Environmental Science - Abstract
In this paper, we extend and analyze a Bayesian hierarchical spatiotemporal model for physical systems. A novelty is to model the discrepancy between the output of a computer simulator for a physical process and the actual process values with a multivariate random walk. For computational efficiency, linear algebra for bandwidth limited matrices is utilized, and first-order emulator inference allows for the fast emulation of a numerical partial differential equation (PDE) solver. A test scenario from a physical system motivated by glaciology is used to examine the speed and accuracy of the computational methods used, in addition to the viability of modeling assumptions. We conclude by discussing how the model and associated methodology can be applied in other physical contexts besides glaciology.
- Published
- 2019
5. Modelling sub-daily precipitation extremes with the blended generalised extreme value distribution
- Author
-
Silius M. Vandeskog, Sara Martino, Daniela Castro-Camilo, and Håvard Rue
- Subjects
FOS: Computer and information sciences ,Statistics and Probability ,Applied Mathematics ,Applications (stat.AP) ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Agricultural and Biological Sciences (miscellaneous) ,Statistics - Applications ,General Environmental Science - Abstract
A new method is proposed for modelling the yearly maxima of sub-daily precipitation, with the aim of producing spatial maps of return level estimates. Yearly precipitation maxima are modelled using a Bayesian hierarchical model with a latent Gaussian field, with the blended generalised extreme value (bGEV) distribution used as a substitute for the more standard generalised extreme value (GEV) distribution. Inference is made less wasteful with a novel two-step procedure that performs separate modelling of the scale parameter of the bGEV distribution using peaks over threshold data. Fast inference is performed using integrated nested Laplace approximations (INLA) together with the stochastic partial differential equation approach, both implemented in . Heuristics for improving the numerical stability of with the GEV and bGEV distributions are also presented. The model is fitted to yearly maxima of sub-daily precipitation from the south of Norway and is able to quickly produce high-resolution return level maps with uncertainty. The proposed two-step procedure provides an improved model fit over standard inference techniques when modelling the yearly maxima of sub-daily precipitation with the bGEV distribution. Supplementary materials accompanying this paper appear on-line.
- Published
- 2021
6. Importance Sampling with the Integrated Nested Laplace Approximation
- Author
-
Martin Outzen Berild, Sara Martino, Virgilio Gómez-Rubio, and Håvard Rue
- Subjects
Statistics and Probability ,Methodology (stat.ME) ,FOS: Computer and information sciences ,Statistics::Applications ,Discrete Mathematics and Combinatorics ,Statistics, Probability and Uncertainty ,Statistics - Computation ,Computation (stat.CO) ,Statistics - Methodology ,Statistics::Computation - Abstract
The integrated nested Laplace approximation (INLA) is a deterministic approach to Bayesian inference on latent Gaussian models (LGMs) and focuses on fast and accurate approximation of posterior marginals for the parameters in the models. Recently, methods have been developed to extend this class of models to those that can be expressed as conditional LGMs by fixing some of the parameters in the models to descriptive values. These methods differ in the manner descriptive values are chosen. This article proposes to combine importance sampling with INLA (IS-INLA), and extends this approach with the more robust adaptive multiple importance sampling algorithm combined with INLA (AMIS-INLA). This article gives a comparison between these approaches and existing methods on a series of applications with simulated and observed datasets and evaluates their performance based on accuracy, efficiency, and robustness. The approaches are validated by exact posteriors in a simple bivariate linear model; then, they are applied to a Bayesian lasso model, a Poisson mixture, a zero-inflated Poisson model and a spatial autoregressive combined model. The applications show that the AMIS-INLA approach, in general, outperforms the other methods compared, but the IS-INLA algorithm could be considered for faster inference when good proposals are available. Supplementary materials for this article are available online.
- Published
- 2021
7. A principled distance-based prior for the shape of the Weibull model
- Author
-
Haakon Bakka, Håvard Rue, and J. van Niekerk
- Subjects
Statistics and Probability ,FOS: Computer and information sciences ,010102 general mathematics ,01 natural sciences ,Shape parameter ,Methodology (stat.ME) ,010104 statistics & probability ,Component (UML) ,0101 mathematics ,Statistics, Probability and Uncertainty ,Algorithm ,Statistics - Methodology ,Weibull distribution ,Mathematics ,Distance based - Abstract
The use of flat or weakly informative priors is popular due to the objective a priori belief in the absence of strong prior information. In the case of the Weibull model the improper uniform, equal parameter gamma and joint Jeffrey's priors for the shape parameter are popular choices. The effects and behaviors of these priors have yet to be established from a modeling viewpoint, especially their ability to reduce to the simpler exponential model. In this work we propose a new principled prior for the shape parameter of the Weibull model, originating from a prior on the distance function, and advocate this new prior as a principled choice in the absence of strong prior information. This new prior can then be used in models with a Weibull modeling component, like competing risks, joint and spatial models, to mention a few. This prior is available in the R-INLA for use, and is applied in a joint longitudinal-survival model framework using the INLA method.
- Published
- 2020
- Full Text
- View/download PDF
8. An approximate fractional Gaussian noise model with $$\mathcal {O}(n)$$ O ( n ) computational cost
- Author
-
Eirik Myrvoll-Nilsen, Håvard Rue, and Sigrunn Holbek Sørbye
- Subjects
Statistics and Probability ,Markov chain ,Covariance matrix ,Gaussian ,Autocorrelation ,010103 numerical & computational mathematics ,01 natural sciences ,Toeplitz matrix ,Theoretical Computer Science ,010104 statistics & probability ,symbols.namesake ,Computational Theory and Mathematics ,Autoregressive model ,Laplace's method ,Gaussian noise ,symbols ,Applied mathematics ,0101 mathematics ,Statistics, Probability and Uncertainty ,Mathematics - Abstract
Fractional Gaussian noise (fGn) is a stationary time series model with long-memory properties applied in various fields like econometrics, hydrology and climatology. The computational cost in fitting an fGn model of length n using a likelihood-based approach is $${{\mathcal {O}}}(n^{2})$$ , exploiting the Toeplitz structure of the covariance matrix. In most realistic cases, we do not observe the fGn process directly but only through indirect Gaussian observations, so the Toeplitz structure is easily lost and the computational cost increases to $${{\mathcal {O}}}(n^{3})$$ . This paper presents an approximate fGn model of $${{\mathcal {O}}}(n)$$ computational cost, both with direct and indirect Gaussian observations, with or without conditioning. This is achieved by approximating fGn with a weighted sum of independent first-order autoregressive (AR) processes, fitting the parameters of the approximation to match the autocorrelation function of the fGn model. The resulting approximation is stationary despite being Markov and gives a remarkably accurate fit using only four AR components. Specifically, the given approximate fGn model is incorporated within the class of latent Gaussian models in which Bayesian inference is obtained using the methodology of integrated nested Laplace approximation. The performance of the approximate fGn model is demonstrated in simulations and two real data examples.
- Published
- 2018
9. Careful prior specification avoids incautious inference for log-Gaussian Cox point processes
- Author
-
David F. R. P. Burslem, Sigrunn H. S⊘rbye, Håvard Rue, Janine B. Illian, Daniel Simpson, University of St Andrews. Statistics, University of St Andrews. Scottish Oceans Institute, and University of St Andrews. Centre for Research into Ecological & Environmental Modelling
- Subjects
0106 biological sciences ,Statistics and Probability ,Soil map ,Spatial point process ,History ,Soil nutrients ,Spatial modelling ,Bayesian analysis ,Library science ,Inference ,Penalized complexity prior ,3rd-DAS ,15. Life on land ,Tropical forest ,010603 evolutionary biology ,01 natural sciences ,Field (geography) ,Statistics::Computation ,010104 statistics & probability ,Statistics::Methodology ,QA Mathematics ,0101 mathematics ,Statistics, Probability and Uncertainty ,QA ,R-INLA - Abstract
We acknowledge the principal investigators who were responsible for collecting and analysing the soil maps (Jim Dallin, Robert John, Kyle Harms, Robert Stallard and Joe Yavitt), the funding sources (National Science Foundation grants DEB021104, 021115, 0212284 and 0212818 and Office of International Science and Engineering grant 0314581, the Smithsonian Tropical Research Institute soils initiative and the Center for Tropical Forest Science) and field assistants (Paolo Segre and Juan Di Trani). Hyperprior specifications for random fields in spatial point process modelling can have a major impact on the results. In fitting log-Gaussian Cox processes to rainforest tree species, we consider a reparameterised model combining a spatially structured and an unstructured random field into a single component. This component has one hyperpa- rameter accounting for marginal variance, while an additional hyperparameter governs the fraction of the variance explained by the spatially structured effect. This facilitates inter- pretation of the hyperparameters and significance of covariates is studied for a range of hyperprior specifications. Appropriate scaling makes the analysis invariant to grid resolution. Postprint
- Published
- 2019
10. You Just Keep on Pushing My Love over the Borderline: A Rejoinder
- Author
-
Daniel Simpson, Håvard Rue, Andrea Riebler, Sigrunn Holbek Sørbye, and Thiago G. Martins
- Subjects
0301 basic medicine ,Statistics and Probability ,Psychoanalysis ,General Mathematics ,01 natural sciences ,VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410 ,VDP::Mathematics and natural science: 400::Mathematics: 410 ,010104 statistics & probability ,03 medical and health sciences ,030104 developmental biology ,0101 mathematics ,Statistics, Probability and Uncertainty ,Mathematics - Abstract
Source at https://doi.org/10.1214/17-sts576rej. INTRODUCTION: The point of departure for our paper is that most modern statistical models are built to be flexible enough to model diverse data generating mechanisms. Good statistical practice requires us to limit this flexibility, which is typically controlled by a small number of parameters, to the amount “needed” to model the data at hand. The Bayesian framework provides a natural method for doing this although, as DD points out, this trend for penalising model complexity casts a broad shadow over all of modern statistics and data science.The PC prior framework argues for setting priors on these flexibility parameters that are specifically built to penalise a certain type of complexity and avoid overfitting. The discussants raised various points about this core idea. First, DD pointed out that while over-fitting a model is a bad thing, under-fitting is not better: we do not want Occam’s razor to slit our throat. We saw this behaviour when using a half-Normal prior on the distance, while the exponential prior does not lead to obvious attenuation of the estimates. This is confirmed experimentally by Klein and Kneib (2016).
- Published
- 2017
11. Extending Integrated Nested Laplace Approximation to a Class of Near-Gaussian Latent Models
- Author
-
Thiago G. Martins and Håvard Rue
- Subjects
Statistics and Probability ,Mathematical optimization ,Gaussian ,Markov chain Monte Carlo ,Field (computer science) ,symbols.namesake ,Distribution (mathematics) ,Laplace's method ,Component (UML) ,symbols ,Applied mathematics ,Fraction (mathematics) ,Statistics, Probability and Uncertainty ,Scope (computer science) ,Mathematics - Abstract
This work extends the integrated nested Laplace approximation (INLA) method to latent models outside the scope of latent Gaussian models, where independent components of the latent field can have a near-Gaussian distribution. The proposed methodology is an essential component of a bigger project that aims to extend the R package INLA in order to allow the user to add flexibility and challenge the Gaussian assumptions of some of the model components in a straightforward and intuitive way. Our approach is applied to two examples, and the results are compared with that obtained by Markov chain Monte Carlo, showing similar accuracy with only a small fraction of computational time. Implementation of the proposed extension is available in the R-INLA package.
- Published
- 2014
12. Geostatistical Survival Models for Environmental Risk Assessment with Large Retrospective Cohorts
- Author
-
Håvard Rue, Huan Jiang, Patrick Brown, and Silvia Emiko Shimakura
- Subjects
Statistics and Probability ,Economics and Econometrics ,education.field_of_study ,Markov chain ,Computer science ,Population ,Retrospective cohort study ,Bayesian inference ,Statistics ,Econometrics ,Statistical inference ,Statistics, Probability and Uncertainty ,Spatial dependence ,Risk assessment ,education ,Social Sciences (miscellaneous) ,Survival analysis - Abstract
Summary Motivated by the problem of cancer risk assessment near a nuclear power generating station, the paper describes a methodology for fitting a spatially correlated survival model to large retrospective cohort data sets. Retrospective cohorts, which can be assembled inexpensively from population-based health databases, can partially account for lags between exposures and outcome of chronic diseases such as cancer. These data sets overcome one of the principal limitations of cross-sectional spatial analyses, though performing statistical inference requires accommodating censored and truncated event times as well as spatial dependence. The use of spatial survival models for large retrospective cohorts is described, and Bayesian inference using Markov random-field approximations and integrated nested Laplace approximations is presented. The method is applied to data from individuals living near Pickering Nuclear Generating Station in Canada, showing that the effect of ambient radiation on cancer is not statistically significant.
- Published
- 2013
13. Discussion of ‘Beyond mean regression’
- Author
-
Håvard Rue, Janine B. Illian, Thiago G. Martins, Óli Páll Geirsson, and Daniel Simpson
- Subjects
Statistics and Probability ,Statistics ,Statistics, Probability and Uncertainty ,Regression ,Mathematics - Published
- 2013
14. Bayesian Computing with INLA: A Review
- Author
-
Janine B. Illian, Håvard Rue, Andrea Riebler, Finn Lindgren, Daniel Simpson, and Sigrunn Holbek Sørbye
- Subjects
FOS: Computer and information sciences ,Statistics and Probability ,Mathematical optimization ,010504 meteorology & atmospheric sciences ,VDP::Mathematics and natural science: 400::Mathematics: 410::Statistics: 412 ,Gaussian ,Bayesian probability ,VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Statistikk: 412 ,Bayesian inference ,01 natural sciences ,Methodology (stat.ME) ,010104 statistics & probability ,symbols.namesake ,Applied mathematics ,0101 mathematics ,Statistics - Methodology ,0105 earth and related environmental sciences ,Mathematics ,Sparse matrix ,Laplace transform ,Statistical model ,Statistics::Computation ,Bayesian statistics ,Laplace's method ,symbols ,Statistics, Probability and Uncertainty - Abstract
The key operation in Bayesian inference, is to compute high-dimensional integrals. An old approximate technique is the Laplace method or approximation, which dates back to Pierre- Simon Laplace (1774). This simple idea approximates the integrand with a second order Taylor expansion around the mode and computes the integral analytically. By developing a nested version of this classical idea, combined with modern numerical techniques for sparse matrices, we obtain the approach of Integrated Nested Laplace Approximations (INLA) to do approximate Bayesian inference for latent Gaussian models (LGMs). LGMs represent an important model-abstraction for Bayesian inference and include a large proportion of the statistical models used today. In this review, we will discuss the reasons for the success of the INLA-approach, the R-INLA package, why it is so accurate, why the approximations are very quick to compute and why LGMs make such a useful concept for Bayesian computing., 28 pages, 7 figures
- Published
- 2016
15. Spatial Modelling of Lupus Incidence Over 40 Years with Changes in Census Areas
- Author
-
Patrick Brown, Håvard Rue, Mustafa al-Maini, Ye Li, and Paul R. Fortin
- Subjects
Statistics and Probability ,education.field_of_study ,Systemic lupus erythematosus ,Incidence (epidemiology) ,Posterior probability ,Population ,Inference ,Census ,medicine.disease ,Grid ,Bayesian inference ,Geography ,Statistics ,medicine ,Statistics, Probability and Uncertainty ,education - Abstract
Summary Clinical data on the location of residence at the time of diagnosis of new lupus cases in Toronto, Canada, for the 40 years to 2007 are modelled with the aim of finding areas of abnormally high risk. Inference is complicated by numerous irregular changes in the census regions on which population is reported. A model is introduced consisting of a continuous random spatial surface and fixed effects for time and ages of individuals. The process is modelled on a fine grid and Bayesian inference performed by using integrated nested Laplace approximations. Predicted risk surfaces and posterior probabilities of exceedance are produced for lupus and, for comparison, psoriatic arthritis data from the same clinic. Simulations studies are also carried out to understand better the performance of the model proposed as well as to compare with existing methods.
- Published
- 2011
16. Gender-Specific Differences and the Impact of Family Integration on Time Trends in Age-Stratified Swiss Suicide Rates
- Author
-
Andrea Riebler, Håvard Rue, Matthias Bopp, Leonhard Held, University of Zurich, and Riebler, A
- Subjects
Statistics and Probability ,Economics and Econometrics ,Multivariate statistics ,3301 Social Sciences (miscellaneous) ,610 Medicine & health ,2002 Economics and Econometrics ,Suicide rates ,01 natural sciences ,010104 statistics & probability ,03 medical and health sciences ,0302 clinical medicine ,Overdispersion ,Medicine ,1804 Statistics, Probability and Uncertainty ,2613 Statistics and Probability ,0101 mathematics ,Suicide Risk ,Suicide mortality ,Time trends ,business.industry ,10060 Epidemiology, Biostatistics and Prevention Institute (EBPI) ,16. Peace & justice ,030227 psychiatry ,Cohort effect ,Statistics, Probability and Uncertainty ,Birth cohort ,business ,Social Sciences (miscellaneous) ,Demography - Abstract
Summary Suicide has become one of the leading causes of death of Swiss males aged between 15 and 44 years, whose age-standardized rates are about three times higher than those for females. We compared age-stratified suicide rates of Swiss men and women aged 15–79 years and analysed gender-specific differences from 1950 to 2007. Furthermore, we explored whether changes in measures of family integration can explain changes in suicide trends. The use of multivariate age–period–cohort models avoids age aggregation and allows the exploration of heterogeneous time trends across age, period and birth cohort. In addition, explanatory variables can be included. We found strong gender-specific differences in suicide mortality. Although the same risk factors may act on age and overdispersion, there was no significant correlation between gender-specific cohort effects. Family integration had an effect on Swiss suicide risk but only partially explained the underlying trends over time.
- Published
- 2011
17. Simultaneous Credible Bands for Latent Gaussian Models
- Author
-
Sigrunn Holbek Sørbye and Håvard Rue
- Subjects
Statistics and Probability ,Pointwise ,Mathematical optimization ,Gaussian ,Bayesian probability ,Inference ,Multivariate normal distribution ,Bayesian inference ,Statistics::Computation ,symbols.namesake ,Joint probability distribution ,Credible interval ,symbols ,Applied mathematics ,Statistics, Probability and Uncertainty ,Mathematics - Abstract
Deterministic Bayesian inference for latent Gaussian models has recently become available using integrated nested Laplace approximations (INLA). Applying the INLA-methodology, marginal estimates for elements of the latent field can be computed efficiently, providing relevant summary statistics like posterior means, variances and pointwise credible intervals. In this article, we extend the use of INLA to joint inference and present an algorithm to derive analytical simultaneous credible bands for subsets of the latent field. The algorithm is based on approximating the joint distribution of the subsets by multivariate Gaussian mixtures. Additionally, we present a saddlepoint approximation to compute Bayesian contour probabilities, representing the posterior support of fixed parameter vectors of interest. We perform a simulation study and apply the given methods to two real examples.
- Published
- 2011
18. On a hybrid data cloning method and its application in generalized linear mixed models
- Author
-
Hossein Baghishani, Mohsen Mohammadzadeh, and Håvard Rue
- Subjects
Statistics and Probability ,Mixed model ,Mathematical optimization ,Cloning (programming) ,Maximum likelihood ,Asymptotic distribution ,Statistical model ,Generalized linear mixed model ,Theoretical Computer Science ,Computational Theory and Mathematics ,Laplace's method ,Statistics, Probability and Uncertainty ,Algorithm ,Hybrid data ,Mathematics - Abstract
The data cloning method is a new computational tool for computing maximum likelihood estimates in complex statistical models such as mixed models. This method is synthesized with integrated nested Laplace approximation to compute maximum likelihood estimates efficiently via a fast implementation in generalized linear mixed models. Asymptotic behavior of the hybrid data cloning method is discussed. The performance of the proposed method is illustrated through a simulation study and real examples. It is shown that the proposed method performs well and rightly justifies the theory. Supplemental materials for this article are available online.
- Published
- 2011
19. Improved auxiliary mixture sampling for hierarchical models of non-Gaussian data
- Author
-
R. Frühwirth, Håvard Rue, Sylvia Frühwirth-Schnatter, and Leonhard Held
- Subjects
Statistics and Probability ,Generalized linear model ,Negative binomial distribution ,Sampling (statistics) ,Latent variable ,Theoretical Computer Science ,Binomial distribution ,Quasi-likelihood ,Computational Theory and Mathematics ,Statistics ,Applied mathematics ,Multinomial distribution ,Statistics, Probability and Uncertainty ,Mathematics ,Count data - Abstract
The article considers Bayesian analysis of hierarchical models for count, binomial and multinomial data using efficient MCMC sampling procedures. To this end, an improved method of auxiliary mixture sampling is proposed. In contrast to previously proposed samplers the method uses a bounded number of latent variables per observation, independent of the intensity of the underlying Poisson process in the case of count data, or of the number of experiments in the case of binomial and multinomial data. The bounded number of latent variables results in a more general error distribution, which is a negative log-Gamma distribution with arbitrary integer shape parameter. The required approximations of these distributions by Gaussian mixtures have been computed. Overall, the improvement leads to a substantial increase in efficiency of auxiliary mixture sampling for highly structured models. The method is illustrated for finite mixtures of generalized linear models and an epidemiological case study.
- Published
- 2008
20. On the Second-Order Random Walk Model for Irregular Locations
- Author
-
Finn Lindgren and Håvard Rue
- Subjects
Statistics and Probability ,Markov chain ,Density estimation ,Random walk ,Stochastic differential equation ,symbols.namesake ,Wiener process ,Approximation error ,Calculus ,symbols ,Applied mathematics ,Statistics, Probability and Uncertainty ,Galerkin method ,Smoothing ,Mathematics - Abstract
The second-order random walk (RW2) model is commonly used for smoothing data and for modelling response functions. It is computationally efficient due to the Markov properties of the joint (intrinsic) Gaussian density. For evenly spaced locations the RW2 model is well established, whereas for irregularly spaced locations there is no well established construction in the literature. By considering the RW2 model as the solution of a stochastic differential equation (SDE), a dis- cretely observed integrated Wiener process, it is possible to derive the density preserving the Markov properties by augmenting the state-space with the velocities. Here, we derive a computationally more efficient RW2 model for irregular locations using a Galerkin approximation to the solution of the SDE without the need of augmenting the state-space. Numerical comparison with the exact solution demonstrates that the error in the Galerkin approximation is small and negligible in applications.
- Published
- 2008
21. Approximate Bayesian inference for hierarchical Gaussian Markov random field models
- Author
-
Sara Martino and Håvard Rue
- Subjects
Statistics and Probability ,Random field ,Markov chain ,Applied Mathematics ,Monte Carlo method ,Markov chain Monte Carlo ,Marginal model ,Markov model ,Bayesian inference ,symbols.namesake ,Econometrics ,symbols ,Applied mathematics ,Statistics, Probability and Uncertainty ,Marginal distribution ,Mathematics - Abstract
Many commonly used models in statistics can be formulated as (Bayesian) hierarchical Gaussian Markov random field models. These are characterised by assuming a (often large) Gaussian Markov random field (GMRF) as the second stage in the hierarchical structure and a few hyperparameters at the third stage. Markov chain Monte Carlo is the common approach for Bayesian inference in such models. The variance of the Monte Carlo estimates is Op(M 1/2 ) where M is the number of samples in the chain so, in order to obtain precise estimates of marginal densities, say, we need M to be very large. Inspired by the fact that often one-block and independence samplers can be constructed for hierarchical GMRF models, we will in this work investigate whether MCMC is really needed to estimate marginal densities, which often is the goal of the analysis. By making use of GMRFapproximations, we show by typical examples that marginal densities can indeed be very precisely estimated by deterministic schemes. The methodological and practical consequence of these findings are indeed positive. We conjecture that for many hierarchical GMRF-models there is really no need for MCMC based inference to estimate marginal densities. Further, by making use of numerical methods for sparse matrices the computational costs of these deterministic schemes are nearly instant compared to the MCMC alternative. In particular, we discuss in detail the issue of computing marginal variances for GMRFs.
- Published
- 2007
22. Recursive computing and simulation-free inference for general factorizable models
- Author
-
Håvard Rue and Nial Friel
- Subjects
Statistics and Probability ,Markov chain ,Stochastic process ,Applied Mathematics ,General Mathematics ,Monte Carlo method ,Inference ,Markov chain Monte Carlo ,Agricultural and Biological Sciences (miscellaneous) ,Approximate inference ,symbols.namesake ,Sampling distribution ,symbols ,Calculus ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Hidden Markov random field ,Algorithm ,Mathematics - Abstract
We illustrate how the recursive algorithm of Reeves & Pettitt (2004) for general factorizable models can be extended to allow exact sampling, maximization of distributions and computation of marginal distributions. All of the methods we describe apply to discrete-valued Markov random fields with nearest neighbour integrations defined on regular lattices; in particular we illustrate that exact inference can be performed for hidden autologistic models defined on moderately sized lattices. In this context we offer an extension of this methodology which allows approximate inference to be carried out for larger lattices without resorting to simulation techniques such as Markov chain Monte Carlo. In particular our work offers the basis for an automatic inference machine for such models. Copyright 2007, Oxford University Press.
- Published
- 2007
23. Penalized complexity priors for degrees of freedom in Bayesian P-splines
- Author
-
Massimo Ventrucci, Håvard Rue, Ventrucci, M, and Rue, H
- Subjects
FOS: Computer and information sciences ,Statistics and Probability ,Computer science ,Gaussian ,05 social sciences ,Bayesian probability ,Penalized complexity prior ,P splines ,penalized spline regression ,01 natural sciences ,Methodology (stat.ME) ,010104 statistics & probability ,symbols.namesake ,Spline (mathematics) ,Bayesian P-spline ,degrees of freedom ,0502 economics and business ,Prior probability ,symbols ,0101 mathematics ,Statistics, Probability and Uncertainty ,Gaussian markov random fields ,Algorithm ,Statistics - Methodology ,050205 econometrics - Abstract
Bayesian penalized splines (P-splines) assume an intrinsic Gaussian Markov random field prior on the spline coefficients, conditional on a precision hyper-parameter [Formula: see text]. Prior elicitation of [Formula: see text] is difficult. To overcome this issue, we aim to building priors on an interpretable property of the model, indicating the complexity of the smooth function to be estimated. Following this idea, we propose penalized complexity (PC) priors for the number of effective degrees of freedom. We present the general ideas behind the construction of these new PC priors, describe their properties and show how to implement them in P-splines for Gaussian data.
- Published
- 2015
24. Bayesian Spatial Modelling with R-INLA
- Author
-
Finn Lindgren and Håvard Rue
- Subjects
Statistics and Probability ,Statistics::Applications ,business.industry ,Computer science ,Gaussian ,Bayesian probability ,Markov chain Monte Carlo ,Geostatistics ,Machine learning ,computer.software_genre ,Bayesian inference ,Point process ,Statistics::Computation ,Stochastic partial differential equation ,symbols.namesake ,Laplace's method ,symbols ,Applied mathematics ,Artificial intelligence ,Statistics, Probability and Uncertainty ,business ,lcsh:Statistics ,lcsh:HA1-4737 ,computer ,Software - Abstract
The principles behind the interface to continuous domain spatial models in the R- INLA software package for R are described. The integrated nested Laplace approximation (INLA) approach proposed by Rue, Martino, and Chopin (2009) is a computationally effective alternative to MCMC for Bayesian inference. INLA is designed for latent Gaussian models, a very wide and flexible class of models ranging from (generalized) linear mixed to spatial and spatio-temporal models. Combined with the stochastic partial differential equation approach (SPDE, Lindgren, Rue, and Lindström 2011), one can accommodate all kinds of geographically referenced data, including areal and geostatistical ones, as well as spatial point process data. The implementation interface covers stationary spatial mod- els, non-stationary spatial models, and also spatio-temporal models, and is applicable in epidemiology, ecology, environmental risk assessment, as well as general geostatistics.
- Published
- 2015
25. Beyond the Valley of the Covariance Function
- Author
-
Håvard Rue, Finn Lindgren, and Daniel Simpson
- Subjects
Statistics and Probability ,FOS: Computer and information sciences ,Class (set theory) ,Multivariate statistics ,Covariance function ,Statistics::Applications ,Computer science ,General Mathematics ,Quantitative Biology::Tissues and Organs ,Solid-state ,Univariate ,Quantitative Biology::Other ,Methodology (stat.ME) ,stat.ME ,Statistics::Methodology ,Point (geometry) ,Statistics, Probability and Uncertainty ,Focus (optics) ,Mathematical economics ,Spatial analysis ,Statistics - Methodology - Abstract
Discussion of "Cross-Covariance Functions for Multivariate Geostatistics" by Genton and Kleiber [arXiv:1507.08017]., Comment: Published at http://dx.doi.org/10.1214/15-STS515 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)
- Published
- 2015
- Full Text
- View/download PDF
26. Improving the INLA approach for approximate Bayesian inference for latent Gaussian models
- Author
-
Håvard Rue and Egil Ferkingstad
- Subjects
Statistics and Probability ,copulas ,Gaussian ,Bayesian probability ,Inference ,Mathematics - Statistics Theory ,integrated nested Laplace approximation ,Bayesian computation ,Bayesian inference ,Poisson distribution ,Statistics - Computation ,Generalized linear mixed model ,Statistics::Computation ,symbols.namesake ,generalized linear mixed models ,latent Gaussian models ,Laplace's method ,Binary data ,symbols ,Applied mathematics ,Statistics, Probability and Uncertainty ,62F15 ,Statistics - Methodology ,Mathematics - Abstract
Weintroduceanewcopula-basedcorrectionforgeneralizedlinearmixedmodels(GLMMs)within the integrated nested Laplace approximation (INLA) approach for approximateBayesian inference for latent Gaussian models. GLMMs for Binomial and Poisson datawith many zeroes and low counts have been a somewhat difficult case for INLA, and thecase of binary data has been particularly problematic. Our new correction has been im-plemented as part of the R-INLA package, and adds only negligible computational cost.Empiricalevaluationsonbothrealandsimulateddataindicatethatthemethodworkswell.Keywords: Bayesian computation; copulas; generalized linear mixed models; integratednestedLaplaceapproximation;latentGaussianmodels 1 Introduction IntegratedNestedLaplaceApproximations(INLA)wereintroducedbyRue,MartinoCMcCulloch,SearleN2;:::;n,letProb(y
- Published
- 2015
27. Spatial Data Analysis withR-INLAwith Some Extensions
- Author
-
Virgilio Gómez-Rubio, Håvard Rue, and Roger Bivand
- Subjects
Statistics and Probability ,Interface (Java) ,Bayesian probability ,Model parameters ,computer.software_genre ,Bayesian inference ,spatial statistics ,Statistics::Computation ,Numerical integration ,Range (mathematics) ,Laplace's method ,INLA ,Data mining ,Statistics, Probability and Uncertainty ,lcsh:Statistics ,lcsh:HA1-4737 ,Spatial analysis ,Algorithm ,computer ,Software ,Mathematics - Abstract
http://www.jstatsoft.org/v63/i20 The integrated nested Laplace approximation (INLA) provides an interesting way of approximating the posterior marginals of a wide range of Bayesian hierarchical models. This approximation is based on conducting a Laplace approximation of certain functions and numerical integration is extensively used to integrate some of the models parameters out. The R - INLA package o ers an interface to INLA, providing a suitable framework for data analysis. Although the INLA methodology can deal with a large number of models, only the most relevant have been implemented within R - INLA . However, many other important models are not available for R - INLA yet. In this paper we show how to fit a number of spatial models with R - INLA , including its interaction with other R packages for data analysis. Secondly, we describe a novel method to extend the number of latent models available for the model parameters. Our approach is based on conditioning on one or several model parameters and fit these conditioned models with R-INLA. Then these models are combined using Bayesian model averaging to provide the final approximations to the posterior marginals of the model. Finally, we show some examples of the application of this technique in spatial statistics. It is worth noting that our approach can be extended to a number of other fields, and not only spatial statistics
- Published
- 2015
28. Bayesian analysis of measurement error models using integrated nested Laplace approximations
- Author
-
Stefanie Muff, Andrea Riebler, Håvard Rue, Leonhard Held, Philippe Saner, University of Zurich, and Held, Leonhard
- Subjects
Statistics and Probability ,Mathematical optimization ,Observational error ,Laplace transform ,Gaussian ,Bayesian probability ,Sampling (statistics) ,610 Medicine & health ,10060 Epidemiology, Biostatistics and Prevention Institute (EBPI) ,Bayesian inference ,Field (computer science) ,symbols.namesake ,symbols ,Applied mathematics ,Errors-in-variables models ,1804 Statistics, Probability and Uncertainty ,2613 Statistics and Probability ,Statistics, Probability and Uncertainty ,Mathematics - Abstract
Summary To account for measurement error (ME) in explanatory variables, Bayesian approaches provide a flexible framework, as expert knowledge can be incorporated in the prior distributions. Recently, integrated nested Laplace approximations have been proven to be a computationally convenient alternative to sampling approaches for Bayesian inference in latent Gaussian models. We show how the most common approaches to adjust for ME, the classical and the Berkson ME, fit into this framework. This is achieved through a reformulation with augmented pseudo-observations and a suitable extension of the latent Gaussian field. Two specific classes are described, which allow for a particularly simple implementation using integrated nested Laplace approximations. We present three applications within the framework of generalized linear (mixed) models with ME. To illustrate the practical feasibility, R code is provided in on-line supplementary material.
- Published
- 2015
- Full Text
- View/download PDF
29. M-Smoother with local Linear Fit
- Author
-
Fred Godtliebsen, Håvard Rue, C. K. Chu, and James Stephen Marron
- Subjects
Statistics and Probability ,Mathematical optimization ,Boundary effects ,Local linear ,Noise reduction ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Signal ,Nonparametric regression ,Maxima and minima ,Statistics, Probability and Uncertainty ,Constant (mathematics) ,Algorithm ,Smoothing ,ComputingMethodologies_COMPUTERGRAPHICS ,Mathematics - Abstract
Local linear M-smoothing is proposed as a method for noise reduction in one-dimensional signals. It is more appropriate than conventional local linear smoothing, because it does not introduce blurring of jumps in the signal. It improves local constant M-smoothing, by avoiding boundary effects at edges and jumps. While the idea of local linear M-smoothing is straightforward, numerical issues are challenging, because of the local minima aspect that is crucial to good performance. We give an algorithm which is both fast and robust together with the theoretical properties of the local linear M-smoother. The new M-smoother gives a large improvement for some data sets compared to the local constant M-smoother and demonstrates elsewhere good performance on various artificial and magnetic resonance data sets.
- Published
- 2002
30. Penalising model component complexity: A principled, practical approach to constructing priors
- Author
-
Andrea Riebler, Håvard Rue, Sigrunn Holbek Sørbye, Thiago G. Martins, and Daniel Simpson
- Subjects
0106 biological sciences ,Statistics and Probability ,FOS: Computer and information sciences ,Mathematical optimization ,information geometry ,Exploit ,General Mathematics ,Bayesian probability ,disease mapping ,occam ,Machine learning ,computer.software_genre ,010603 evolutionary biology ,01 natural sciences ,Methodology (stat.ME) ,010104 statistics & probability ,Bayesian theory ,Prior probability ,interpretable prior distributions ,Information geometry ,0101 mathematics ,Invariant (mathematics) ,Scaling ,Statistics - Methodology ,Mathematics ,computer.programming_language ,hierarchical models ,business.industry ,Univariate ,prior on correlation matrices ,VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410 ,VDP::Mathematics and natural science: 400::Mathematics: 410 ,stat.ME ,Artificial intelligence ,Statistics, Probability and Uncertainty ,business ,computer - Abstract
In this paper, we introduce a new concept for constructing prior distributions. We exploit the natural nested structure inherent to many model components, which defines the model component to be a flexible extension of a base model. Proper priors are defined to penalise the complexity induced by deviating from the simpler base model and are formulated after the input of a user-defined scaling parameter for that model component, both in the univariate and the multivariate case. These priors are invariant to reparameterisations, have a natural connection to Jeffreys' priors, are designed to support Occam's razor and seem to have excellent robustness properties, all which are highly desirable and allow us to use this approach to define default prior distributions. Through examples and theoretical results, we demonstrate the appropriateness of this approach and how it can be applied in various situations., Major revision of previous version. Includes a beefed up literature review and new desiderata for hierarchical priors. Removes (for space) the Cox proportional hazard model and the section on hyperparameters for Gaussian random fields
- Published
- 2014
31. [Untitled]
- Author
-
Ingelin Steinsland, Håvard Rue, and Merrilee Hurn
- Subjects
Statistics and Probability ,Structure (mathematical logic) ,Estimation theory ,business.industry ,Computer science ,Cognitive neuroscience of visual object recognition ,Statistical model ,Markov chain Monte Carlo ,Pattern recognition ,Theoretical Computer Science ,Image (mathematics) ,symbols.namesake ,Software ,Template ,Computational Theory and Mathematics ,symbols ,Artificial intelligence ,Statistics, Probability and Uncertainty ,business - Abstract
In recent years, a number of statistical models have been proposed for the purposes of high-level image analysis tasks such as object recognition. However, in general, these models remain hard to use in practice, partly as a result of their complexity, partly through lack of software. In this paper we concentrate on a particular deformable template model which has proved potentially useful for locating and labelling cells in microscope slides Rue and Hurn (1999). This model requires the specification of a number of rather non-intuitive parameters which control the shape variability of the deformed templates. Our goal is to arrange the estimation of these parameters in such a way that the microscope user's expertise is exploited to provide the necessary training data graphically by identifying a number of cells displayed on a computer screen, but that no additional statistical input is required. In this paper we use maximum likelihood estimation incorporating the error structure in the generation of our training data.
- Published
- 2001
32. Bayesian object identification
- Author
-
Håvard Rue and Merrilee Hurn
- Subjects
Statistics and Probability ,Markov chain ,Applied Mathematics ,General Mathematics ,Cognitive neuroscience of visual object recognition ,Markov process ,Reversible-jump Markov chain Monte Carlo ,Object (computer science) ,Agricultural and Biological Sciences (miscellaneous) ,symbols.namesake ,Prior probability ,Econometrics ,symbols ,Object type ,Statistics, Probability and Uncertainty ,General Agricultural and Biological Sciences ,Algorithm ,Gibbs sampling ,Mathematics - Abstract
This paper addresses the task of locating and identifying an unknown number of objects of different types in an image. Baddeley & Van Lieshout (1993) advocate marked point processes as object priors, whereas Grenander & Miller (1994) use deformable template models. In this paper elements of both approaches are combined to handle scenes containing variable numbers of objects of different types, using reversible jump Markov chain Monte Carlo methods for inference (Green, 1995). The naive application of these methods here leads to slow mixing and we adapt the model and algorithm in tandem in proposing three strategies to deal with this. The first two expand the model space by introducing an additional 'unknown' object type and the idea of a variable resolution template. The third strategy, utilising the first two, augments the algorithm with classes of updates which provide intuitive transitions between realisations containing different numbers of cells by splitting or merging nearby objects.
- Published
- 1999
33. Block updating in constrained Markov chain Monte Carlo sampling
- Author
-
Håvard Rue, Merrilee A. Hum, and Nuala A. Sheehan
- Subjects
Statistics and Probability ,Mathematical optimization ,Markov chain mixing time ,Markov chain ,Monte Carlo method ,Slice sampling ,Markov chain Monte Carlo ,Hybrid Monte Carlo ,symbols.namesake ,Coupling from the past ,symbols ,Parallel tempering ,Statistics, Probability and Uncertainty ,Mathematics - Abstract
Markov chain Monte Carlo methods are widely used to study highly structured stochastic systems. However, when the system is subject to constraints, it is difficult to find irreducible proposal distributions. We suggest a “block-wise” approach for constrained sampling and optimisation.
- Published
- 1999
34. Bayesian Image Classification with Baddeley's Delta Loss
- Author
-
Håvard Rue and Arnoldo Frigessi
- Subjects
Statistics and Probability ,Contextual image classification ,business.industry ,Bayesian probability ,Estimator ,Pattern recognition ,Bayesian inference ,Metropolis–Hastings algorithm ,Metric (mathematics) ,Discrete Mathematics and Combinatorics ,Artificial intelligence ,Statistics, Probability and Uncertainty ,business ,Invariant estimator ,Image restoration ,Mathematics - Abstract
In this article we adopt Baddeley's delta metric as a loss function in Bayesian image restoration and classification. We develop a new algorithm that allows us to approximate the corresponding optimal Bayesian estimator. With this algorithm good practical estimates can be obtained at approximately the same computational cost as traditional estimators like the marginal posterior mode (MPM). A comparison of our proposed classification with MPM shows significant advantages, especially with respect to fine structures.
- Published
- 1997
35. A Loss Function Model for the Restoration of Grey Level Images
- Author
-
Håvard Rue
- Subjects
Statistics and Probability ,Bayes estimator ,Markov chain ,Estimator ,Markov chain Monte Carlo ,Bayesian inference ,symbols.namesake ,Bayes' theorem ,Huber loss ,Statistics ,symbols ,Statistics, Probability and Uncertainty ,Algorithm ,Image restoration ,Mathematics - Abstract
Common loss functions used for the restoration of grey scale images include the zero-one loss and the sum of squared errors. The corresponding estimators, the posterior mode and the posterior marginal mean, are optimal Bayes estimators with respect to their way of measuring the loss for different error configurations. However, both these loss functions have a fundamental weakness: the loss does not depend on the spatial structure of the errors. This is important because a systematic structure in the errors can lead to misinterpretation of the estimated image. We propose a new loss function that also penalizes strong local sample covariance in the error and we discuss how the optimal Bayes estimator can be estimated using a two-step Markov chain Monte Carlo and simulated annealing algorithm. We present simulation results for some artificial data which show improvement with respect to small structures in the image.
- Published
- 1997
36. Exploring a New Class of Non-stationary Spatial Gaussian Random Fields with Varying Local Anisotropy
- Author
-
Daniel Simpson, Geir-Arne Fuglstad, Håvard Rue, and Finn Lindgren
- Subjects
Statistics and Probability ,FOS: Computer and information sciences ,Random field ,Markov chain ,Gaussian ,Bayesian probability ,White noise ,Hierarchical database model ,Methodology (stat.ME) ,symbols.namesake ,Matrix (mathematics) ,stat.ME ,Position (vector) ,symbols ,Statistics::Methodology ,Statistical physics ,Statistics, Probability and Uncertainty ,Statistics - Methodology ,Mathematics - Abstract
Gaussian random fields (GRFs) constitute an important part of spatial modelling, but can be computationally infeasible for general covariance structures. An efficient approach is to specify GRFs via stochastic partial differential equations (SPDEs) and derive Gaussian Markov random field (GMRF) approximations of the solutions. We consider the construction of a class of non-stationary GRFs with varying local anisotropy, where the local anisotropy is introduced by allowing the coefficients in the SPDE to vary with position. This is done by using a form of diffusion equation driven by Gaussian white noise with a spatially varying diffusion matrix. This allows for the introduction of parameters that control the GRF by parametrizing the diffusion matrix. These parameters and the GRF may be considered to be part of a hierarchical model and the parameters estimated in a Bayesian framework. The results show that the use of an SPDE with non-constant coefficients is a promising way of creating non-stationary spatial GMRFs that allow for physical interpretability of the parameters, although there are several remaining challenges that would need to be solved before these models can be put to general practical use.
- Published
- 2013
- Full Text
- View/download PDF
37. A skew Gaussian decomposable graphical model
- Author
-
Håvard Rue, Finn Lindgren, Majid Jafari Khaledi, and Hamid Zareifard
- Subjects
Statistics and Probability ,FOS: Computer and information sciences ,Multivariate random variable ,Gaussian ,Mathematics - Statistics Theory ,Statistics Theory (math.ST) ,01 natural sciences ,Methodology (stat.ME) ,010104 statistics & probability ,symbols.namesake ,Decomposable graphical models ,Conditional independence ,0502 economics and business ,Prior probability ,Statistics ,FOS: Mathematics ,Graphical model ,Multivariate closed skew normal distribution ,0101 mathematics ,Statistics - Methodology ,050205 econometrics ,Mathematics ,Noninformative prior ,Numerical Analysis ,05 social sciences ,Skew ,Skewness ,symbols ,Identifiability ,Graph (abstract data type) ,Statistics, Probability and Uncertainty ,Algorithm - Abstract
This paper proposes a novel decomposable graphical model to accommodate skew Gaussian graphical models. We encode conditional independence structure among the components of the multivariate closed skew normal random vector by means of a decomposable graph so that the pattern of zero off-diagonal elements in the precision matrix corresponds to the missing edges of the given graph. We present conditions that guarantee the propriety of the posterior distributions under the standard noninformative priors for mean vector and precision matrix, and a proper prior for skewness parameter. The identifiability of the parameters is investigated by a simulation study. Finally, we apply our methodology to two data sets.
- Published
- 2013
- Full Text
- View/download PDF
38. A toolbox for fitting complex spatial point process models using integrated nested Laplace approximation (INLA)
- Author
-
Janine B. Illian, Sigrunn Holbek Sørbye, Håvard Rue, University of St Andrews. School of Mathematics and Statistics, University of St Andrews. Scottish Oceans Institute, and University of St Andrews. Centre for Research into Ecological & Environmental Modelling
- Subjects
FOS: Computer and information sciences ,Statistics and Probability ,Speedup ,VDP::Mathematics and natural science: 400::Mathematics: 410::Statistics: 412 ,Computer science ,VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Statistikk: 412 ,Model assessment ,Marked point patterns ,Statistics - Applications ,marked point patterns ,Covariate ,Applications (stat.AP) ,Point (geometry) ,QA Mathematics ,QA ,model assessment ,Process (computing) ,Model comparison ,Toolbox ,Data set ,Cox processes ,Laplace's method ,model comparison ,Modeling and Simulation ,Statistics, Probability and Uncertainty ,Point process models ,Algorithm - Abstract
This paper develops methodology that provides a toolbox for routinely fitting complex models to realistic spatial point pattern data. We consider models that are based on log-Gaussian Cox processes and include local interaction in these by considering constructed covariates. This enables us to use integrated nested Laplace approximation and to considerably speed up the inferential task. In addition, methods for model comparison and model assessment facilitate the modelling process. The performance of the approach is assessed in a simulation study. To demonstrate the versatility of the approach, models are fitted to two rather different examples, a large rainforest data set with covariates and a point pattern with multiple marks., Comment: Published in at http://dx.doi.org/10.1214/11-AOAS530 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)
- Published
- 2012
39. New Loss Functions in Bayesian Imaging
- Author
-
Håvard Rue
- Subjects
Statistics and Probability ,Markov random field ,Markov chain ,Bayesian probability ,Markov chain Monte Carlo ,Bayesian inference ,symbols.namesake ,Bayes' theorem ,Gaussian noise ,Simulated annealing ,Statistics ,symbols ,Statistics, Probability and Uncertainty ,Algorithm ,Mathematics - Abstract
Unlike the development of more accurate prior distributions for use in Bayesian imaging, the design of more sensible estimators through loss functions has been neglected in the literature. We discuss the design of loss functions with a local structure that depend only on a binary misclassification vector. The proposed approach is similar to modeling with a Markov random field. The Bayes estimate is calculated in a two-step algorithm using Markov chain Monte Carlo and simulated annealing algorithms. We present simulation experiments with the Ising model, where the observations are corrupted with Gaussian and flip noise.
- Published
- 1995
40. Bayesian analysis of RNA sequencing data by estimating multiple shrinkage priors
- Author
-
Håvard Rue, A.W. van der Vaart, M.A. van de Wiel, W.N. van Wieringen, Gwenaël G. R. Leday, Luba M. Pardo, Mathematics, Neuroscience Campus Amsterdam - Neurobiology of Mental Health, Neuroscience Campus Amsterdam - Brain Mechanisms in Health & Disease, Epidemiology and Data Science, Human genetics, NCA - Brain mechanisms in health and disease, CCA - Disease profiling, and NCA - Neurobiology of mental health
- Subjects
Statistics and Probability ,Genetics ,Models, Statistical ,Base Sequence ,Sequence Analysis, RNA ,Sequencing data ,Bayesian probability ,Molecular Sequence Data ,RNA ,Bayes Theorem ,General Medicine ,Computational biology ,Biology ,DNA sequencing ,chemistry.chemical_compound ,chemistry ,Data Interpretation, Statistical ,Prior probability ,Computer Simulation ,Statistics, Probability and Uncertainty ,DNA microarray ,DNA - Abstract
Next generation sequencing is quickly replacing microarrays as a technique to probe different molecular levels of the cell, such as DNA or RNA. The technology provides higher resolution, while reducing bias. RNA sequencing results in counts of RNA strands. This type of data imposes new statistical challenges. We present a novel, generic approach to model and analyze such data. Our approach aims at large flexibility of the likelihood (count) model and the regression model alike. Hence, a variety of count models is supported, such as the popular NB model, which accounts for overdispersion. In addition, complex, non-balanced designs and random effects are accommodated. Like some other methods, our method provides shrinkage of dispersion-related parameters. However, we extend it by enabling joint shrinkage of parameters, including those for which inference is desired. We argue that this is essential for Bayesian multiplicity correction. Shrinkage is effectuated by empirically estimating priors. We discuss several parametric (mixture) and non-parametric priors and develop procedures to estimate (parameters of) those. Inference is provided by means of local and Bayesian false discovery rates. We illustrate our method on several simulations and two data sets, also to compare it with other methods. Model-and data-based simulations show substantial improvements in the sensitivity at the given specificity. The data motivate the use of the ZI-NB as a powerful alternative to the NB, which results in higher detection rates for low-count data. Finally, compared with other methods, the results on small sample subsets are more reproducible when validated on their large sample complements, illustrating the importance of the type of shrinkage. © 2012 The Author.
- Published
- 2012
41. Estimation and extrapolation of time trends in registry data—Borrowing strength from related populations
- Author
-
Leonhard Held, Håvard Rue, and Andrea Riebler
- Subjects
FOS: Computer and information sciences ,Statistics and Probability ,multivariate age-period-cohort model ,Multivariate statistics ,projections ,Univariate ,Bayesian analysis ,Markov chain Monte Carlo ,Missing data ,Random effects model ,Statistics - Applications ,symbols.namesake ,Overdispersion ,Modeling and Simulation ,uniform correlation matrix ,Prior probability ,Statistics ,symbols ,INLA ,Applications (stat.AP) ,Imputation (statistics) ,Statistics, Probability and Uncertainty ,Mathematics - Abstract
To analyze and project age-specific mortality or morbidity rates age-period-cohort (APC) models are very popular. Bayesian approaches facilitate estimation and improve predictions by assigning smoothing priors to age, period and cohort effects. Adjustments for overdispersion are straightforward using additional random effects. When rates are further stratified, for example, by countries, multivariate APC models can be used, where differences of stratum-specific effects are interpretable as log relative risks. Here, we incorporate correlated stratum-specific smoothing priors and correlated overdispersion parameters into the multivariate APC model, and use Markov chain Monte Carlo and integrated nested Laplace approximations for inference. Compared to a model without correlation, the new approach may lead to more precise relative risk estimates, as shown in an application to chronic obstructive pulmonary disease mortality in three regions of England and Wales. Furthermore, the imputation of missing data for one particular stratum may be improved, since the new approach takes advantage of the remaining strata if the corresponding observations are available there. This is shown in an application to female mortality in Denmark, Sweden and Norway from the 20th century, where we treat for each country in turn either the first or second half of the observations as missing and then impute the omitted data. The projections are compared to those obtained from a univariate APC model and an extended Lee--Carter demographic forecasting approach using the proper Dawid--Sebastiani scoring rule., Comment: Published in at http://dx.doi.org/10.1214/11-AOAS498 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)
- Published
- 2012
42. An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach
- Author
-
Finn Lindgren, Johan Lindström, and Håvard Rue
- Subjects
Statistics and Probability ,Mathematical optimization ,Covariance function ,Gaussian ,Covariance ,Gaussian random field ,Stochastic partial differential equation ,Matrix (mathematics) ,symbols.namesake ,symbols ,Applied mathematics ,Markov property ,Statistics, Probability and Uncertainty ,covariance functions ,gaussian markov random fields ,latent gaussian models ,stochastic partial differential equations ,gaussian fields ,sparse matrices ,approximate bayesian inference ,Sparse matrix ,Mathematics - Abstract
Summary Continuously indexed Gaussian fields (GFs) are the most important ingredient in spatial statistical modelling and geostatistics. The specification through the covariance function gives an intuitive interpretation of the field properties. On the computational side, GFs are hampered with the big n problem, since the cost of factorizing dense matrices is cubic in the dimension. Although computational power today is at an all time high, this fact seems still to be a computational bottleneck in many applications. Along with GFs, there is the class of Gaussian Markov random fields (GMRFs) which are discretely indexed. The Markov property makes the precision matrix involved sparse, which enables the use of numerical algorithms for sparse matrices, that for fields in ℝ2 only use the square root of the time required by general algorithms. The specification of a GMRF is through its full conditional distributions but its marginal properties are not transparent in such a parameterization. We show that, using an approximate stochastic weak solution to (linear) stochastic partial differential equations, we can, for some GFs in the Matérn class, provide an explicit link, for any triangulation of ℝd, between GFs and GMRFs, formulated as a basis function representation. The consequence is that we can take the best from the two worlds and do the modelling by using GFs but do the computations by using GMRFs. Perhaps more importantly, our approach generalizes to other covariance functions generated by SPDEs, including oscillating and non-stationary GFs, as well as GFs on manifolds. We illustrate our approach by analysing global temperature data with a non-stationary model defined on a sphere.
- Published
- 2011
43. Approximate Bayesian Inference for Survival Models
- Author
-
Rupali Akerkar, Sara Martino, and Håvard Rue
- Subjects
Statistics and Probability ,business.industry ,Machine learning ,computer.software_genre ,Bayesian inference ,Variable-order Bayesian network ,Marginal likelihood ,Statistics::Computation ,Bayesian statistics ,Approximate inference ,Frequentist inference ,Statistics ,Bayesian experimental design ,Artificial intelligence ,Statistics, Probability and Uncertainty ,Bayesian linear regression ,business ,computer ,Mathematics - Abstract
Bayesian analysis of time-to-event data, usually called survival analysis, has received increasing attention in the last years. In Cox-type models it allows to use information from the full likelihood instead of from a partial likelihood, so that the baseline hazard function and the model parameters can be jointly estimated. In general, Bayesian methods permit a full and exact posterior inference for any parameter or predictive quantity of interest. On the other side, Bayesian inference often relies on Markov chain Monte Carlo (MCMC) techniques which, from the user point of view, may appear slow at delivering answers. In this article, we show how a new inferential tool named integrated nested Laplace approximations can be adapted and applied to many survival models making Bayesian analysis both fast and accurate without having to rely on MCMC-based inference.
- Published
- 2010
44. Bayesian inference for generalized linear mixed models
- Author
-
Jon Wakefield, Youyi Fong, and Håvard Rue
- Subjects
Statistics and Probability ,Bayesian probability ,Inference ,Breast Neoplasms ,Machine learning ,computer.software_genre ,Bayesian inference ,Generalized linear mixed model ,Cohort Studies ,Bayes' theorem ,Seizures ,Humans ,Computer Simulation ,Longitudinal Studies ,Mathematics ,Stochastic Processes ,Epilepsy ,business.industry ,Linear model ,Bayes Theorem ,General Medicine ,Articles ,Statistics::Computation ,Binary data ,Linear Models ,Female ,Artificial intelligence ,Statistics, Probability and Uncertainty ,business ,computer ,Smoothing - Abstract
Generalized linear mixed models (GLMMs) continue to grow in popularity due to their ability to directly acknowledge multiple levels of dependency and model different data types. For small sample sizes especially, likelihood-based inference can be unreliable with variance components being particularly difficult to estimate. A Bayesian approach is appealing but has been hampered by the lack of a fast implementation, and the difficulty in specifying prior distributions with variance components again being particularly problematic. Here, we briefly review previous approaches to computation in Bayesian implementations of GLMMs and illustrate in detail, the use of integrated nested Laplace approximations in this context. We consider a number of examples, carefully specifying prior distributions on meaningful quantities in each case. The examples cover a wide range of data types including those requiring smoothing over time and a relatively complicated spline model for which we examine our prior specification in terms of the implied degrees of freedom. We conclude that Bayesian inference is now practically feasible for GLMMs and provides an attractive alternative to likelihood-based approaches such as penalized quasi-likelihood. As with likelihood-based approaches, great care is required in the analysis of clustered binary data since approximation strategies may be less accurate for such data.
- Published
- 2010
45. Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations
- Author
-
Håvard Rue, Sara Martino, Nicolas Chopin, Department of Mathematical Sciences, Centre de Recherche en Économie et Statistique (CREST), and Ecole Nationale de la Statistique et de l'Analyse de l'Information [Bruz] (ENSAI)-École polytechnique (X)-École Nationale de la Statistique et de l'Administration Économique (ENSAE Paris)-Centre National de la Recherche Scientifique (CNRS)
- Subjects
Statistics and Probability ,Mathematical optimization ,Markov chain ,Linear model ,Markov chain Monte Carlo ,Bayesian inference ,01 natural sciences ,Variable-order Bayesian network ,010104 statistics & probability ,03 medical and health sciences ,symbols.namesake ,0302 clinical medicine ,Laplace's method ,[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST] ,symbols ,Applied mathematics ,030212 general & internal medicine ,Semiparametric regression ,0101 mathematics ,Statistics, Probability and Uncertainty ,ComputingMilieux_MISCELLANEOUS ,Gibbs sampling ,Mathematics - Abstract
SummaryStructured additive regression models are perhaps the most commonly used class of models in statistical applications. It includes, among others, (generalized) linear models, (generalized) additive models, smoothing spline models, state space models, semiparametric regression, spatial and spatiotemporal models, log-Gaussian Cox processes and geostatistical and geoadditive models. We consider approximate Bayesian inference in a popular subset of structured additive regression models, latent Gaussian models, where the latent field is Gaussian, controlled by a few hyperparameters and with non-Gaussian response variables. The posterior marginals are not available in closed form owing to the non-Gaussian response variables. For such models, Markov chain Monte Carlo methods can be implemented, but they are not without problems, in terms of both convergence and computational time. In some practical applications, the extent of these problems is such that Markov chain Monte Carlo sampling is simply not an appropriate tool for routine analysis. We show that, by using an integrated nested Laplace approximation and its simplified version, we can directly compute very accurate approximations to the posterior marginals. The main benefit of these approximations is computational: where Markov chain Monte Carlo algorithms need hours or days to run, our approximations provide more precise estimates in seconds or minutes. Another advantage with our approach is its generality, which makes it possible to perform Bayesian analysis in an automatic, streamlined way, and to compute model comparison criteria and various predictive measures so that models can be compared and the model under study can be challenged.
- Published
- 2009
46. Approximate Bayesian Inference in Spatial Generalized Linear Mixed Models
- Author
-
Sara Martino, Håvard Rue, and Jo Eidsvik
- Subjects
Statistics and Probability ,Approximation theory ,Approximation error ,Posterior probability ,Calculus ,Linear model ,Applied mathematics ,Conditional probability distribution ,Statistics, Probability and Uncertainty ,Bayesian inference ,Likelihood function ,Marginal likelihood ,Mathematics - Abstract
This thesis consists of five papers, presented in chronological order. Their content is summarised in this section.Paper I introduces the approximation tool for latent GMRF models and discusses, in particular, the approximation for the posterior of the hyperparameters θ in equation (1). It is shown that this approximation is indeed very accurate, as even long MCMC runs cannot detect any error in it. A Gaussian approximation to the density of χi|θ, y is also discussed. This appears to give reasonable results and it is very fast to compute. However, slight errors are detected when comparing the approximation with long MCMC runs. These are mostly due to the fact that a possible - skewed density is approximated via a symmetric one. Paper I presents also some details about sparse matrices algorithms.The core of the thesis is presented in Paper II. Here most of the remaining issues present in Paper I are solved. Three different approximation for χi|θ, y with different degrees of accuracy and computational costs are described. Moreover, ways to assess the approximation error and considerations about the asymptotical behaviour of the approximations are also discussed. Through a series of examples covering a wide range of commonly used latent GMRF models, the approximations are shown to give extremely accurate results in a fraction of the computing time used by MCMC algorithms.Paper III applies the same ideas as Paper II to generalised linear mixed models where χ represents a latent variable at n spatial sites on a two dimensional domain. Out of these n sites k, with n >> k , are observed through data. The n sites are assumed to be on a regular grid and wrapped on a torus. For the class of models described in Paper III the computations are based on discrete Fourier transform instead of sparse matrices. Paper III illustrates also how marginal likelihood π (y) can be approximated, provides approximate strategies for Bayesian outlier detection and perform approximate evaluation of spatial experimental design.Paper IV presents yet another application of the ideas in Paper II. Here approximate techniques are used to do inference on multivariate stochastic volatility models, a class of models widely used in financial applications. Paper IV discusses also problems deriving from the increased dimension of the parameter vector θ, a condition which makes all numerical integration more computationally intensive. Different approximations for the posterior marginals of the parameters θ, π(θi)|y), are also introduced. Approximations to the marginal likelihood π(y) are used in order to perform model comparison.Finally, Paper V is a manual for a program, named inla which implements all approximations described in Paper II. A large series of worked out examples, covering many well known models, illustrate the use and the performance of the inla program. This program is a valuable instrument since it makes most of the Bayesian inference techniques described in this thesis easily available for everyone.
- Published
- 2008
47. Unsupervised empirical Bayesian multiple testing with external covariates
- Author
-
Egil Ferkingstad, Arnoldo Frigessi, Håvard Rue, Gudmar Thorleifsson, and Augustine Kong
- Subjects
FOS: Computer and information sciences ,Statistics and Probability ,Bioinformatics ,Computer science ,Bayesian probability ,Posterior probability ,false discovery rates ,Statistics - Applications ,multiple hypothesis testing ,Modeling and Simulation ,Statistical significance ,Statistics ,Covariate ,Multiple comparisons problem ,Statistics::Methodology ,Probability distribution ,Applications (stat.AP) ,Statistics, Probability and Uncertainty ,Null hypothesis ,data integration ,empirical Bayes ,Statistical hypothesis testing - Abstract
In an empirical Bayesian setting, we provide a new multiple testing method, useful when an additional covariate is available, that influences the probability of each null hypothesis being true. We measure the posterior significance of each test conditionally on the covariate and the data, leading to greater power. Using covariate-based prior information in an unsupervised fashion, we produce a list of significant hypotheses which differs in length and order from the list obtained by methods not taking covariate-information into account. Covariate-modulated posterior probabilities of each null hypothesis are estimated using a fast approximate algorithm. The new method is applied to expression quantitative trait loci (eQTL) data., Published in at http://dx.doi.org/10.1214/08-AOAS158 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)
- Published
- 2008
48. Discussion of 'modern statistics of spatial point processes'
- Author
-
Olle Häggström, Jesper Møller, Andrew B. Lawson, Yongtao Guan, Peter Guttorp, Jorge Mateu, Dietrich Stoyan, G Hellmund, A Sarkke, Sara Martino, Pavel Grabarnik, Wilfrid S. Kendall, M Prokesovva, Lothar Heinrich, Adrian Baddeley, Nicolas Chopin, Antti Penttinen, Håvard Rue, Rasmus Waagepetersen, U Hahn, Jean-Michel Billiot, Noel A Cressie, Frederic Paik Schoenberg, Statistique Appliquée et de Géométrie Aléatoire de Grenoble (SAGAG), Laboratoire Jean Kuntzmann (LJK), and Université Pierre Mendès France - Grenoble 2 (UPMF)-Université Joseph Fourier - Grenoble 1 (UJF)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Centre National de la Recherche Scientifique (CNRS)-Université Pierre Mendès France - Grenoble 2 (UPMF)-Université Joseph Fourier - Grenoble 1 (UJF)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Centre National de la Recherche Scientifique (CNRS)
- Subjects
Statistics and Probability ,010104 statistics & probability ,Point (typography) ,[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST] ,010102 general mathematics ,Statistics ,Mathematical statistics ,[STAT.TH]Statistics [stat]/Statistics Theory [stat.TH] ,0101 mathematics ,Statistics, Probability and Uncertainty ,01 natural sciences ,Point process ,Mathematics - Abstract
The paper ‘Modern statistics for spatial point processes' by Jesper Møller and Rasmus P. Waagepetersen is based on a special invited lecture given by the authors at the 21st Nordic Conference on Mathematical Statistics, held at Rebild, Denmark, in June 2006. At the conference, Antti Penttinen and Eva B. Vedel Jensen were invited to discuss the paper. We here present the comments from the two invited discussants and from a number of other scholars, as well as the authors' responses to these comments. Below Figure 1, Figure 2, etc., refer to figures in the paper under discussion, while Figure A, Figure B, etc., refer to figures in the current discussion. All numbered sections and formulas refer to the paper. The paper ‘Modern statistics for spatial point processes' by Jesper Møller and Rasmus P. Waagepetersen is based on a special invited lecture given by the authors at the 21st Nordic Conference on Mathematical Statistics, held at Rebild, Denmark, in June 2006. At the conference, Antti Penttinen and Eva B. Vedel Jensen were invited to discuss the paper. We here present the comments from the two invited discussants and from a number of other scholars, as well as the authors' responses to these comments. Below Figure 1, Figure 2, etc., refer to figures in the paper under discussion, while Figure A, Figure B, etc., refer to figures in the current discussion. All numbered sections and formulas refer to the paper.
- Published
- 2007
49. Approximating Hidden Gaussian Markov Random Fields
- Author
-
Håvard Rue, Ingelin Steinsland, and Sveinung Erland
- Subjects
Statistics and Probability ,Mathematical optimization ,Random field ,Statistics::Applications ,Markov chain ,Gaussian ,Normalizing constant ,Markov chain Monte Carlo ,Markov model ,symbols.namesake ,symbols ,Applied mathematics ,Markov property ,Statistics, Probability and Uncertainty ,Hidden Markov model ,Mathematical Physics and Mathematics ,Mathematics - Abstract
SummaryGaussian Markov random-field (GMRF) models are frequently used in a wide variety of applications. In most cases parts of the GMRF are observed through mutually independent data; hence the full conditional of the GMRF, a hidden GMRF (HGMRF), is of interest. We are concerned with the case where the likelihood is non-Gaussian, leading to non-Gaussian HGMRF models. Several researchers have constructed block sampling Markov chain Monte Carlo schemes based on approximations of the HGMRF by a GMRF, using a second-order expansion of the log-density at or near the mode. This is possible as the GMRF approximation can be sampled exactly with a known normalizing constant. The Markov property of the GMRF approximation yields computational efficiency.The main contribution in the paper is to go beyond the GMRF approximation and to construct a class of non-Gaussian approximations which adapt automatically to the particular HGMRF that is under study. The accuracy can be tuned by intuitive parameters to nearly any precision. These non-Gaussian approximations share the same computational complexity as those which are based on GMRFs and can be sampled exactly with computable normalizing constants. We apply our approximations in spatial disease mapping and model-based geostatistical models with different likelihoods, obtain procedures for block updating and construct Metropolized independence samplers.
- Published
- 2003
50. Introduction to 'Fast matrix computations for functional additive models' by S. Barthelmé
- Author
-
Håvard Rue
- Subjects
Statistics and Probability ,Algebra ,Matrix (mathematics) ,Computational Theory and Mathematics ,Computation ,Statistics, Probability and Uncertainty ,Additive model ,Algorithm ,Theoretical Computer Science ,Mathematics - Published
- 2014
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.