1. Bayesian Variable Selection for Gaussian Copula Regression Models
- Author
-
Angelos Alexopoulos, Leonardo Bottolo, Bottolo, L [0000-0002-6381-2327], Apollo - University of Cambridge Repository, Alexopoulos, Angelis [0000-0002-5723-6570], and Bottolo, Leonardo [0000-0002-6381-2327]
- Subjects
FOS: Computer and information sciences ,Statistics and Probability ,Statistics::Theory ,Variable selection ,Mixed data ,Bayesian probability ,Feature selection ,Statistics - Applications ,Statistics - Computation ,01 natural sciences ,Article ,Methodology (stat.ME) ,Statistics::Machine Learning ,010104 statistics & probability ,0502 economics and business ,Statistics ,Sparse co-variance matrix ,Statistics::Methodology ,Discrete Mathematics and Combinatorics ,Applications (stat.AP) ,0101 mathematics ,stat.AP ,Statistics - Methodology ,Computation (stat.CO) ,050205 econometrics ,Mathematics ,stat.CO ,Bayesian variable selection ,05 social sciences ,Regression analysis ,16. Peace & justice ,Multiple-response regression models ,Statistics::Computation ,Gaussian copula ,stat.ME ,62J12, 62P10 ,Statistics, Probability and Uncertainty ,human activities - Abstract
We develop a novel Bayesian method to select important predictors in regression models with multiple responses of diverse types. A sparse Gaussian copula regression model is used to account for the multivariate dependencies between any combination of discrete and/or continuous responses and their association with a set of predictors. We utilize the parameter expansion for data augmentation strategy to construct a Markov chain Monte Carlo algorithm for the estimation of the parameters and the latent variables of the model. Based on a centered parametrization of the Gaussian latent variables, we design a fixed-dimensional proposal distribution to update jointly the latent binary vectors of important predictors and the corresponding non-zero regression coefficients. For Gaussian responses and for outcomes that can be modeled as a dependent version of a Gaussian response, this proposal leads to a Metropolis-Hastings step that allows an efficient exploration of the predictors' model space. The proposed strategy is tested on simulated data and applied to real data sets in which the responses consist of low-intensity counts, binary, ordinal and continuous variables., Comment: 39 pages main paper and 21 pages Supplementary Material
- Published
- 2020
- Full Text
- View/download PDF