522 results on '"Deshpande, Sameer"'
Search Results
2. Bayesian Causal Forests & the 2022 ACIC Data Challenge: Scalability and Sensitivity
- Author
-
Kokandakar, Ajinkya H., Kang, Hyunseung, and Deshpande, Sameer K.
- Published
- 2023
3. Adolescent sports participation and health in early adulthood: An observational study
- Author
-
Kokandakar, Ajinkya H., Lin, Yuzhou, Jin, Steven, Weiss, Jordan, Rabinowitz, Amanda R., May, Reuben A. Buford, Small, Dylan, and Deshpande, Sameer K.
- Subjects
Statistics - Applications - Abstract
We study the impact of teenage sports participation on early-adulthood health using longitudinal data from the National Study of Youth and Religion. We focus on two primary outcomes measured at ages 23--28 -- self-rated health and total score on the PHQ9 Patient Depression Questionnaire -- and control for several potential confounders related to demographics and family socioeconomic status. To probe the possibility that certain types of sports participation may have larger effects on health than others, we conduct a matched observational study at each level within a hierarchy of exposures. Our hierarchy ranges from broadly defined exposures (e.g., participation in any organized after-school activity) to narrow (e.g., participation in collision sports). We deployed an ordered testing approach that exploits the hierarchical relationships between our exposure definitions to perform our analyses while maintaining a fixed family-wise error rate. Compared to teenagers who did not participate in any after-school activities, those who participated in sports had statistically significantly better self-rated and mental health outcomes in early adulthood., Comment: The pre-analysis protocol for this study is available at arXiv:2211.02104
- Published
- 2024
4. New directions in algebraic statistics: Three challenges from 2023
- Author
-
Alexandr, Yulia, Bakenhus, Miles, Curiel, Mark, Deshpande, Sameer K., Gross, Elizabeth, Gu, Yuqi, Hill, Max, Johnson, Joseph, Kagy, Bryson, Karwa, Vishesh, Li, Jiayi, Lyu, Hanbaek, Petrović, Sonja, and Rodriguez, Jose Israel
- Subjects
Mathematics - Statistics Theory ,62R01 - Abstract
In the last quarter of a century, algebraic statistics has established itself as an expanding field which uses multilinear algebra, commutative algebra, computational algebra, geometry, and combinatorics to tackle problems in mathematical statistics. These developments have found applications in a growing number of areas, including biology, neuroscience, economics, and social sciences. Naturally, new connections continue to be made with other areas of mathematics and statistics. This paper outlines three such connections: to statistical models used in educational testing, to a classification problem for a family of nonparametric regression models, and to phase transition phenomena under uniform sampling of contingency tables. We illustrate the motivating problems, each of which is for algebraic statistics a new direction, and demonstrate an enhancement of related methodologies., Comment: This research was performed while the authors were visiting the Institute for Mathematical and Statistical Innovation (IMSI), which is supported by the National Science Foundation (Grant No. DMS-1929348). We participated in the long program "Algebraic Statistics and Our Changing World"
- Published
- 2024
5. Evaluating plate discipline in Major League Baseball with Bayesian Additive Regression Trees
- Author
-
Yee, Ryan and Deshpande, Sameer K.
- Subjects
Statistics - Applications - Abstract
We introduce a three-step framework to determine at which pitches Major League batters should swing. Unlike traditional plate discipline metrics, which implicitly assume that all batters should always swing at (resp. take) pitches inside (resp. outside) the strike zone, our approach explicitly accounts not only for the players and umpires involved in the pitch but also in-game contextual information like the number of outs, the count, baserunners, and score. We first fit flexible Bayesian nonparametric models to estimate (i) the probability that the pitch is called a strike if the batter takes the pitch; (ii) the probability that the batter makes contact if he swings; and (iii) the number of runs the batting team is expected to score following each pitch outcome (e.g. swing and miss, take a called strike, etc.). We then combine these intermediate estimates to determine whether swinging increases the batting team's run expectancy. Our approach enables natural uncertainty propagation so that we can not only determine the optimal swing/take decision but also quantify our confidence in that decision. We illustrate our framework using a case study of pitches faced by Mike Trout in 2019.
- Published
- 2023
- Full Text
- View/download PDF
6. Are you using test log-likelihood correctly?
- Author
-
Deshpande, Sameer K., Ghosh, Soumya, Nguyen, Tin D., and Broderick, Tamara
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning ,Statistics - Other Statistics - Abstract
Test log-likelihood is commonly used to compare different models of the same data or different approximate inference algorithms for fitting the same probabilistic model. We present simple examples demonstrating how comparisons based on test log-likelihood can contradict comparisons according to other objectives. Specifically, our examples show that (i) approximate Bayesian inference algorithms that attain higher test log-likelihoods need not also yield more accurate posterior approximations and (ii) conclusions about forecast accuracy based on test log-likelihood comparisons may not agree with conclusions based on root mean squared error., Comment: Presented at the ICBINB Workshop at NeurIPS 2022. This version accepted at TMLR, available at https://openreview.net/forum?id=n2YifD4Dxo
- Published
- 2022
7. flexBART: Flexible Bayesian regression trees with categorical predictors
- Author
-
Deshpande, Sameer K.
- Subjects
Statistics - Methodology ,Statistics - Machine Learning - Abstract
Most implementations of Bayesian additive regression trees (BART) one-hot encode categorical predictors, replacing each one with several binary indicators, one for every level or category. Regression trees built with these indicators partition the discrete set of categorical levels by repeatedly removing one level at a time. Unfortunately, the vast majority of partitions cannot be built with this strategy, severely limiting BART's ability to partially pool data across groups of levels. Motivated by analyses of baseball data and neighborhood-level crime dynamics, we overcame this limitation by re-implementing BART with regression trees that can assign multiple levels to both branches of a decision tree node. To model spatial data aggregated into small regions, we further proposed a new decision rule prior that creates spatially contiguous regions by deleting a random edge from a random spanning tree of a suitably defined network. Our re-implementation, which is available in the flexBART package, often yields improved out-of-sample predictive performance and scales better to larger datasets than existing implementations of BART., Comment: Software available at https://github.com/skdeshpande91/flexBART
- Published
- 2022
8. Pre-analysis protocol for an observational study on the effects of adolescent sports participation on health in early adulthood
- Author
-
Kokandakar, Ajinkya H, Lin, Yuzhou, Jin, Steven, Weiss, Jordan, Rabinowitz, Amanda R, May, Reuben A Buford, Small, Dylan, and Deshpande, Sameer K
- Subjects
Statistics - Applications - Abstract
We will study the impact of adolescent sports participation on early-adulthood health using longitudinal data from the National Study of Youth and Religion. We focus on two primary outcomes measured at ages 23--28 -- self-rated health and total score on the PHQ9 Patient Depression Questionnaire -- and control for several potential confounders related to demographics and family socioeconomic status. Comparing outcomes between sports participants and matched non-sports participants with similar confounders is straightforward. Unfortunately, an analysis based on such a broad exposure cannot probe the possibility that participation in certain types of sports (e.g. collision sports like football or soccer) may have larger effects on health than others. In this study, we introduce a hierarchy of exposure definitions, ranging from broad (participation in any after-school organized activity) to narrow (e.g. participation in limited-contact sports). We will perform separate matched observational studies, one for each definition, to estimate the health effects of several levels of sports participation. In order to conduct these studies while maintaining a fixed family-wise error rate, we developed an ordered testing approach that exploits the logical relationships between exposure definitions. Our study will also consider several secondary outcomes including body mass index, life satisfaction, and problematic drinking behavior.
- Published
- 2022
9. Bayesian Causal Forests & the 2022 ACIC Data Challenge: Scalability and Sensitivity
- Author
-
Kokandakar, Ajinkya H., Kang, Hyunseung, and Deshpande, Sameer K.
- Subjects
Statistics - Applications - Abstract
We demonstrate how Hahn et al.'s Bayesian Causal Forests model (BCF) can be used to estimate conditional average treatment effects for the longitudinal dataset in the 2022 American Causal Inference Conference Data Challenge. Unfortunately, existing implementations of BCF do not scale to the size of the challenge data. Therefore, we developed flexBCF -- a more scalable and flexible implementation of BCF -- and used it in our challenge submission. We investigate the sensitivity of our results to the choice of propensity score estimation method and the use of sparsity-inducing regression tree priors. While we found that our overall point predictions were not especially sensitive to these modeling choices, we did observe that running BCF with flexibly estimated propensity scores often yielded better-calibrated uncertainty intervals.
- Published
- 2022
10. A Bayesian analysis of the time through the order penalty in baseball
- Author
-
Brill, Ryan S., Deshpande, Sameer K., and Wyner, Abraham J.
- Subjects
Statistics - Applications - Abstract
As a baseball game progresses, batters appear to perform better the more times they face a particular pitcher. The apparent drop-off in pitcher performance from one time through the order to the next, known as the Time Through the Order Penalty (TTOP), is often attributed to within-game batter learning. Although the TTOP has largely been accepted within baseball and influences many managers' in-game decision making, we argue that existing approaches of estimating the size of the TTOP cannot disentangle continuous evolution in pitcher performance over the course of the game from discontinuities between successive times through the order. Using a Bayesian multinomial regression model, we find that, after adjusting for confounders like batter and pitcher quality, handedness, and home field advantage, there is little evidence of strong discontinuity in pitcher performance between times through the order. Our analysis suggests that the start of the third time through the order should not be viewed as a special cutoff point in deciding whether to pull a starting pitcher., Comment: Accepted to JQAS
- Published
- 2022
- Full Text
- View/download PDF
11. Posterior contraction and uncertainty quantification for the multivariate spike-and-slab LASSO
- Author
-
Shen, Yunyi and Deshpande, Sameer K.
- Subjects
Mathematics - Statistics Theory - Abstract
We study the asymptotic properties of Deshpande et al.\ (2019)'s multivariate spike-and-slab LASSO (mSSL) procedure for simultaneous variable and covariance selection in the sparse multivariate linear regression problem. In that problem, $q$ correlated responses are regressed onto $p$ covariates and the mSSL works by placing separate spike-and-slab priors on the entries in the matrix of marginal covariate effects and off-diagonal elements in the upper triangle of the residual precision matrix. Under mild assumptions about these matrices, we establish the posterior contraction rate for the mSSL posterior in the asymptotic regime where both $p$ and $q$ diverge with $n.$ By ``de-biasing'' the corresponding MAP estimates, we obtain confidence intervals for each covariate effect and residual partial correlation. In extensive simulation studies, these intervals displayed close-to-nominal frequentist coverage in finite sample settings but tended to be substantially longer than those obtained using a version of the Bayesian bootstrap that randomly re-weights the prior. We further show that the de-biased intervals for individual covariate effects are asymptotically valid.
- Published
- 2022
12. Estimating sparse direct effects in multivariate regression with the spike-and-slab LASSO
- Author
-
Shen, Yunyi, Solís-Lemus, Claudia, and Deshpande, Sameer K.
- Subjects
Statistics - Methodology - Abstract
The multivariate regression interpretation of the Gaussian chain graph model simultaneously parametrizes (i) the direct effects of $p$ predictors on $q$ outcomes and (ii) the residual partial covariances between pairs of outcomes. We introduce a new method for fitting sparse Gaussian chain graph models with spike-and-slab LASSO (SSL) priors. We develop an Expectation Conditional Maximization algorithm to obtain sparse estimates of the $p \times q$ matrix of direct effects and the $q \times q$ residual precision matrix. Our algorithm iteratively solves a sequence of penalized maximum likelihood problems with self-adaptive penalties that gradually filter out negligible regression coefficients and partial covariances. Because it adaptively penalizes individual model parameters, our method is seen to outperform fixed-penalty competitors on simulated data. We establish the posterior contraction rate for our model, buttressing our method's excellent empirical performance with strong theoretical guarantees. Using our method, we estimated the direct effects of diet and residence type on the composition of the gut microbiome of elderly adults.
- Published
- 2022
13. Measuring the robustness of Gaussian processes to kernel choice
- Author
-
Stephenson, William T., Ghosh, Soumya, Nguyen, Tin D., Yurochkin, Mikhail, Deshpande, Sameer K., and Broderick, Tamara
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning ,Statistics - Computation - Abstract
Gaussian processes (GPs) are used to make medical and scientific decisions, including in cardiac care and monitoring of atmospheric carbon dioxide levels. Notably, the choice of GP kernel is often somewhat arbitrary. In particular, uncountably many kernels typically align with qualitative prior knowledge (e.g.\ function smoothness or stationarity). But in practice, data analysts choose among a handful of convenient standard kernels (e.g.\ squared exponential). In the present work, we ask: Would decisions made with a GP differ under other, qualitatively interchangeable kernels? We show how to answer this question by solving a constrained optimization problem over a finite-dimensional space. We can then use standard optimizers to identify substantive changes in relevant decisions made with a GP. We demonstrate in both synthetic and real-world examples that decisions made with a GP can exhibit non-robustness to kernel choice, even when prior draws are qualitatively interchangeable to a user., Comment: AISTATS 2022
- Published
- 2021
14. The personal and the social: Twin contributors to climate action
- Author
-
Bradley, Graham L., Deshpande, Sameer, and Paas, Karlien H.W.
- Published
- 2024
- Full Text
- View/download PDF
15. Confidently Comparing Estimators with the c-value
- Author
-
Trippe, Brian L., Deshpande, Sameer K., and Broderick, Tamara
- Subjects
Statistics - Methodology - Abstract
Modern statistics provides an ever-expanding toolkit for estimating unknown parameters. Consequently, applied statisticians frequently face a difficult decision: retain a parameter estimate from a familiar method or replace it with an estimate from a newer or more complex one. While it is traditional to compare estimates using risk, such comparisons are rarely conclusive in realistic settings. In response, we propose the "c-value" as a measure of confidence that a new estimate achieves smaller loss than an old estimate on a given dataset. We show that it is unlikely that a large c-value coincides with a larger loss for the new estimate. Therefore, just as a small p-value supports rejecting a null hypothesis, a large c-value supports using a new estimate in place of the old. For a wide class of problems and estimates, we show how to compute a c-value by first constructing a data-dependent high-probability lower bound on the difference in loss. The c-value is frequentist in nature, but we show that it can provide validation of shrinkage estimates derived from Bayesian models in real data applications involving hierarchical models and Gaussian processes., Comment: Accepted for publication in the Journal of the American Statistical Association
- Published
- 2021
16. Approximate Cross-Validation for Structured Models
- Author
-
Ghosh, Soumya, Stephenson, William T., Nguyen, Tin D., Deshpande, Sameer K., and Broderick, Tamara
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning ,Statistics - Computation ,Statistics - Methodology - Abstract
Many modern data analyses benefit from explicitly modeling dependence structure in data -- such as measurements across time or space, ordered words in a sentence, or genes in a genome. A gold standard evaluation technique is structured cross-validation (CV), which leaves out some data subset (such as data within a time interval or data in a geographic region) in each fold. But CV here can be prohibitively slow due to the need to re-run already-expensive learning algorithms many times. Previous work has shown approximate cross-validation (ACV) methods provide a fast and provably accurate alternative in the setting of empirical risk minimization. But this existing ACV work is restricted to simpler models by the assumptions that (i) data across CV folds are independent and (ii) an exact initial model fit is available. In structured data analyses, both these assumptions are often untrue. In the present work, we address (i) by extending ACV to CV schemes with dependence structure between the folds. To address (ii), we verify -- both theoretically and empirically -- that ACV quality deteriorates smoothly with noise in the initial fit. We demonstrate the accuracy and computational benefits of our proposed methods on a diverse set of real-world applications., Comment: 25 pages, 8 figures. NeurIPS 2020 camera ready. v2 fixes typos and provides additional empirical results. Code: https://github.com/SoumyaTGhosh/structured-infinitesimal-jackknife
- Published
- 2020
17. VCBART: Bayesian trees for varying coefficients
- Author
-
Deshpande, Sameer K., Bai, Ray, Balocchi, Cecilia, Starling, Jennifer E., and Weiss, Jordan
- Subjects
Statistics - Methodology - Abstract
The linear varying coefficient models posits a linear relationship between an outcome and covariates in which the covariate effects are modeled as functions of additional effect modifiers. Despite a long history of study and use in statistics and econometrics, state-of-the-art varying coefficient modeling methods cannot accommodate multivariate effect modifiers without imposing restrictive functional form assumptions or involving computationally intensive hyperparameter tuning. In response, we introduce VCBART, which flexibly estimates the covariate effect in a varying coefficient model using Bayesian Additive Regression Trees. With simple default settings, VCBART outperforms existing varying coefficient methods in terms of covariate effect estimation, uncertainty quantification, and outcome prediction. We illustrate the utility of VCBART with two case studies: one examining how the association between later-life cognition and measures of socioeconomic position vary with respect to age and socio-demographics and another estimating how temporal trends in urban crime vary at the neighborhood level. An R package implementing VCBART is available at https://github.com/skdeshpande91/VCBART
- Published
- 2020
18. Crime in Philadelphia: Bayesian Clustering with Particle Optimization
- Author
-
Balocchi, Cecilia, Deshpande, Sameer K., George, Edward I., and Jensen, Shane T.
- Subjects
Statistics - Applications ,Statistics - Methodology - Abstract
Accurate estimation of the change in crime over time is a critical first step towards better understanding of public safety in large urban environments. Bayesian hierarchical modeling is a natural way to study spatial variation in urban crime dynamics at the neighborhood level, since it facilitates principled ``sharing of information'' between spatially adjacent neighborhoods. Typically, however, cities contain many physical and social boundaries that may manifest as spatial discontinuities in crime patterns. In this situation, standard prior choices often yield overly-smooth parameter estimates, which can ultimately produce mis-calibrated forecasts. To prevent potential over-smoothing, we introduce a prior that partitions the set of neighborhoods into several clusters and encourages spatial smoothness within each cluster. In terms of model implementation, conventional stochastic search techniques are computationally prohibitive, as they must traverse a combinatorially vast space of partitions. We introduce an ensemble optimization procedure that simultaneously identifies several high probability partitions by solving one optimization problem using a new local search strategy. We then use the identified partitions to estimate crime trends in Philadelphia between 2006 and 2017. On simulated and real data, our proposed method demonstrates good estimation and partition selection performance.
- Published
- 2019
19. Expected Hypothetical Completion Probability
- Author
-
Deshpande, Sameer K. and Evans, Katherine
- Subjects
Statistics - Applications - Abstract
Using high-resolution player tracking data made available by the National Football League (NFL) for their 2019 Big Data Bowl competition, we introduce the Expected Hypothetical Completion Probability (EHCP), a objective framework for evaluating plays. At the heart of EHCP is the question "on a given passing play, did the quarterback throw the pass to the receiver who was most likely to catch it?" To answer this question, we first built a Bayesian non-parametric catch probability model that automatically accounts for complex interactions between inputs like the receiver's speed and distances to the ball and nearest defender. While building such a model is, in principle, straightforward, using it to reason about a hypothetical pass is challenging because many of the model inputs corresponding to a hypothetical are necessarily unobserved. To wit, it is impossible to observe how close an un-targeted receiver would be to his nearest defender had the pass been thrown to him instead of the receiver who was actually targeted. To overcome this fundamental difficulty, we propose imputing the unobservable inputs and averaging our model predictions across these imputations to derive EHCP. In this way, EHCP can track how the completion probability evolves for each receiver over the course of a play in a way that accounts for the uncertainty about missing inputs., Comment: This paper elaborates on work done for the NFL 2019 Big Data Bowl contest. Manuscript accepted at the Journal of Quantitative Analysis in Sports
- Published
- 2019
20. Protocol for an Observational Study of the Association of High School Football Participation on Health in Late Adulthood
- Author
-
Gaulton, Timothy G., Deshpande, Sameer K., Small, Dylan S., and Neuman, Mark D.
- Subjects
Statistics - Applications - Abstract
American football is the most popular high school sport and is among the leading cause of injury among adolescents. While there has been considerable recent attention on the link between football and cognitive decline, there is also evidence of higher than expected rates of pain, obesity, and lower quality of life among former professional players, either as a result of repetitive head injury or through different mechanisms. Previously hidden downstream effects of playing football may have far-reaching public health implications for participants in youth and high school football programs. Our proposed study is a retrospective observational study that compares 1,153 high school males who played varsity football with 2,751 male students who did not. 1,951 of the control subjects did not play any sport and the remaining 800 controls played a non-contact sport. Our primary outcome is self-rated health measured at age 65. To control for potential confounders, we adjust for pre-exposure covariates with matching and model-based covariance adjustment. We will conduct an ordered testing procedure designed to use the full pool of 2,751 controls while also controlling for possible unmeasured differences between students who played sports and those who did not. We will quantitatively assess the sensitivity of the results to potential unmeasured confounding. The study will also assess secondary outcomes of pain, difficulty with activities of daily living, and obesity, as these are both important to individual well-being and have public health relevance.
- Published
- 2019
21. Protocol for an observational study on the effects of playing football in adolescence on mental health in early adulthood
- Author
-
Deshpande, Sameer K., Hasegawa, Raiden B., Weiss, Jordan, and Small, Dylan S.
- Subjects
Statistics - Applications - Abstract
More than 1 million students play high school American football annually, but many health professionals have recently questioned its safety or called for its ban. These concerns have been partially driven by reports of chronic traumatic encephalopathy (CTE), increased risks of neurodegenerative disease, and associations between concussion history and later-life cognitive impairment and depression among retired professional football players. A recent observational study of a cohort of men who graduated from a Wisconsin high school in 1957 found no statistically significant harmful effects of playing high school football on a range of cognitive, psychological, and socio-economic outcomes measured at ages 35, 54, 65, and 72. Unfortunately, these findings may not generalize to younger populations, thanks to changes and improvements in football helmet technology and training techniques. In particular, these changes may have led to increased perceptions of safety but ultimately more dangerous styles of play, characterized by the frequent sub-concussive impacts thought to be associated with later-life neurological decline. In this work, we replicate the methodology of that earlier matched observational study using data from the National Longitudinal Study of Adolescent to Adult Health (Add Health). These include adolescent and family co-morbidities, academic experience, self-reported levels of general health and physical activity, and the score on the Add Health Picture Vocabulary Test. Our primary outcome is the CES-D score measured in 2008 when subjects were aged 24 -- 34 and settling into early adulthood. We also examine several secondary outcomes related to physical and psychological health, including suicidality. Our results can provide insight into the natural history of potential football-related decline and dysfunction., Comment: Updated tables summarizing the matches constructed
- Published
- 2018
22. Anaesthesia for Laparoscopic Surgery
- Author
-
Nwachukwu, Cyril E., Deshpande, Sameer, Ikechebelu, Joseph Ifeanyichukwu, Okohue, Jude Ehiabhi, editor, Ikechebelu, Joseph Ifeanyichukwu, editor, Ola, Bolarinde, editor, Kalu, Emmanuel, editor, and Ibeanu, Okechukwu, editor
- Published
- 2022
- Full Text
- View/download PDF
23. The social marketing paradox: challenges and opportunities for the discipline
- Author
-
Akbar, M. Bilal, Foote, Liz, Lawson, Alison, French, Jeff, Deshpande, Sameer, and Lee, Nancy R.
- Published
- 2022
- Full Text
- View/download PDF
24. Protocol for an Observational Study on the Effects of Early-Life Participation in Contact Sports on Later-Life Cognition in a Sample of Monozygotic and Dizygotic Swedish Twins Reared Together and Twins Reared Apart
- Author
-
Weiss, Jordan, Rabinowitz, Amanda R., Deshpande, Sameer K., Hasegawa, Raiden B., and Small, Dylan S.
- Subjects
Statistics - Applications - Abstract
A large body of work links traumatic brain injury (TBI) in adulthood to the onset of Alzheimer's disease (AD). AD is the chief cause of dementia, leading to reduced cognitive capacity and autonomy and increased mortality risk. More recently, researchers have sought to investigate whether TBI experienced in early-life may influence trajectories of cognitive dysfunction in adulthood. It has been speculated that early-life participation in collision sports may lead to poor cognitive and mental health outcomes. However, to date, the few studies to investigate this relationship have produced mixed results. We propose to extend this literature by conducting a prospective study on the effects of early-life participation in collision sports on later-life cognitive health using the Swedish Adoption/Twin Study on Aging (SATSA). The SATSA is unique in its sampling of monozygotic and dizygotic twins reared together (respectively MZT, DZT) and twins reared apart (respectively MZA, DZA). The proposed analysis is a prospective study of 660 individuals comprised of 270 twin pairs and 120 singletons. Seventy-eight (11.8% individuals reported participation in collision sports. Our primary outcome will be an indicator of cognitive impairment determined by scores on the Mini-Mental State Examination (MMSE). We will also consider several secondary cognitive outcomes including verbal and spatial ability, memory, and processing speed. Our sample will be restricted to individuals with at least one MMSE score out of seven repeated assessments spaced approximately three years apart. We will adjust for age, sex, and education in each of our models., Comment: Updated methodology and tables
- Published
- 2018
25. Simultaneous Variable and Covariance Selection with the Multivariate Spike-and-Slab Lasso
- Author
-
Deshpande, Sameer K., Rockova, Veronika, and George, Edward I.
- Subjects
Statistics - Methodology - Abstract
We propose a Bayesian procedure for simultaneous variable and covariance selection using continuous spike-and-slab priors in multivariate linear regression models where q possibly correlated responses are regressed onto p predictors. Rather than relying on a stochastic search through the high-dimensional model space, we develop an ECM algorithm similar to the EMVS procedure of Rockova & George (2014) targeting modal estimates of the matrix of regression coefficients and residual precision matrix. Varying the scale of the continuous spike densities facilitates dynamic posterior exploration and allows us to filter out negligible regression coefficients and partial covariances gradually. Our method is seen to substantially outperform regularization competitors on simulated data. We demonstrate our method with a re-examination of data from a recent observational study of the effect of playing high school football on several later-life cognition, psychological, and socio-economic outcomes.
- Published
- 2017
- Full Text
- View/download PDF
26. Corporate social marketing
- Author
-
Campbell, Alexander, primary, Deshpande, Sameer, additional, and Rundle-Thiele, Sharyn, additional
- Published
- 2022
- Full Text
- View/download PDF
27. Causal Inference with Two Versions of Treatment
- Author
-
Hasegawa, Raiden B., Deshpande, Sameer K., Small, Dylan S., and Rosenbaum, Paul R.
- Abstract
Causal effects are commonly defined as comparisons of the potential outcomes under treatment and control, but this definition is threatened by the possibility that either the treatment or the control condition is not well defined, existing instead in more than one version. This is often a real possibility in nonexperimental or observational studies of treatments because these treatments occur in the natural or social world without the laboratory control needed to ensure identically the same treatment or control condition occurs in every instance. We consider the simplest case: Either the treatment condition or the control condition exists in two versions that are easily recognized in the data but are of uncertain, perhaps doubtful, relevance, for example, branded Advil versus generic ibuprofen. Common practice does not address versions of treatment: Typically, the issue is either ignored or explicitly stated but assumed to be absent. Common practice is reluctant to address two versions of treatment because the obvious solution entails dividing the data into two parts with two analyses, thereby (a) reducing power to detect versions of treatment in each part, (b) creating problems of multiple inference in coordinating the two analyses, and (c) failing to report a single primary analysis that uses everyone. We propose and illustrate a new method of analysis that begins with a single primary analysis of everyone that would be correct if the two versions do not differ, adds a second analysis that would be correct were there two different effects for the two versions, controls the family-wise error rate in all assertions made by the several analyses, and yet pays no price in power to detect a constant treatment effect in the primary analysis of everyone. Our method can be applied to analyses of constant additive treatment effects on continuous outcomes. Unlike conventional simultaneous inferences, the new method is coordinating several analyses that are valid under different assumptions, so that one analysis would never be performed if one knew for certain that the assumptions of the other analysis are true. It is a multiple assumptions problem rather than a multiple hypotheses problem. We discuss the relative merits of the method with respect to more conventional approaches to analyzing multiple comparisons. The method is motivated and illustrated using a study of the possibility that repeated head trauma in high school football causes an increase in risk of early onset cognitive decline.
- Published
- 2020
- Full Text
- View/download PDF
28. Online banking and privacy: redesigning sales strategy through social exchange
- Author
-
Liyanaarachchi, Gajendra, Deshpande, Sameer, and Weaven, Scott
- Published
- 2021
- Full Text
- View/download PDF
29. Causal Inference with Two Versions of Treatment
- Author
-
Hasegawa, Raiden B., Deshpande, Sameer K., Small, Dylan S., and Rosenbaum, Paul R.
- Subjects
Statistics - Methodology - Abstract
Causal effects are commonly defined as comparisons of the potential outcomes under treatment and control, but this definition is threatened by the possibility that the treatment or control condition is not well-defined, existing instead in more than one version. A simple, widely applicable analysis is proposed to address the possibility that the treatment or control condition exists in two versions with two different treatment effects. This analysis loses no power in the main comparison of treatment and control, provides additional information about version effects, and controls the family-wise error rate in several comparisons. The method is motivated and illustrated using an on-going study of the possibility that repeated head trauma in high school football causes an increase in risk of early on-set dementia.
- Published
- 2017
30. A Hierarchical Bayesian Model of Pitch Framing
- Author
-
Deshpande, Sameer K. and Wyner, Abraham J.
- Subjects
Statistics - Applications - Abstract
Since the advent of high-resolution pitch tracking data (PITCHf/x), many in the sabermetrics community have attempted to quantify a Major League Baseball catcher's ability to "frame" a pitch (i.e. increase the chance that a pitch is called as a strike). Especially in the last three years, there has been an explosion of interest in the "art of pitch framing" in the popular press as well as signs that teams are considering framing when making roster decisions. We introduce a Bayesian hierarchical model to estimate each umpire's probability of calling a strike, adjusting for pitch participants, pitch location, and contextual information like the count. Using our model, we can estimate each catcher's effect on an umpire's chance of calling a strike.We are then able to translate these estimated effects into average runs saved across a season. We also introduce a new metric, analogous to Jensen, Shirley, and Wyner's Spatially Aggregate Fielding Evaluation metric, which provides a more honest assessment of the impact of framing.
- Published
- 2017
- Full Text
- View/download PDF
31. Confidently Comparing Estimates with the c-value.
- Author
-
Trippe, Brian L., Deshpande, Sameer K., and Broderick, Tamara
- Subjects
- *
NULL hypothesis , *DECISION theory , *STATISTICIANS , *STATISTICS , *DATA modeling - Abstract
Modern statistics provides an ever-expanding toolkit for estimating unknown parameters. Consequently, applied statisticians frequently face a difficult decision: retain a parameter estimate from a familiar method or replace it with an estimate from a newer or more complex one. While it is traditional to compare estimates using risk, such comparisons are rarely conclusive in realistic settings. In response, we propose the "c-value" as a measure of confidence that a new estimate achieves smaller loss than an old estimate on a given dataset. We show that it is unlikely that a large c-value coincides with a larger loss for the new estimate. Therefore, just as a small p-value supports rejecting a null hypothesis, a large c-value supports using a new estimate in place of the old. For a wide class of problems and estimates, we show how to compute a c-value by first constructing a data-dependent high-probability lower bound on the difference in loss. The c-value is frequentist in nature, but we show that it can provide validation of shrinkage estimates derived from Bayesian models in real data applications involving hierarchical models and Gaussian processes. Supplementary materials for this article are available online. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
32. Developing a Framework of Sustainable Consumption in Retailing Contexts.
- Author
-
Saha, Victor, Jebarajakirthy, Charles, Sreen, Naman, Goyal, Praveen, Mani, Venkatesh, and Deshpande, Sameer
- Abstract
Over the years, the retail sector has been showing increasing interest in inculcating sustainable consumption practices and behaviors among their consumers. This study, accordingly, explores how retailers can contribute to addressing consumers' various psychological dispositions to co-create sustainable consumption in the retail context. In that regard, the MAPED framework has been developed corresponding to White et al.'s SHIFT framework for exploring the process and mechanism of co-creating sustainable consumption in the retail sector. This comprehensive conceptual framework has been developed using a methodological approach of critical review and analysis of the extant academic literature on value co-creation and sustainable consumption. Accordingly, this study attempts to contribute to both sustainable consumption and value co-creation literature. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
33. Hemoglobin Levels Among Male Agricultural Workers: Analyses From the Demographic and Health Surveys to Investigate a Marker for Chronic Kidney Disease of Uncertain Etiology
- Author
-
Lin, Yuzhou, Heng, Siyu, Anand, Shuchi, Deshpande, Sameer K., and Small, Dylan S.
- Published
- 2022
- Full Text
- View/download PDF
34. Market-oriented corporate digital responsibility to manage data vulnerability in online banking
- Author
-
Liyanaarachchi, Gajendra, Deshpande, Sameer, and Weaven, Scott
- Published
- 2021
- Full Text
- View/download PDF
35. Community perspectives and engagement in sustainable solid waste management (SWM) in Fiji: A socioecological thematic analysis
- Author
-
Sewak, Aarti, Deshpande, Sameer, Rundle-Thiele, Sharyn, Zhao, Fang, and Anibaldi, Renata
- Published
- 2021
- Full Text
- View/download PDF
36. Base force and moment based finite element model correlation method
- Author
-
Sairajan, K.K., Deshpande, Sameer S., Patnaik, M.N.M., and Poomani, D.
- Published
- 2021
- Full Text
- View/download PDF
37. Social Marketing @ Griffith
- Author
-
Harris, Jessica, primary, Rundle-Thiele, Sharyn, additional, Dietrich, Timo, additional, Deshpande, Sameer, additional, Carins, Julia, additional, and Parkinson, Joy, additional
- Published
- 2022
- Full Text
- View/download PDF
38. Market-Oriented Digital Responsibility to Ensure Digital Security
- Author
-
Liyanaarachchi, Gajendra, primary and Deshpande, Sameer, additional
- Published
- 2022
- Full Text
- View/download PDF
39. Social Marketing Quarterly
- Author
-
Deshpande, Sameer, primary, Robinette, Tina, additional, Kirby, Susan, additional, McDivitt, Judith, additional, and Olabisi, Abisola, additional
- Published
- 2022
- Full Text
- View/download PDF
40. Tungsten Interconnect Resistance Reduction Enabling Energy Efficient and High Performance Applications for 2nm Node and Beyond
- Author
-
Thareja, Gaurav, primary, Pal, Ashish, additional, Ma, Quan, additional, Ching, Chi, additional, Patel, Sahil, additional, Gao, Xingyao, additional, Dag, Sefa, additional, Qi, Zhimin, additional, Zhang, Aixi, additional, Yue, Shiyu, additional, Lei, Wei, additional, Xu, Yi, additional, Lei, Yu, additional, Jiang, Hao, additional, You, Shi, additional, Zheng, Wenkai, additional, Hung, Raymond, additional, Costrini, Gregory, additional, Zhu, Qing, additional, Tran, Randy, additional, Gupta, Rohit, additional, Reddy, Vinod, additional, Vyas, Pratik B., additional, Hassan, Sajjad, additional, Cai, Man Ping, additional, Shen, Gang, additional, Chen, Zhebo, additional, Hou, Wenting, additional, Lei, Jianxin, additional, Wang, Rongjun, additional, Shen, Walters, additional, Deshpande, Sameer, additional, Huey, Sidney, additional, Tang, Jianshe, additional, Naik, Mehul, additional, Kesapragada, Sree, additional, Ayyagari-Sangamali, Buvna, additional, Bazizi, El Mehdi, additional, and Tang, Xianmin, additional
- Published
- 2023
- Full Text
- View/download PDF
41. The personal and the social: Twin contributors to climate action
- Author
-
Bradley, Graham L., primary, Deshpande, Sameer, additional, and Paas, Karlien H.W., additional
- Published
- 2023
- Full Text
- View/download PDF
42. Protocol for an Observational Study on the Effects of Playing High School Football on Later Life Cognitive Functioning and Mental Health
- Author
-
Deshpande, Sameer K., Hasegawa, Raiden B., Rabinowitz, Amanda R., Whyte, John, Roan, Carol L., Tabatabaei, Andrew, Baiocchi, Michael, Karlawish, Jason H., Master, Christina L., and Small, Dylan S.
- Subjects
Statistics - Applications - Abstract
A potential causal relationship between head injuries sustained by NFL players and later-life neurological decline may have broad implications for participants in youth and high school football programs. However, brain trauma risk at the professional level may be different than that at the youth and high school levels and the long-term effects of participation at these levels is as-yet unclear. To investigate the effect of playing high school football on later life depression and cognitive functioning, we propose a retrospective observational study using data from the Wisconsin Longitudinal Study (WLS) of graduates from Wisconsin high schools in 1957. We compare 1,153 high school males who played varsity football to 2,751 male students who did not. 1,951 of the control subjects did not play any sport and the remaining 800 controls played a non-contact sport. We focus on two primary outcomes measured at age 65: a composite cognitive outcome measuring verbal fluency and memory and the modified CES-D depression score. To control for potential confounders we adjust for pre-exposure covariates such as IQ with matching and model-based covariate adjustment. We will conduct an ordered testing procedure that uses all 2,751 controls while controlling for possible unmeasured differences between students who played sports and those who did not. We will quantitatively assess the sensitivity of the results to potential unmeasured confounding. The study will also consider several secondary outcomes of clinical interest such as aggression and heavy drinking. The rich set of pre-exposure variables, relatively unbiased sampling, and longitudinal nature of the WLS dataset make the proposed analysis unique among related studies that rely primarily on convenience samples of football players with reported neurological symptoms., Comment: Prior to performing the proposed analysis, we will register this pre-analysis plan on clincialtrials.gov
- Published
- 2016
43. Estimating an NBA player's impact on his team's chances of winning
- Author
-
Deshpande, Sameer K. and Jensen, Shane T.
- Subjects
Statistics - Applications - Abstract
Traditional NBA player evaluation metrics are based on scoring differential or some pace-adjusted linear combination of box score statistics like points, rebounds, assists, etc. These measures treat performances with the outcome of the game still in question (e.g. tie score with five minutes left) in exactly the same way as they treat performances with the outcome virtually decided (e.g. when one team leads by 30 points with one minute left). Because they ignore the context in which players perform, these measures can result in misleading estimates of how players help their teams win. We instead use a win probability framework for evaluating the impact NBA players have on their teams' chances of winning. We propose a Bayesian linear regression model to estimate an individual player's impact, after controlling for the other players on the court. We introduce several posterior summaries to derive rank-orderings of players within their team and across the league. This allows us to identify highly paid players with low impact relative to their teammates, as well as players whose high impact is not captured by existing metrics., Comment: To appear in the Journal of Quantitative Analysis of Sport
- Published
- 2016
- Full Text
- View/download PDF
44. The value of values in business purchase decisions
- Author
-
Anwer, Ehtisham, Deshpande, Sameer, Derry, Robbin, and Basil, Debra Z.
- Published
- 2020
- Full Text
- View/download PDF
45. Examining the Potential Disconnect between Parents' Perceptions and Reality Regarding the Physical Activity Levels of Their Children
- Author
-
Faulkner, Guy, Solomon, Vaeda, Berry, Tanya, Deshpande, Sameer, Latimer-Cheung, Amy E., Rhodes, Ryan, Spence, John, and Tremblay, Mark S.
- Abstract
Background: Parental support plays an important role in facilitating the participation of children in physical activity. However, there is evidence that parents overestimate their child's level of physical activity -- this may lead to inaction in promotion attempts by parents. This potential disconnect between parental perceptions and reality was recently the focus of the 'Think Again' social marketing campaign developed by PartipACTION. Purpose: To qualitatively explore parents' perceptions of the Think Again advertisements, and the possible disconnect between perceptions and reality regarding their children's physical activity levels. Method: Semi-structured interviews were conducted with 12 mothers and 12 fathers of children aged 5-11 years attending a supervised recreation class. A thematic analysis was applied to the collected data. Results: The advertisements were generally well received by the parents in serving as a reminder of how much physical activity their children should be getting. Less than half of parents believed their children were attaining physical activity guidelines although the majority believed they were sufficiently active given perceived time constraints for both them and their child. Most parents believed they could accurately estimate how active their child was but that other parents may have difficulty due to reliance on schools and organized recreation to provide opportunities for physical activity. Conclusion: PSAs have a role to play in increasing parental awareness of physical activity guidelines and communicating the importance of physical activity. More creative approaches will be needed to address the disconnect in the perceptions between sufficient and recommended levels of physical activity.
- Published
- 2014
46. Social advocacy: a conceptual model to extend post-intervention effectiveness.
- Author
-
Campbell, Alexander, Deshpande, Sameer, Rundle-Thiele, Sharyn, and West, Tracey
- Abstract
Commercial marketing literature highlights benefits from brand advocates who recruit and promote in the interest of the commercial entity. However, a similar focus is lacking on how advocacy can extend the effectiveness of social change initiatives. We utilise a case study to demonstrate the benefit of social advocacy and its impact on behaviour change, and thereby propose an advocacy model. To develop this conceptual model, we discuss several key areas; behaviour change and advocacy, advocate identification, and how to influence advocacy within communities and individuals. This research provides a guiding framework for practitioners to develop programs and interventions with advocacy triggers and strategies to enhance the longevity and effectiveness of social change programs through participant-based advocacy. Thus, giving intervention programs in a variety of organisational structures e.g. non-profit, corporate, government etc. a specific model to increase the effectiveness of social programs. Our paper extends behaviour change literature by leveraging social marketing concepts to modify and extend the transtheoretical model. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
47. Evaluating plate discipline in Major League Baseball with Bayesian Additive Regression Trees.
- Author
-
Yee, Ryan and Deshpande, Sameer K.
- Subjects
BASEBALL ,BATTING (Baseball) ,DECISION theory ,REGRESSION trees ,CONFIRMATORY factor analysis - Abstract
We introduce a three-step framework to determine at which pitches Major League batters should swing. Unlike traditional plate discipline metrics, which implicitly assume that all batters should always swing at (resp. take) pitches inside (resp. outside) the strike zone, our approach explicitly accounts not only for the players and umpires involved in the pitch but also in-game contextual information like the number of outs, the count, baserunners, and score. We first fit flexible Bayesian nonparametric models to estimate (i) the probability that the pitch is called a strike if the batter takes the pitch; (ii) the probability that the batter makes contact if he swings; and (iii) the number of runs the batting team is expected to score following each pitch outcome (e.g. swing and miss, take a called strike, etc.). We then combine these intermediate estimates to determine whether swinging increases the batting team's run expectancy. Our approach enables natural uncertainty propagation so that we can not only determine the optimal swing/take decision but also quantify our confidence in that decision. We illustrate our framework using a case study of pitches faced by Mike Trout in 2019. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
48. Use of Social Marketing to Improve Science Teaching in Maharashtra, India: 2014–18
- Author
-
Deshpande, Sameer, Basil, Debra Z., editor, Diaz-Meneses, Gonzalo, editor, and Basil, Michael D., editor
- Published
- 2019
- Full Text
- View/download PDF
49. Toward developing an environmental efficacy construct
- Author
-
Basil, Debra Z., Basil, Michael, Lavack, Anne Marie, and Deshpande, Sameer
- Published
- 2020
- Full Text
- View/download PDF
50. Input, Outcome, and Impact: A Program-Informed Model to Improve the Effectiveness of Corporate Social Marketing
- Author
-
Campbell, Alexander, primary, Deshpande, Sameer, additional, Kumar, Sunil, additional, Rundle-Thiele, Sharyn, additional, and West, Tracey, additional
- Published
- 2023
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.