18 results on '"Liqun Diao"'
Search Results
2. Classification Trees with Mismeasured Responses
- Author
-
Liqun Diao and Grace Y. Yi
- Subjects
Mathematics (miscellaneous) ,Psychology (miscellaneous) ,Library and Information Sciences ,Statistics, Probability and Uncertainty - Published
- 2023
3. Utilisation des arbres décisionnels dans la recherche en surveillance de la santé de la population : application aux données d’enquête sur la santé mentale des jeunes de l’étude COMPASS
- Author
-
Katelyn Battista, Liqun Diao, Karen A. Patte, Joel A. Dubin, and Scott T. Leatherdale
- Subjects
General Medicine - Abstract
Introduction Dans la recherche en surveillance de la santé de la population, les données d’enquête sont couramment analysées à l’aide de méthodes de régression. Or ces méthodes disposent d’une capacité limitée à analyser les relations complexes. À l’opposé, les modèles d’arbres décisionnels sont parfaitement adaptés pour segmenter les populations et étudier les interactions complexes entre facteurs, et leur utilisation dans la recherche en santé est en pleine croissance. Cet article fournit un aperçu de la méthodologie des arbres décisionnels et de leur application aux données d’enquête sur la santé mentale des jeunes. Methods La performance de deux techniques courantes de construction d’arbres décisionnels, soit l’arbre de classification et de régression (CART) et l’arbre d’inférence conditionnelle (CTREE), est comparée aux modèles classiques de régression linéaire et logistique par le biais d’une application aux résultats en santé mentale des jeunes de l’étude COMPASS. Les données ont été recueillies auprès de 74 501 élèves de 136 écoles au Canada. L’anxiété, la dépression et le bien‑être psychologique ont été mesurés, de même que 23 variables sociodémographiques et facteurs de prédiction des comportements liés à la santé. La performance du modèle a été évaluée au moyen de mesures de prédiction de l’exactitude, de la parcimonie et de l’importance relative des variables. Results Les modèles d’arbres décisionnels et les modèles de régression ont systématiquement mis en évidence les mêmes ensembles de facteurs de prédiction les plus importants pour chaque résultat, ce qui indique un niveau général de concordance entre méthodes. Trois modèles ont présenté une exactitude prédictive plus faible, mais se caractérisent par une plus grande parcimonie et accordent une importance relative plus élevée aux principaux facteurs de différenciation. Conclusion Les arbres décisionnels permettent de cerner les sous-groupes à risque élevé qu’il convient de cibler dans le cadre des efforts de prévention et d’intervention. Ils constituent donc un outil utile pour répondre aux questions de recherche auxquelles les méthodes de régression classiques ne peuvent pas répondre.
- Published
- 2023
4. Doubly weighted mean score estimating functions with a partially observed effect modifier
- Author
-
Meaghan S. Cuerden, Liqun Diao, Cecilia A. Cotton, and Richard J. Cook
- Subjects
Statistics and Probability - Published
- 2023
5. Adaptive response‐dependent two‐phase designs: Some results on robustness and efficiency
- Author
-
Liqun Diao, Ce Yang, and Richard Cook
- Subjects
Cohort Studies ,Statistics and Probability ,Research Design ,Epidemiology ,Humans ,Computer Simulation ,Matrix Metalloproteinase 3 ,Biomarkers - Abstract
Large cohort studies now routinely involve biobanks in which biospecimens are stored for use in future biomarker studies. In such settings, two-phase response-dependent sampling designs involve subsampling individuals in the cohort, assaying their biospecimen to measure an expensive biomarker, and using this data to estimate key parameters of interest under budgetary constraints. When analyses are based on inverse probability weighted estimating functions, recent work has described adaptive two-phase designs in which a preliminary phase of subsampling based on a standard design facilitates approximation of an optimal selection model for a second subsampling phase. In this article, we refine the definition of an optimal subsampling scheme within the framework of adaptive two-phase designs, describe how adaptive two-phase designs can be used when analyses are based on likelihood or conditional likelihood, and consider the setting of a continuous biomarker where the nuisance covariate distribution is estimated nonparametrically at the design stage and analysis stage as required; efficiency and robustness issues are investigated. We also explore these methods for the surrogate variable problem and describe a generalization to accommodate multiple stages of phase II subsampling. A study involving individuals with psoriatic arthritis is considered for illustration, where the aim is to assess the association between the biomarker MMP-3 and the development of joint damage.
- Published
- 2022
6. Nested doubly robust estimating equations for causal analysis with an incomplete effect modifier
- Author
-
Liqun Diao and Richard J. Cook
- Subjects
Statistics and Probability ,05 social sciences ,Effect modifier ,Estimating equations ,01 natural sciences ,Doubly robust ,010104 statistics & probability ,0502 economics and business ,Applied mathematics ,0101 mathematics ,Statistics, Probability and Uncertainty ,050205 econometrics ,Mathematics ,Causal analysis - Published
- 2021
7. Polya tree-based nearest neighborhood regression
- Author
-
Haoxin Zhuang, Liqun Diao, and Grace Yi
- Subjects
Statistics and Probability ,Computational Theory and Mathematics ,Statistics, Probability and Uncertainty ,Theoretical Computer Science - Published
- 2022
8. Using Decision Trees to Examine Environmental and Behavioural Factors Associated with Youth Anxiety, Depression, and Flourishing
- Author
-
Katelyn Battista, Karen A. Patte, Liqun Diao, Joel A. Dubin, and Scott T. Leatherdale
- Subjects
Canada ,Schools ,Adolescent ,Depression ,Health, Toxicology and Mutagenesis ,Decision Trees ,Public Health, Environmental and Occupational Health ,Humans ,decision trees ,mental health ,youth ,school climate ,home environment ,Female ,Anxiety ,Anxiety Disorders - Abstract
Modifiable environmental and behavioural factors influence youth mental health; however, past studies have primarily used regression models that quantify population average effects. Decision trees are an analytic technique that examine complex relationships between factors and identify high-risk subgroups to whom intervention measures can be targeted. This study used decision trees to examine associations of various risk factors with youth anxiety, depression, and flourishing. Data were collected from 74,501 students across Canadian high schools participating in the 2018–2019 COMPASS Study. Students completed a questionnaire including validated mental health scales and 23 covariates. Decision trees were grown to identify key factors and subgroups for anxiety, depression, and flourishing outcomes. Females lacking both happy home life and sense of connection to school were at greatest risk for higher anxiety and depression levels. In contrast with previous literature, behavioural factors such as diet, movement and substance use did not emerge as differentiators. This study highlights the influence of home and school environments on youth mental health using a novel decision tree analysis. While having a happy home life is most important in protecting against youth anxiety and depression, a sense of connection to school may mitigate the negative influence of a poor home environment.
- Published
- 2022
9. Polya tree Monte Carlo method
- Author
-
Haoxin Zhuang, Liqun Diao, and Grace Y. Yi
- Subjects
Statistics and Probability ,Computational Mathematics ,Computational Theory and Mathematics ,Applied Mathematics - Published
- 2023
10. A Vine Copula Model for Climate Trend Analysis using Canadian Temperature Data
- Author
-
Liqun Diao, Haoxin Zhuang, and Grace Y. Yi
- Subjects
Vine copula ,Quasi-maximum likelihood ,Trend analysis ,Longitudinal data ,Econometrics ,Climate change ,Mathematics - Published
- 2021
11. A DSA Algorithm for Mortality Forecasting
- Author
-
Yechao Meng, Chengguo Weng, and Liqun Diao
- Subjects
Statistics and Probability ,Economics and Econometrics ,Computer science ,Mortality forecasting ,Lee–Carter model ,Target population ,Machine learning ,computer.software_genre ,01 natural sciences ,Digital Signature Algorithm ,010104 statistics & probability ,03 medical and health sciences ,Statistics ,Feature (machine learning) ,Mortality prediction ,0101 mathematics ,Selection (genetic algorithm) ,030304 developmental biology ,Common factor model ,0303 health sciences ,business.industry ,social sciences ,Artificial intelligence ,Statistics, Probability and Uncertainty ,business ,computer - Abstract
Borrowing information from populations with similar structural mortality patterns and trajectories has been well recognized as a useful strategy for the mortality forecasting of a target population. This paper presents a flexible framework for the selection of populations from a given candidate pool to assist a target population in mortality forecasting. The defining feature of the framework is the deletion-substitution-addition (DSA) algorithm, which is entirely data-driven and versatile to work with any multiple-population model for mortality prediction. In numerical studies, the framework with an extended augmented common factor model is applied to the Human Mortality Database, and the superiority of the proposed framework is evident in mortality forecasting performance.
- Published
- 2020
12. Censoring Unbiased Regression Trees and Ensembles
- Author
-
Liqun Diao, Robert L. Strawderman, and Jon A. Steingrimsson
- Subjects
Statistics and Probability ,Cart ,Computer science ,business.industry ,05 social sciences ,Machine learning ,computer.software_genre ,01 natural sciences ,Censoring (statistics) ,Ensemble learning ,Article ,Doubly robust ,Regression ,Random forest ,010104 statistics & probability ,ComputingMethodologies_PATTERNRECOGNITION ,0502 economics and business ,Statistics ,Statistics::Methodology ,Artificial intelligence ,0101 mathematics ,Statistics, Probability and Uncertainty ,business ,computer ,050205 econometrics - Abstract
This article proposes a novel paradigm for building regression trees and ensemble learning in survival analysis. Generalizations of the classification and regression trees (CART) and random forests (RF) algorithms for general loss functions, and in the latter case more general bootstrap procedures, are both introduced. These results, in combination with an extension of the theory of censoring unbiased transformations (CUTs) applicable to loss functions, underpin the development of two new classes of algorithms for constructing survival trees and survival forests: censoring unbiased regression trees and censoring unbiased regression ensembles. For a certain “doubly robust” CUT of squared error loss, we further show how these new algorithms can be implemented using existing software (e.g., CART, RF). Comparisons of these methods to existing ensemble procedures for predicting survival probabilities are provided in both simulated settings and through applications to four datasets. It is shown that these new methods either improve upon, or remain competitive with, existing implementations of random survival forests, conditional inference forests, and recursively imputed survival trees.
- Published
- 2018
13. Doubly robust survival trees
- Author
-
Annette M. Molinaro, Liqun Diao, Jon A. Steingrimsson, and Robert L. Strawderman
- Subjects
0301 basic medicine ,Statistics and Probability ,Epidemiology ,Computer science ,Estimator ,Recursive partitioning ,Missing data ,01 natural sciences ,Censoring (statistics) ,Doubly robust ,010104 statistics & probability ,03 medical and health sciences ,030104 developmental biology ,Risk groups ,Inverse probability ,Statistics ,Treatment decision making ,0101 mathematics - Abstract
Estimating a patient's mortality risk is important in making treatment decisions. Survival trees are a useful tool and employ recursive partitioning to separate patients into different risk groups. Existing 'loss based' recursive partitioning procedures that would be used in the absence of censoring have previously been extended to the setting of right censored outcomes using inverse probability censoring weighted estimators of loss functions. In this paper, we propose new 'doubly robust' extensions of these loss estimators motivated by semiparametric efficiency theory for missing data that better utilize available data. Simulations and a data analysis demonstrate strong performance of the doubly robust survival trees compared with previously used methods. Copyright © 2016 John Wiley & Sons, Ltd.
- Published
- 2016
14. Regression Tree Credibility Model
- Author
-
Liqun Diao and Chengguo Weng
- Subjects
Statistics and Probability ,Economics and Econometrics ,050208 finance ,business.industry ,Computer science ,05 social sciences ,Binary algorithm ,Decision tree ,Feature selection ,Predictive analytics ,Net (mathematics) ,Machine learning ,computer.software_genre ,01 natural sciences ,Credibility theory ,010104 statistics & probability ,0502 economics and business ,Covariate ,Credibility ,Artificial intelligence ,0101 mathematics ,Statistics, Probability and Uncertainty ,business ,computer - Abstract
This paper applies machine learning techniques to credibility theory and proposes a regression-tree-based algorithm to integrate covariate information into credibility premium prediction. The recursive binary algorithm partitions a collective of individual risks into mutually exclusive sub-collectives, and applies the classical Buhlmann-Straub credibility formula for the prediction of individual net premiums. The algorithm provides a flexible way to integrate covariate information into individual net premiums prediction. It is appealing for capturing non-linear and/or interaction covariate effects. It automatically selects influential covariate variables for premium prediction and requires no additional ex-ante variable selection procedure. The superiority in the prediction accuracy of the proposed algorithm is demonstrated by extensive simulation studies. The proposed method is applied to the U.S. Medicare data for illustration purposes.
- Published
- 2018
15. A copula model for marked point processes
- Author
-
Liqun Diao, Ker-Ai Lee, and Richard J. Cook
- Subjects
Multivariate statistics ,Models, Statistical ,Computer science ,Applied Mathematics ,Platelet Transfusion ,General Medicine ,Biostatistics ,Joint analysis ,Intensity function ,Thrombocytopenia ,Markov Chains ,Point process ,Copula (probability theory) ,Recurrence ,Chronic Disease ,Multivariate Analysis ,Statistics ,Econometrics ,Humans ,Computer Simulation ,Marked point process ,Random variable ,Randomized Controlled Trials as Topic - Abstract
Many chronic diseases feature recurring clinically important events. In addition, however, there often exists a random variable which is realized upon the occurrence of each event reflecting the severity of the event, a cost associated with it, or possibly a short term response indicating the effect of a therapeutic intervention. We describe a novel model for a marked point process which incorporates a dependence between continuous marks and the event process through the use of a copula function. The copula formulation ensures that event times can be modeled by any intensity function for point processes, and any multivariate model can be specified for the continuous marks. The relative efficiency of joint versus separate analyses of the event times and the marks is examined through simulation under random censoring. An application to data from a recent trial in transfusion medicine is given for illustration.
- Published
- 2013
16. Doubly robust survival trees
- Author
-
Jon Arni, Steingrimsson, Liqun, Diao, Annette M, Molinaro, and Robert L, Strawderman
- Subjects
Models, Statistical ,Risk Factors ,Humans ,Mortality ,Survival Analysis ,Article ,Data Accuracy - Abstract
Estimating a patient's mortality risk is important in making treatment decisions. Survival trees are a useful tool and employ recursive partitioning to separate patients into different risk groups. Existing 'loss based' recursive partitioning procedures that would be used in the absence of censoring have previously been extended to the setting of right censored outcomes using inverse probability censoring weighted estimators of loss functions. In this paper, we propose new 'doubly robust' extensions of these loss estimators motivated by semiparametric efficiency theory for missing data that better utilize available data. Simulations and a data analysis demonstrate strong performance of the doubly robust survival trees compared with previously used methods. Copyright © 2016 John WileySons, Ltd.
- Published
- 2015
17. Statistical analysis of recurrent adverse events
- Author
-
Ker-Ai Lee, Richard J. Cook, and Liqun Diao
- Subjects
medicine.medical_specialty ,business.industry ,Emergency medicine ,medicine ,Statistical analysis ,Regression analysis ,Adverse effect ,business - Published
- 2014
18. Composite likelihood for joint analysis of multiple multistate processes via copulas
- Author
-
Liqun Diao and Richard J. Cook
- Subjects
Statistics and Probability ,Quasi-maximum likelihood ,Likelihood Functions ,Models, Statistical ,Computer science ,Markov process ,General Medicine ,Joint analysis ,symbols.namesake ,Data Interpretation, Statistical ,Statistics ,Spondylarthritis ,Econometrics ,symbols ,Humans ,Statistics, Probability and Uncertainty ,Biostatistics - Abstract
A copula-based model is described which enables joint analysis of multiple progressive multistate processes. Unlike intensity-based or frailty-based approaches to joint modeling, the copula formulation proposed herein ensures that a wide range of marginal multistate processes can be specified and the joint model will retain these marginal features. The copula formulation also facilitates a variety of approaches to estimation and inference including composite likelihood and two-stage estimation procedures. We consider processes with Markov margins in detail, which are often suitable when chronic diseases are progressive in nature. We give special attention to the setting in which individuals are examined intermittently and transition times are consequently interval-censored. Simulation studies give empirical insight into the different methods of analysis and an application involving progression in joint damage in psoriatic arthritis provides further illustration.
- Published
- 2014
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.