8 results on '"Garcìa Escudero, L"'
Search Results
2. Exploring solutions via monitoring for cluster weighted robust models
- Author
-
Cappozzo, A, Garcìa-Escudero, L. A, Greselin, F., Mayo-Iscar, A., Porzio, GC, Rampichini, C, Bocci, C, Cappozzo, A, Garcìa-Escudero, L, Greselin, F, and Mayo-Iscar, A
- Subjects
SECS-S/01 - STATISTICA ,Cluster-weighted modeling, Outliers, Trimmed BIC, Eigenvalue constraint, Monitoring, Constrained estimation, Model-based clustering - Abstract
Depending on the selected hyper-parameters, cluster weighted modeling may produce a set of diverse solutions. Particularly, the user can manually specify the number of mixture components, the degree of heteroscedasticity of the clusters in the explanatory variables and of the errors around the regression lines. In addition, when performing robust inference, the level of impartial trimming enforced in the estimation needs to be selected. This flexibility gives rise to a variety of “legitimate” solutions. To mitigate the problem of model selection, we propose a two stage monitoring procedure to identify a set of “good models”. An application to the benchmark tone perception data showcases the benefits of the approach.
- Published
- 2021
3. Robust estimation of mixtures of Skew Normal Distributions
- Author
-
Garcìa-Escudero, L, Greselin, F, McLachlan, G, Mayo-Iscar, A, Garcìa-Escudero, L, Greselin, F, Mclachlan, G, and Mayo-Iscar, A
- Subjects
Clustering, Robustness, Trimming, Constrained estimation, Skew data, model-based classification, Finite mixture models ,SECS-S/01 - STATISTICA - Abstract
Recently, observed departures from the classical Gaussian mixture model in real datasets motivated the introduction of mixtures of skew t, and remarkably widened the application of model based clustering and classification to great many real datasets. Unfortunately,when data contamination occurs, classical inference for these models could be severely affected. In this paper we introduce robust estimation of mixtures of skew normal, to resist sparse outliers and even pointwise contamination that may arise in data collection. Hence, in each component, the skewed nature of the data is explicitly modeled, while any departure from it is dealt by the robust approach. Some applications on real data show the effectiveness of the proposal
- Published
- 2016
4. Robust estimation for mixtures of skew data
- Author
-
Garcìa-Escudero, L, Greselin, F, McLachlan, G, Mayo-Iscar, A, Garcìa-Escudero, L, Greselin, F, Mclachlan, G, and Mayo-Iscar, A
- Subjects
SECS-S/01 - STATISTICA ,Skew data, Heterogeneity, mixture models, robust estimation, constrained estimation, trimming, Maximum likelihood, Expectation-maximization - Abstract
Recently, observed departures from the classical Gaussian mixture model in real datasets have led to the introduction of more flexible tools for modeling heterogeneous skew data. Among the latest proposals in the literature, we consider mixtures of skew normal, to incorporate asymmetry in components, as well as mixtures of t, to down-weight the contribution of extremal observations. Clearly, mixtures of skew t have widened the application of model based clustering and classification to great many real datasets, as they can adapt to both asymmetry and leptokurtosis in the grouped data. Unfortunately, when data contamination occurs far from the bulk of the data, or even between the groups, classical inference for these models is not reliable. Our proposal is to address robust estimation of mixtures of skew normal, to resist sparse outliers and even pointwise contamination that could arise in data collection. We introduce a constructive way to obtain a robust estimator for the mixture of skew normal model, by incorporating impartial trimming and constraints in the EM algorithm. At each E-step, a low percentage of less plausible observations, under the estimated model, is tentatively trimmed; at the M-step, constraints on the scatter matrices are employed to avoid singularities and reduce spurious maximizers. Some applications on artificial and real data show the effectiveness of our proposal, and the joint role of trimming and constraints to achieve robustness
- Published
- 2015
5. Fuzzy clustering of multivariate skew data
- Author
-
Garcìa-Escudero, LA, Greselin, F, Mayo-Iscar, A, Colubi, A, Gatu, C, Garcìa-Escudero, L, Greselin, F, and Mayo-Iscar, A
- Subjects
fuzzy clustering, skew data, robust statistics ,SECS-S/01 - STATISTICA - Abstract
With the increasing availability of multivariate datasets, asymmetric structures in the data ask for more realistic assumptions, with respect to the incredibly useful paradigm given by the Gaussian distribution. Moreover, in performing ML estimation we know that a few outliers in the data can affect the estimation, hence providing unreliable inference. Challenged by such issues, more flexible and solid tools for modeling heterogeneous skew data are needed. Our fuzzy clustering method is based on mixtures of Skew Gaussian components, endowed by the joint usage of impartial trimming and constrained estimation of scatter matrices, in a modified maximum likelihood approach. The algorithm generates a set of membership values, that are used to fuzzy partition the data set and to contribute to the robust estimates of the mixture parameters. The new methodology has been shown to be resistant to different types of contamination, by applying it on artificial data. A brief discussion on the tuning parameters has been developed, also with the help of some heuristic tools for their choice. Finally, synthetic and real dataset are analyzed, to show how intermediate membership values are estimated for observations lying at cluster overlap, while cluster cores are composed by observations that are assigned to a cluster in a crisp way.
- Published
- 2018
6. Extending robust fuzzy clustering to skew data
- Author
-
Garcìa-Escudero, LA, Greselin, F, Mayo-Iscar, A, Ana Colubi, Erricos J. Kontoghiorghes and Herman K. Van Dijk, Garcìa-Escudero, L, Greselin, F, and Mayo-Iscar, A
- Subjects
SECS-S/01 - STATISTICA ,Fuzzy clustering, skew components, skew Gaussian distribution, impartial trimming, outliers, robust statistics, robust inference - Abstract
Clustering is an important technique in exploratory data analysis, with applications in image processing, object classification, target recognition, data mining etc. The aim is to partition data according to natural classes present in it, assigning data points that are more similar to the same cluster. We solved this ill-posed problem by adopting a fuzzy clustering method, based on mixtures of skew Gaussian, endowed by the joint usage of trimming and constrained estimation of scatter matrices. A set of membership values are used to fuzzy partition the data and to contribute to the robust estimates of the mixture parameters. The purpose is to adopt the basic skew Gaussian component for the mixture and apply impartial trimming to the data, to model the skew core of the clusters and to adapt to any type of tail behaviour. The choice of the skew Gaussian components is motivated by the fact that, with the increased availability of multivariate datasets, often underlying asymmetric structures appear. In these cases, the extremely useful paradigm for clustering given by the mixtures of Gaussian distributions appeared somehow unrealistic. Moreover, impartial trimming provides robust ML estimation, even in presence of outliers in the data. Finally, synthetic and real data are analyzed, to show how intermediate membership values are estimated for observations lying at cluster overlap, while cluster cores are composed by observations that are assigned to a cluster in a crisp way.
- Published
- 2018
7. Robust clustering for heterogeneous skew data
- Author
-
Garcìa-Escudero, LA, Greselin, F, Mayo-Iscar, A., Mola, F, Conversano, C, Garcìa-Escudero, L, Greselin, F, and Mayo-Iscar, A
- Subjects
Clustering, robust estimation, skew data ,SECS-S/01 - STATISTICA - Abstract
The existing robust methods for model-based classification and clustering deal with elliptically contoured components. Here we introduce robust estimation for mixtures of skew-normal, by the joint usage of trimming and constraints. The model allows to fit heterogeneous skew data with great flexibility.
- Published
- 2015
8. Robust estimation for mixtures of Gaussian factor analyzers
- Author
-
Garcia Escudero, L., Gordaliza, A., Greselin, F., Ingrassia, Salvatore, Mayo Iscar, A., Gijbels, I, Hubert, M, Park, BU, Welsch, R, Garcìa Escudero, L, Gordaliza, A, Greselin, F, Ingrassia, S, and Mayo Iscar, A
- Subjects
SECS-S/01 - STATISTICA ,Trimming, Factor analysis, Mixture Models, EM, Robust estimation, Constrained estimation, Dimension reduction - Abstract
Mixtures of Gaussian factors are powerful tools for modeling an unobserved heterogeneous population, offering at the same time dimension reduction and model-based clustering. Unfortunately, the high prevalence of spurious solutions and the disturbing effects of outlying observations, along maximum likelihood estimation, open serious issues. We consider restrictions for the component covariances, to avoid spurious solutions, and trimming, to provide robustness against violations of normality assumptions of the underlying latent factors. A detailed AECM algorithm for this new approach is presented. Simulation results and an application to the AIS dataset show the aim and effectiveness of the proposed methodology
- Published
- 2015
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.