40 results on '"em-type algorithm"'
Search Results
2. ANALYZING SKEWED FINANCIAL DATA USING SKEW SCALE-SHAP MIXTURES OF MULTIVARIATE NORMAL DISTRIBUTIONS.
- Author
-
TAMANDI, M. and AMIRI, M.
- Subjects
GAUSSIAN distribution ,PROBABILITY theory ,DECISION making ,BIG data ,MAXIMUM likelihood statistics - Abstract
This paper introduces an innovative family of statistical models called the multivariate skew scale-shape mixtures of normal distributions. These models serve as a versatile tool in statistical analysis by efficiently characterizing the skewed and leptokurtic nature commonly observed in multivariate datasets. Their applicability shines in real-world scenarios where data often deviate from standard statistical assumptions due to the presence of outliers. We present an EM-type algorithm designed for maximizing likelihood estimation and evaluate the model's effectiveness through real-world data applications. Through rigorous testing against various datasets, we assess the performance and practicality of the proposed algorithm in real statistical scenarios. The results demonstrate the remarkable performance of this new family of distributions. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. A novel finite mixture model based on generalized scale mixtures of generalized normal distributions with application to stock dataset.
- Author
-
Guan, Ruijie, Cheng, Weihu, Jiao, Junjun, and Zeng, Jie
- Subjects
- *
CONDITIONAL expectations , *GAUSSIAN distribution , *PARAMETER estimation , *MODELS & modelmaking , *MIXTURES - Abstract
AbstractThis paper introduces a novel family of distributions known as generalized scale mixtures of generalized normal distributions (GSMGN). These distributions incorporate two additional shape parameters that serve to regulate the shape and tails of the distribution. A finite mixture model based on this family is presented to address clustering heterogeneous data in the presence of leptokurtic and heavy-tailed outcomes. The estimation of the parameters of this model are obtained by developing an ECM-PLA ensemble algorithm which combine the profile likelihood approach (PLA) and the classical Expectation Conditional Maximization (ECM) algorithm, and the observed information matrix is obtained. The applicability of this new family and the numerical performance of the proposed methodology is discussed through simulated and real data examples. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. Estimation in shape mixtures of skew-normal linear regression models via ECM coupled with Gibbs sampling.
- Author
-
Alizadeh Ghajari, Zakaria, Zare, Karim, and Shokri, Soheil
- Subjects
- *
MARKOV chain Monte Carlo , *GIBBS sampling , *REGRESSION analysis , *EXPECTATION-maximization algorithms - Abstract
In this paper, we study linear regression models in which the error term has shape mixtures of skew-normal distribution. This type of distribution belongs to the skew-normal (SN) distribution class that can be used for heavy tails and asymmetry data. For the first time, for the classical (non-Bayesian) estimation of the parameters of the SN family, we apply the Markov chains Monte Carlo ECM (MCMC-ECM) algorithm where the samples are generated by Gibbs sampling, denoted by Gibbs-ECM, and also, we extend two other types of the EM algorithm for the above model. Finally, the proposed method is evaluated through a simulation and compared with the Numerical Math-ECM algorithm and Monte Carlo ECM (MC-ECM) using a real data set. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. A novel finite mixture model based on the generalized scale mixtures of asymmetric generalized normal distributions: properties, estimation methodology and applications
- Author
-
Guan, Ruijie, Jiao, Junjun, Cheng, Weihu, and Hu, Guozhi
- Published
- 2024
- Full Text
- View/download PDF
6. The generalized scale mixtures of asymmetric generalized normal distributions with application to stock data
- Author
-
Ruijie Guan, Aidi Liu, and Weihu Cheng
- Subjects
generalized asymmetric normal distribution ,generalized scale mixtures ,em-type algorithm ,order statistics ,Mathematics ,QA1-939 - Abstract
In this paper, we introduced a family of distributions with a very flexible shape named generalized scale mixtures of generalized asymmetric normal distributions (GSMAGN). We investigated the main properties of the new family including moments, skewness, kurtosis coefficients and order statistics. A variant of the expectation maximization (EM)-type algorithm was established by combining the proflie likihood approach (PLA) with the classical expectation conditional maximization (ECM) algorithm for parameter estimation of this model. This approach with analytical expressions in the E-step and tractable M-step can greatly improve the computational speed and efficiency of the algorithm. The performance of the proposed algorithm was assessed by some simulation studies. The feasibility of the proposed methodology was illustrated through two real datasets.
- Published
- 2024
- Full Text
- View/download PDF
7. Influence diagnostics for skew-t censored linear regression models.
- Author
-
Oliveiraa, Marcos S., Oliveiraa, Daniela C. R., and Lachos, Victor H.
- Subjects
SKEWNESS (Probability theory) ,REGRESSION analysis ,ALGORITHMS - Abstract
This paper proposes some diagnostics procedures for the skew-t linear regression model with censored response. The skew-t distribution is an attractive family of asymmetrical heavy-tailed densities that includes the normal, skew-normal and student’s-t distributions as special cases. Inspired by the power and wide applicability of the EM-type algorithm, local and global influence analysis, based on the conditional expectation of the complete-data log-likelihood function are developed, following Zhu and Lee’s approach. For the local influence analysis, four specific perturbation schemes are discussed. Two real data sets, from education and economics, which are right and left censoring, respectively, are analyzed in order to illustrate the usefulness of the proposed methodology [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
8. Multivariate measurement error models with normal mean‐variance mixture distributions.
- Author
-
Mirfarah, Elham, Naderi, Mehrdad, Lin, Tsung‐I, and Wang, Wan‐Lun
- Abstract
The class of normal mean‐variance mixture (NMVM) distributions is a rich family of asymmetric and heavy‐tailed distributions and has been widely considered in parametric modeling of the data for robust statistical inference. This paper proposes an extension of measurement error models by assuming the NMVM distributions for the unobserved covariates and error terms in the model, referred to as the NMVM‐MEM. An expectation conditional maximization either (ECME) algorithm is developed to compute the maximum likelihood (ML) estimates of model parameters. Furthermore, an information‐based approach is performed to derive the asymptotic covariance matrix of ML estimators. The analysis of a blood pressure dataset illustrates the superiority of NMVM‐MEM to accommodate asymmetry and outliers over the normal counterpart. Two simulation studies are undertaken to validate our proposed techniques. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
9. Parsimonious mixture-of-experts based on mean mixture of multivariate normal distributions.
- Author
-
Sepahdar, Afsaneh, Madadi, Mohsen, Balakrishnan, Narayanaswamy, and Jamalizadeh, Ahad
- Subjects
- *
DISTRIBUTION (Probability theory) , *REGRESSION analysis , *MACHINE learning , *NONLINEAR regression , *GAUSSIAN distribution , *STATISTICS , *WEIBULL distribution , *MIXTURES - Abstract
The mixture-of-experts (MoE) paradigm attempts to learn complex models by combining several “experts” via probabilistic mixture models. Each expert in the MoE model handles a small area of the data space in which a gating function controls the data-to-expert assignment. The MoE framework has been used extensively in designing non-linear models in machine learning and statistics to model the heterogeneity in data for the purpose of regression, classification and clustering. The existing MoE of multi-target regression (MoE-MTR) models for continuous data is based on multivariate normal distributions. However, in many practical situations, for a set of data, a group or groups of observations may exhibit asymmetric and heavy-tailed behaviour, and inference based on symmetric distributions in such situations can unduly affect the fit of the regression model. We introduce here a novel robust multivariate nonnormal MoE model by the use of mean mixture of normal distributions. The proposed model can handle the issues of MoE-MTR models regarding possibly skewed, heavytailed and noisy data. Maximum likelihood estimates of model parameters are developed based on an expectation-maximization (EM)-type algorithm. Parsimony is also obtained by imposing suitable constraints on the expert dispersion matrices. The usefulness of the proposed methodology is illustrated using simulated and real data sets. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
10. Semiparametric inference for the scale-mixture of normal partial linear regression model with censored data.
- Author
-
Naderi, Mehrdad, Mirfarah, Elham, Bernhardt, Matthew, and Chen, Ding-Geng
- Subjects
- *
REGRESSION analysis , *CENSORING (Statistics) , *DATA modeling - Abstract
In the censored data exploration, the classical linear regression model which assumes normally distributed random errors is perhaps one of the commonly used frameworks. However, practical studies have often criticized the classical linear regression model because of its sensitivity to departure from the normality and partial nonlinearity. This paper proposes to solve these potential issues simultaneously in the context of the partial linear regression model by assuming that the random errors follow a scale-mixture of normal (SMN) family of distributions. The postulated method allows us to model data with great flexibility, accommodating heavy tails and outliers. By implementing the B-spline approximation and using the convenient hierarchical representation of the SMN distributions, a computationally analytical EM-type algorithm is developed for obtaining maximum likelihood (ML) parameter estimates. Various simulation studies are conducted to investigate the finite sample properties, as well as the robustness of the model in dealing with the heavy tails distributed datasets. Real-world data examples are finally analyzed for illustrating the usefulness of the proposed methodology. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
11. Clustering asymmetrical data with outliers:Parsimonious mixtures of contaminated mean-mixture of normal distributions
- Author
-
Naderi, Mehrdad, Nooghabi, Mehdi Jabbari, Naderi, Mehrdad, and Nooghabi, Mehdi Jabbari
- Abstract
Mixture modeling has emerged as a statistical tool to perform unsupervised model-based clustering for heterogeneous data. A framework of using contaminated mean-mixture of normal distributions as the components of the mixture model is designed to accommodate asymmetric data with outliers. Fourteen parsimonious variants of the postulated model are introduced by employing an eigenvalue decomposition of the component scale matrices. Simultaneously clustering and outliers detection is an outstanding advantage of the proposed model in analyzing non-normally distributed data. A computationally feasible and flexible EM-type algorithm is outlined for obtaining maximum likelihood parameter estimates. Moreover, the score vector and empirical information matrix for calculating asymptotic standard errors of the parameter estimates are derived by offering an information-based approach. The applicability of the proposed method is demonstrated through the analysis of simulated and real datasets with varying proportions of outliers.
- Published
- 2024
12. Mixture of functional linear models and its application to CO2-GDP functional data
- Author
-
Wang, Shaoli, Huang, Mian, Wu, Xing, and Yao, Weixin
- Subjects
Mixtures of functional linear regressions ,Identifiability ,EM-type algorithm ,Kernel regression ,Functional principal component analysis ,Conditional bootstrap ,Hypothesis test ,Statistics ,Computation Theory and Mathematics ,Econometrics ,Statistics & Probability - Abstract
Functional linear models are important tools for studying the relationship between functional response and covariates. However, if subjects come from an inhomogeneous population that demonstrates different linear relationship between the response and covariates among different subpopulations/clusters, a single functional linear model is no longer adequate for the data. A new class of mixtures of functional linear models for the analysis of heterogeneous functional data is introduced. Identifiability is established for the proposed class of mixture models under mild conditions. The proposed estimation procedures combine the ideas of local kernel regression, functional principal component analysis and EM algorithm. A generalized likelihood ratio test based on a conditional bootstrap is given as to whether the regression coefficient functions are constant. A Monte Carlo simulation study is conducted to examine the finite sample performance of the new methodology. Finally, the analysis of CO2-GDP data reveals the dynamic patterns of relationship between CO2 and GDP among different countries.
- Published
- 2016
13. The skew-t censored regression model: parameter estimation via an EM-type algorithm.
- Author
-
Lachos, Victor H., Bazán, Jorge L., Castro, Luis M., and Jiwon Park
- Subjects
REGRESSION analysis ,PARAMETER estimation - Abstract
The skew-t distribution is an attractive family of asymmetrical heavy-tailed densities that includes the normal, skew-normal and Student’s-t distributions as special cases. In this work, we propose an EM-type algorithm for computing the maximum likelihood estimates for skew-t linear regression models with censored response. In contrast with previous proposals, this algorithm uses analytical expressions at the E-step, as opposed to Monte Carlo simulations. These expressions rely on formulas for the mean and variance of a truncated skew-t distribution, and can be computed using the R library MomTrunc. The standard errors, the prediction of unobserved values of the response and the log-likelihood function are obtained as a by-product. The proposed methodology is illustrated through the analyses of simulated and a real data application on Letter-Name Fluency test in Peruvian students. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
14. Analysis of skewed data by using compound Poisson exponential distribution with applications to insurance claims.
- Author
-
Meraou, Mohammed A., Al-Kandari, Noriah M., Raqab, Mohammad Z., and Kundu, Debasis
- Subjects
- *
POISSON distribution , *DISTRIBUTION (Probability theory) , *PROBABILITY density function , *INSURANCE claims , *RANDOM variables , *EXPECTATION-maximization algorithms - Abstract
The main aim of this paper is to introduce a new family of distributions, namely compound zero-truncated Poisson exponential distribution of which exponential distribution is a special case. The proposed family of distributions represents the zero truncated-Poisson sum of independent and identically distributed exponential random variables. The proposed distribution has two parameters and its probability density function can be skewed and unimodal. It can be used quite effectively in analyzing skewed data. We suggest to use expectation–maximization (EM)-type algorithm to estimate the unknown parameters, and it is observed that it is easy to implement in practice. We further consider the bivariate version of the proposed model which has three parameters and provides different properties. We have performed extensive simulation studies to see the performances of the proposed EM algorithm, and a real data set has been analyzed to see the effectiveness of the proposed models. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
15. Detection of Change Points in Spatiotemporal Data in the Presence of Outliers and Heavy-Tailed Observations
- Author
-
Sun, Bin, Wu, Yuehua, Cameletti, Michela, editor, and Finazzi, Francesco, editor
- Published
- 2018
- Full Text
- View/download PDF
16. A robust class of multivariate fatigue distributions based on normal mean-variance mixture model.
- Author
-
Sasaei, Mahsa, Pourmousa, Reza, Balakrishnan, Narayanaswamy, and Jamalizadeh, Ahad
- Abstract
The Birnbaum–Saunders (BS) distribution, introduced in 1969, is a popular univariate fatigue life distribution which has been widely used to model right-skewed lifetime and reliability data. In this paper, a new class of generalized multivariate BS distributions is proposed based on mean-variance mixture models to accommodate strongly skewed and heavy tailed multivariate lifetime data. Some special cases of this class as well as their properties are then discussed. We present a hierarchical representation which facilitates an efficient EM-type algorithm for the computation of maximum likelihood estimates. Empirical results from a simulation study and real data analyses show that this class of distributions outperforms many existing extensions of the BS distribution in modeling lifetime data. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
17. The inverse Gaussian process with a skew-normal distribution as a degradation model.
- Author
-
Chen, Xudan, Sun, Xinli, Ding, Xiong, and Tang, Jue
- Subjects
- *
INVERSE Gaussian distribution , *GAUSSIAN processes , *GAUSSIAN distribution , *ALGORITHMS - Abstract
This article incorporates a skew-normal distribution into an inverse Gaussian (IG) process to represent the unit-to-unit variability of the degradation rate, while a symmetrical distribution or approximately symmetrical distribution is commonly adopted in the IG process. Then we derive the corresponding lifetime distribution and mean-time-to-failure including two special cases under the condition that the skew-normal distribution reduces to an half-normal distribution or a normal distribution. Moreover, an improved EM-type algorithm is presented to overcome the difficulties in estimating parameters. Finally, two simulation studies and a case application are used to illustrate the advantages of the proposed model. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
18. Multivariate Restricted Skew-Normal Scale Mixture of Birnbaum-Saunders Distribution.
- Author
-
Samary, H., Khodadadi, Z., and Jafarpour, H.
- Subjects
- *
MULTIVARIATE analysis , *SKEWNESS (Probability theory) , *ALGORITHMS , *MIXTURE distributions (Probability theory) , *DISTRIBUTION (Probability theory) - Abstract
In spite of widespread use as well as theoretical properties of the multivariate scale mixture of normal distributions, practical studies show a lack of stability and robustness against asymmetric features such as asymmetry and heavy tails. In this paper, we develop a new multivariate model by assuming the Birnbaum-Saunders distribution for the mixing variable in the scale mixture of the restricted skew-normal distribution. An analytically simple and efficient EM-type algorithm is adopted for iteratively computing maximum likelihood estimate of model parameters. To account standard errors, the observed information matrix is derived analytically by offering an information-based approach. Results obtained from real and simulated datasets are reported to illustrate the practical utility of the proposed methodology. [ABSTRACT FROM AUTHOR]
- Published
- 2020
19. Asymmetric heavy-tailed vector auto-regressive processes with application to financial data.
- Author
-
Maleki, Mohsen, Wraith, Darren, Mahmoudi, Mohammad R., and Contreras-Reyes, Javier E.
- Subjects
- *
FINANCIAL databases , *AUTOREGRESSIVE models , *TIME series analysis , *RANDOM noise theory , *EXPECTATION-maximization algorithms , *WHITE noise , *DATA - Abstract
Vector Auto-regressive (VAR) models are commonly used for modelling multivariate time series and the typical distributional form is to assume a multivariate normal. However, the assumption of Gaussian white noise in multivariate time series is often not reasonable in applications where there are extreme and/or skewed observations. In this setting, inference based on using a Gaussian distributional form will provide misleading results. In this paper, we extended the multivariate setting of autoregressive process, by considering the multivariate scale mixture of skew-normal (SMSN) distributions for VAR innovations. The multivariate SMSN family is able to be represented in a hierarchical form which relatively easily facilitates simulation and an EM-type algorithm to estimate the model parameters. The performance of the proposed model is illustrated by using simulated and real datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
20. An Asynchronous Distributed Expectation Maximization Algorithm for Massive Data: The DEM Algorithm.
- Author
-
Srivastava, Sanvesh, DePalma, Glen, and Liu, Chuanhai
- Subjects
- *
GRID computing , *ALGORITHMS , *GROUP process , *MESSAGE passing (Computer science) , *EXPECTATION-maximization algorithms - Abstract
The family of expectation--maximization (EM) algorithms provides a general approach to fitting flexible models for large and complex data. The expectation (E) step of EM-type algorithms is time-consuming in massive data applications because it requires multiple passes through the full data. We address this problem by proposing an asynchronous and distributed generalization of the EM called the distributed EM (DEM). Using DEM, existing EM-type algorithms are easily extended to massive data settings by exploiting the divide-and-conquer technique and widely available computing power, such as grid computing. The DEM algorithm reserves two groups of computing processes called workers and managers for performing the E step and the maximization step (M step), respectively. The samples are randomly partitioned into a large number of disjoint subsets and are stored on the worker processes. The E step of DEM algorithm is performed in parallel on all the workers, and every worker communicates its results to the managers at the end of local E step. The managers perform the M step after they have received results from a γ-fraction of the workers, where γ is a fixed constant in (0, 1]. The sequence of parameter estimates generated by the DEM algorithm retains the attractive properties of EM: convergence of the sequence of parameter estimates to a local mode and linear global rate of convergence. Across diverse simulations focused on linear mixed-effects models, the DEM algorithm is significantly faster than competing EM-type algorithms while having a similar accuracy. The DEM algorithm maintains its superior empirical performance on a movie ratings database consisting of 10 million ratings. Supplementary material for this article is available online. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
21. Clustering asymmetrical data with outliers: Parsimonious mixtures of contaminated mean-mixture of normal distributions.
- Author
-
Naderi, Mehrdad and Nooghabi, Mehdi Jabbari
- Subjects
- *
OUTLIER detection , *MIXTURES , *EIGENVALUES , *GAUSSIAN distribution - Abstract
Mixture modeling has emerged as a statistical tool to perform unsupervised model-based clustering for heterogeneous data. A framework of using contaminated mean-mixture of normal distributions as the components of the mixture model is designed to accommodate asymmetric data with outliers. Fourteen parsimonious variants of the postulated model are introduced by employing an eigenvalue decomposition of the component scale matrices. Simultaneously clustering and outliers detection is an outstanding advantage of the proposed model in analyzing non-normally distributed data. A computationally feasible and flexible EM-type algorithm is outlined for obtaining maximum likelihood parameter estimates. Moreover, the score vector and empirical information matrix for calculating asymptotic standard errors of the parameter estimates are derived by offering an information-based approach. The applicability of the proposed method is demonstrated through the analysis of simulated and real datasets with varying proportions of outliers. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. Time series models based on the unrestricted skew-normal process.
- Author
-
Zarrin, Parisa, Maleki, Mohsen, Khodadai, Zahra, and Arellano-Valle, Reinaldo B.
- Subjects
- *
TIME series analysis , *MATHEMATICAL statistics , *PROBABILITY theory , *MONTE Carlo method , *DATA analysis - Abstract
The standard location and scale unrestricted (or unified) skew-normal (SUN) family studied by Arellano-Valle and Genton [On fundamental skew distributions. J Multivar Anal. 2005;96:93-116] and Arellano-Valle and Azzalini [On the unification of families of skew-normal distributions. Scand J Stat. 2006;33:561-574], allows the modelling of data which is symmetrically or asymmetrically distributed. The family has a number of advantages suitable for the analysis of stochastic processes such as Auto-Regressive Moving-Average (ARMA) models, including being closed under linear combinations, being able to satisfy the consistency condition of Kolmogorov’s theorem and providing the guarantee of the existence of such a SUN stochastic process. The family is able to be represented in a hierarchical form which can be used for the ease of simulation. In addition, it facilitates an EM-type algorithm to estimate the model parameters. The performances and suitability of the proposed model are demonstrated on simulations and using two real data sets in applications. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
23. Finite mixture of regression models for censored data based on scale mixtures of normal distributions.
- Author
-
Zeller, Camila Borelli, Cabral, Celso Rômulo Barbosa, Lachos, Víctor Hugo, and Benites, Luis
- Abstract
In statistical analysis, particularly in econometrics, the finite mixture of regression models based on the normality assumption is routinely used to analyze censored data. In this work, an extension of this model is proposed by considering scale mixtures of normal distributions (SMN). This approach allows us to model data with great flexibility, accommodating multimodality and heavy tails at the same time. The main virtue of considering the finite mixture of regression models for censored data under the SMN class is that this class of models has a nice hierarchical representation which allows easy implementation of inferences. We develop a simple EM-type algorithm to perform maximum likelihood inference of the parameters in the proposed model. To examine the performance of the proposed method, we present some simulation studies and analyze a real dataset. The proposed algorithm and methods are implemented in the new R package CensMixReg. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
24. EM-Type algorithms for heavy-tailed logistic mixed models.
- Author
-
Santos, Cristiano C. and Loschi, Rosangela H.
- Subjects
- *
LOGISTIC regression analysis , *MAXIMUM likelihood statistics , *MARKOV chain Monte Carlo , *RANDOM effects model , *MULTIPLE integrals - Abstract
This paper aims at evaluating different aspects of Monte Carlo expectation – maximization algorithm to estimate heavy-tailed mixed logistic regression (MLR) models. As a novelty it also proposes a multiple chain Gibbs sampler to generate of the latent variables distributions thus obtaining independent samples. In heavy-tailed MLR models, the analytical forms of the full conditional distributions for the random effects are unknown. Four different Metropolis–Hastings algorithms are assumed to generate from them. We also discuss stopping rules in order to obtain more efficient algorithms in heavy-tailed MLR models. The algorithms are compared through the analysis of simulated andAscaris Suumdata. [ABSTRACT FROM PUBLISHER]
- Published
- 2017
- Full Text
- View/download PDF
25. Maximum likelihood estimation and parameter interpretation in elliptical mixed logistic regression.
- Author
-
Santos, Cristiano and Loschi, Rosangela
- Abstract
We introduce the class of elliptical mixed logistic model with focus on the normal/independent subclass. Parameter interpretation in mixed logistic model is not straightforward since the odds ratio is random. For the proposed models, we obtain the odds ratio distribution and its summaries used to interpret the fixed effects and to measure the heterogeneity among the clusters thus extending previous results. Fisher information is also obtained. A Monte Carlo expectation-maximization algorithm is considered to obtain the maximum likelihood estimates. A simulation study is performed comparing normal and heavy-tailed models. It also address the effect of the misspecification of the random effect distribution and other model aspects in the parameter interpretation. A data analysis is performed showing the utility of heavy-tailed mixed logistic model. Among the main conclusions, we note that the misspecification of the random effect distribution influences the fixed effects interpretation and the quantification of the among clusters heterogeneity. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
26. Inference and further probabilistic properties of the $$ SUN_{n,2}$$ -distribution.
- Author
-
Amiri, Mehdi, Jamalizadeh, Ahad, and Towhidi, Mina
- Subjects
SKEWNESS (Probability theory) ,DISTRIBUTION (Probability theory) ,MULTIVARIATE analysis ,MAXIMUM likelihood statistics ,STOCHASTIC processes - Abstract
In this paper the normal-skew-normal distribution, proposed by Gomez et al. (Statistics 47:411-421, ), is extended to the multivariate case. It is also a special case of $$SUN_{n,2}$$ -distribution, recently studied by Arellano-Valle and Genton (Chil J Stat 2:17-34, ). We show that the proposed distribution can be expressed as a shape mixture of the multivariate extended skew-normal distribution. Applying this property leads to deriving stochastic representations for the proposed distribution. Also we give some basic properties for this new family. Computational techniques using EM-type algorithms are employed for iteratively computing maximum likelihood estimates. Finally, an application of the new distribution is illustrated using some real data sets. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
27. Robust mixture regression modeling based on the normal mean-variance mixture distributions.
- Author
-
Naderi, Mehrdad, Mirfarah, Elham, Wang, Wan-Lun, and Lin, Tsung-I
- Subjects
- *
REGRESSION analysis , *CONDITIONAL expectations , *QUANTILE regression , *BUILDING additions , *RANDOM variables , *MISSING data (Statistics) , *LATENT variables - Abstract
Mixture regression models (MRMs) are widely used to capture the heterogeneity of relationships between the response variable and one or more predictors coming from several non-homogeneous groups. Since the conventional MRMs are quite sensitive to departures from normality caused by extra skewness and possible heavy tails, various extensions built on more flexible distributions have been put forward in the last decade. The class of normal mean-variance mixture (NMVM) distributions that arise from scaling both the mean and variance of a normal random variable with a common mixing distribution encompasses many prominent (symmetric or asymmetrical) distributions as special cases. A unified approach to robustifying MRMs is proposed by considering the class of NMVM distributions for component errors. An expectation conditional maximization either (ECME) algorithm, which incorporates membership indicators and the latent scaling variables as the missing data, is developed for carrying out maximum likelihood (ML) estimation of model parameters. Four simulation studies are conducted to examine the finite-sample property of ML estimators and the robustness of the proposed model against outliers for contaminated and noisy data. The usefulness and superiority of our methodology are demonstrated through applications to two real datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
28. Mixture of functional linear models and its application to CO[formula omitted]-GDP functional data.
- Author
-
Wang, Shaoli, Huang, Mian, Wu, Xing, and Yao, Weixin
- Subjects
- *
FUNCTIONAL analysis , *LINEAR statistical models , *CLUSTER analysis (Statistics) , *PRINCIPAL components analysis , *COEFFICIENTS (Statistics) - Abstract
Functional linear models are important tools for studying the relationship between functional response and covariates. However, if subjects come from an inhomogeneous population that demonstrates different linear relationship between the response and covariates among different subpopulations/clusters, a single functional linear model is no longer adequate for the data. A new class of mixtures of functional linear models for the analysis of heterogeneous functional data is introduced. Identifiability is established for the proposed class of mixture models under mild conditions. The proposed estimation procedures combine the ideas of local kernel regression, functional principal component analysis and EM algorithm. A generalized likelihood ratio test based on a conditional bootstrap is given as to whether the regression coefficient functions are constant. A Monte Carlo simulation study is conducted to examine the finite sample performance of the new methodology. Finally, the analysis of CO 2 -GDP data reveals the dynamic patterns of relationship between CO 2 and GDP among different countries. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
29. A new mixture model on the simplex
- Author
-
Ongaro, A, Migliorati, S, Ascari, R, Ongaro, Andrea, Migliorati, Sonia, Ascari, Roberto, Ongaro, A, Migliorati, S, Ascari, R, Ongaro, Andrea, Migliorati, Sonia, and Ascari, Roberto
- Abstract
This paper is meant to introduce a significant extension of the flexible Dirichlet (FD) distribution, which is a quite tractable special mixture model for compositional data, i.e. data representing vectors of proportions of a whole. The FD model displays several theoretical properties which make it suitable for inference, and fairly easy to handle from a computational viewpoint. However, the rigid type of mixture structure implied by the FD makes it unsuitable to describe many compositional datasets. Furthermore, the FD only allows for negative correlations. The new extended model, by considerably relaxing the strict constraints among clusters entailed by the FD, allows for a more general dependence structure (including positive correlations) and greatly expands its applicative potential. At the same time, it retains, to a large extent, its good properties. EM-type estimation procedures can be developed for this more complex model, including ad hoc reliable initialization methods, which permit to keep the computational issues at a rather uncomplicated level. Accurate evaluation of standard error estimates can be provided as well.
- Published
- 2020
30. A value-at-risk analysis of carry trades using skew-GARCH models.
- Author
-
Wang, Yu-Jen, Chung, Huimin, and Guo, Jia-Hau
- Subjects
VALUE at risk ,GARCH model ,MONEY ,INTEREST rates ,DISTRIBUTION (Economic theory) ,RATE of return ,HETEROSCEDASTICITY - Abstract
We carry out a value-at-risk (VaR) analysis of an extremely popular strategy in the currency markets, namely, 'carry trades,' whereby a position purchased in high interest rate currencies is funded by selling low interest rate currencies. Since the natural outcome of the truncated normal distribution of interest-rate spreads combined with the normal distribution of exchange rate returns is a skew-normal distribution, we consider a skew-normal innovation with zero mean for our analysis of carry trade returns using generalized autoregressive conditional heteroskedasticity (GARCH) models. The stress testing results reveal that skew-normal or densities are suitable for the measurement of VaR for carry trade returns involving, for example, taking up a long position in Australian Dollars or Argentine Peso which are funded by selling Japanese Yen. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
31. Joint generalized estimating equations for multivariate longitudinal binary outcomes with missing data: an application to acquired immune deficiency syndrome data.
- Author
-
Lipsitz, Stuart R., Fitzmaurice, Garrett M., Ibrahim, Joseph G., Sinha, Debajyoti, Parzen, Michael, and Lipshultz, Steven
- Subjects
EQUATIONS ,MULTIVARIATE analysis ,LONGITUDINAL method ,BINARY number system ,AIDS ,HEART diseases ,HIV infections - Abstract
In a large, prospective longitudinal study designed to monitor cardiac abnormalities in children born to women who are infected with the human immunodeficiency virus, instead of a single outcome variable, there are multiple binary outcomes (e.g. abnormal heart rate, abnormal blood pressure and abnormal heart wall thickness) considered as joint measures of heart function over time. In the presence of missing responses at some time points, longitudinal marginal models for these multiple outcomes can be estimated by using generalized estimating equations (GEEs), and consistent estimates can be obtained under the assumption of a missingness completely at random mechanism. When the missing data mechanism is missingness at random, i.e. the probability of missing a particular outcome at a time point depends on observed values of that outcome and the remaining outcomes at other time points, we propose joint estimation of the marginal models by using a single modified GEE based on an EM-type algorithm. The method proposed is motivated by the longitudinal study of cardiac abnormalities in children who were born to women infected with the human immunodeficiency virus, and analyses of these data are presented to illustrate the application of the method. Further, in an asymptotic study of bias, we show that, under a missingness at random mechanism in which missingness depends on all observed outcome variables, our joint estimation via the modified GEE produces almost unbiased estimates, provided that the correlation model has been correctly specified, whereas estimates from standard GEEs can lead to substantial bias. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
32. A two-sample robust Bayesian Mendelian Randomization method accounting for linkage disequilibrium and idiosyncratic pleiotropy with applications to the COVID-19 outcomes.
- Author
-
Wang A, Liu W, and Liu Z
- Subjects
- Bayes Theorem, Genetic Pleiotropy, Genome-Wide Association Study, Humans, Linkage Disequilibrium, Models, Genetic, COVID-19 genetics, Mendelian Randomization Analysis methods
- Abstract
Mendelian randomization (MR) is a statistical method exploiting genetic variants as instrumental variables to estimate the causal effect of modifiable risk factors on an outcome of interest. Despite wide uses of various popular two-sample MR methods based on genome-wide association study summary level data, however, those methods could suffer from potential power loss or/and biased inference when the chosen genetic variants are in linkage disequilibrium (LD), and also have relatively large direct effects on the outcome whose distribution might be heavy-tailed which is commonly referred to as the idiosyncratic pleiotropy phenomenon. To resolve those two issues, we propose a novel Robust Bayesian Mendelian Randomization (RBMR) model that uses the more robust multivariate generalized t $t$ -distribution to model such direct effects in a probabilistic model framework which can also incorporate the LD structure explicitly. The generalized t $t$ -distribution can be represented as a Gaussian scaled mixture so that our model parameters can be estimated by the expectation maximization (EM)-type algorithms. We compute the standard errors by calibrating the evidence lower bound using the likelihood ratio test. Through extensive simulation studies, we show that our RBMR has robust performance compared with other competing methods. We further apply our RBMR method to two benchmark data sets and find that RBMR has smaller bias and standard errors. Using our proposed RBMR method, we find that coronary artery disease is associated with increased risk of critically ill coronavirus disease 2019. We also develop a user-friendly R package RBMR (https://github.com/AnqiWang2021/RBMR) for public use., (© 2022 Wiley Periodicals LLC.)
- Published
- 2022
- Full Text
- View/download PDF
33. Modelos de regressão para dados censurados sob a classe de distribuições de misturas de escala normal assimétricas
- Author
-
Guzman, Daniel Camilo Fuentes, Ferreira, Clécio da Silva, Zeller, Camila Borelli, Matos, Larissa Avila, and Magalhães, Tiago Maia
- Subjects
Modelo de regressão para dados censurado ,Algoritmo MCEM ,Heavy tails ,Misturas de escala normal assimétricas ,Skew scale mixtures of normal distributions ,CIENCIAS EXATAS E DA TERRA::MATEMATICA [CNPQ] ,Caudas pesadas ,EM-type algorithm ,Censored regression model - Abstract
Um problema frequente na análise de regressão é quando a observação da variável resposta é censurada para alguns indivíduos. Isto ocorre em várias situações práticas, por razões como limitações do equipamento de medição ou do desenho experimental. Estes fenômenos podem ser modelados mediante modelos estatísticos e matemáticos. No âmbito dos modelos de regressão censurados, os erros aleatórios são rotineiramente considerados como tendo uma distribuição normal, principalmente por conveniência matemática. No entanto, este método tem sido criticado na literatura por causa de sua sensibilidade a desvios da suposição de normalidade. Nessa dissertação, primeiro estabelecemos uma nova ponte entre o modelo de regressão censurado e a classe de distribuições assimétricas estudadas por Ferreira et al. [13]. As misturas de escala assimétricas das distribuições normais são frequentemente utilizadas para procedimentos estatísticos que envolvem dados assimétricos e caudas pesadas. A principal virtude dos membros dessa família de distribuições é que eles são fáceis de serem simulados e também fornecem algoritmos tipo Esperança-Maximização (EM) para a estimativa de máxima verosimilhança. Neste trabalho, estendemos o algoritmo EM para o algoritmo MCEM para modelos de regressão lineares censurados. O algoritmo do tipo EM foi discutido com ênfase nas distribuições Normal Assimétrica, t-Student Assimétrica, Slash Assimétrica e Normal-Contaminada Assimétrica. Os métodos propostos são verificados através da análise de vários estudos de simulação e aplicação em conjuntos de dados reais. A frequent problem in regression analysis is when the observation of the response variable is censored for some subjects. This occurs in several practical situations, for reasons such as limitations of the measuring equipment or the experimental design. These phenomena can be modeled using statistical and mathematical models. In the framework of censored regression models the random errors are routinely assumed to have a normal distribution, mainly for mathematical convenience. However, this method has been criticized in the literature because of its sensitivity to deviations from the normality assumption. In this dissertation, we first establish a new link between the censored regression model and the class of asymmetric distributions studied by Ferreira et al. [13]. Skew scale mixtures of normal distributions are often used for statistical procedures involving asymmetric data and heavy-tailed. The main virtue of the members of this family of distributions is that they are easy to simulate and also provide expectation-maximization (EM) algorithms for maximum likelihood estimation. In this work, we extend the EM algorithm for the MCEM algorithm for linear regression models censored. The EM-type algorithm has been discussed with an emphasis on the Skew-normal, Skew Student-t-normal, Skew slash and Skew-contaminated normal distributions. The proposed methods are verified through the analysis of several simulation studies and applying in real datasets.
- Published
- 2018
34. Inference and further probabilistic properties of the S U N n , 2 -distribution
- Author
-
Amiri, Mehdi, Jamalizadeh, Ahad, and Towhidi, Mina
- Published
- 2015
- Full Text
- View/download PDF
35. Mixture of linear experts model for censored data: A novel approach with scale-mixture of normal distributions.
- Author
-
Mirfarah, Elham, Naderi, Mehrdad, and Chen, Ding-Geng
- Subjects
- *
CENSORING (Statistics) , *GAUSSIAN distribution , *MATHEMATICAL errors , *GAUSSIAN mixture models , *DATA modeling , *MIXTURES - Abstract
Mixture of linear experts (MoE) model is one of the widespread statistical frameworks for modeling, classification, and clustering of data. Built on the normality assumption of the error terms for mathematical and computational convenience, the classical MoE model has two challenges: (1) it is sensitive to atypical observations and outliers, and (2) it might produce misleading inferential results for censored data. The aim is then to resolve these two challenges, simultaneously, by proposing a robust MoE model for model-based clustering and discriminant censored data with the scale-mixture of normal (SMN) class of distributions for the unobserved error terms. An analytical expectation–maximization (EM) type algorithm is developed in order to obtain the maximum likelihood parameter estimates. Simulation studies are carried out to examine the performance, effectiveness, and robustness of the proposed methodology. Finally, a real dataset is used to illustrate the superiority of the new model. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
36. Mixture of functional linear models and its application to CO2-GDP functional data
- Author
-
Wang, S, Huang, M, Wu, X, and Yao, W
- Subjects
Functional principal component analysis ,Statistics & Probability ,Mixtures of functional linear regressions ,Statistics ,Kernel regression ,Identifiability ,Computation Theory and Mathematics ,Econometrics ,Hypothesis test ,EM-type algorithm ,Conditional bootstrap - Abstract
Functional linear models are important tools for studying the relationship between functional response and covariates. However, if subjects come from an inhomogeneous population that demonstrates different linear relationship between the response and covariates among different subpopulations/clusters, a single functional linear model is no longer adequate for the data. A new class of mixtures of functional linear models for the analysis of heterogeneous functional data is introduced. Identifiability is established for the proposed class of mixture models under mild conditions. The proposed estimation procedures combine the ideas of local kernel regression, functional principal component analysis and EM algorithm. A generalized likelihood ratio test based on a conditional bootstrap is given as to whether the regression coefficient functions are constant. A MonteCarlo simulation study is conducted to examine the finite sample performance of the new methodology. Finally, the analysis of CO2-GDP data reveals the dynamic patterns of relationship between CO2 and GDP among different countries.
- Published
- 2016
- Full Text
- View/download PDF
37. A flexible factor analysis based on the class of mean-mixture of normal distributions.
- Author
-
Hashemi, Farzane, Naderi, Mehrdad, Jamalizadeh, Ahad, and Bekker, Andriette
- Subjects
- *
GAUSSIAN distribution , *STATISTICS , *DATA reduction , *FACTOR analysis , *DATA structures - Abstract
Factor analysis is a statistical technique for data reduction and structure detection that traditionally relies on the normality assumption for factors. However, due to the presence of non-normal features such as asymmetry and heavy tails in many practical situations, the first two moments cannot adequately explain the factors. An extension of the factor analysis model is introduced by assuming a generalization of the multivariate restricted skew-normal distribution for the vector of unobserved factors. An efficient and computationally tractable EM-type algorithm is adopted for computing the maximum likelihood estimates by presenting a hierarchical representation of the proposed model. Finally, the efficiency and advantages of the proposed novel methodology are demonstrated through both simulated and real benchmark datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
38. Clustering right-skewed data stream via Birnbaum–Saunders mixture models: A flexible approach based on fuzzy clustering algorithm.
- Author
-
Hashemi, Farzane, Naderi, Mehrdad, and Mashinchi, Mashallah
- Subjects
FUZZY algorithms ,GAUSSIAN mixture models ,MIXTURES ,CLUSTER sampling - Abstract
Despite the widespread use of Gaussian mixture model for clustering datasets, practical applications show that the skewed and leptokurtic mixture models can be considered as promising alternatives. This paper proposes a finite mixture of Birnbaum–Saunders (FM-BS) distributions for analyzing and clustering right-skewed, leptokurtic, and multimodal lifetime datasets. The maximum likelihood (ML) estimates of the proposed model are obtained by developing a computationally analytical expectation–maximization (EM) type algorithm, as well as a fuzzy classification maximum likelihood (FCML) type algorithm, that combines the advantages of fuzzy clustering and robust statistical estimators. Simulation studies demonstrate the accuracy and computational efficiency of the FCML algorithm to estimate parameters of the FM-BS distributions and to cluster samples drawn from the FM-BS distributions. Finally, some real datasets have been analyzed to illustrate how well the proposed FM-BS model estimates the membership values. • A finite mixture model for clustering right-skewed, and multimodal data is proposed. • The EM and FCML type algorithms are implemented for computing ML estimates. • Asymptotic standard errors of parameter estimate are obtained through. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
39. Estimation in the pile-up model with application to fluorescence lifetime measurements
- Author
-
Rebafka, Tabea, Laboratoire Traitement et Communication de l'Information (LTCI), Télécom ParisTech-Institut Mines-Télécom [Paris] (IMT)-Centre National de la Recherche Scientifique (CNRS), Laboratoire Outils d'Analyse des Données (LOAD), Département Métrologie Instrumentation & Information (DM2I), Laboratoire d'Intégration des Systèmes et des Technologies (LIST (CEA)), Direction de Recherche Technologique (CEA) (DRT (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Direction de Recherche Technologique (CEA) (DRT (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Laboratoire d'Intégration des Systèmes et des Technologies (LIST (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay, Télécom ParisTech, François Roueff, Laboratoire d'Intégration des Systèmes et des Technologies (LIST), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Direction de Recherche Technologique (CEA) (DRT (CEA)), and Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Laboratoire d'Intégration des Systèmes et des Technologies (LIST)
- Subjects
Adaptive estimator ,analyse statistique ,traitement du signal ,Estimation de paramètres ,spectrum analysis ,Pile-up model ,[STAT.TH]Statistics [stat]/Statistics Theory [stat.TH] ,EM-type algorithm ,Tchebychev ,Approximation de Tchebychev ,échantillonneur de Gibbs ,statistical analysis ,analyse de spectre ,[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing ,Gibbs sampler ,algorithme EM ,Minimax estimation ,fluorescence ,[PHYS.PHYS.PHYS-INS-DET]Physics [physics]/Physics [physics]/Instrumentation and Detectors [physics.ins-det] ,signal processing ,modèle d’empilement ,[STAT.ME]Statistics [stat]/Methodology [stat.ME] ,Mesures de Gibbs ,Exponential mixtures ,Cramér-Rao bound - Abstract
This thesis studies the so-called pile-up model and proposes adequate estimators. An observation of the pile-up model is the minimum of a random number of variables from the target distribution. The pile-up distribution is the result of a non linear distortion of the target distribution. The goal is to identify the target distribution from observations of the pile-up model. The model is motivated by the application TCSPC in time-resolved fluorescence, where the extent of distortion is determined by a tuning parameter selected by the user. A study of the Cramér-Rao bound provides the best value of this parameter. Simulations with a Gibbs sampler confirm the theoretical results on a significant reduction of the variance compared to the current practice. Another estimator is proposed by a maximum likelihood approach based on a new contrast and whose computation time is satisfactory. In many cases the estimator can be computed by an EM-type algorithm. Furthermore, the consistence as well as the limit distribution is established. A comparison to the current practice in fluorescence shows that a reduction of the acquisition time by a factor 10 is possible. In the last part, a non parametric estimator of the mixing density of an infinite mixture of exponential densities is proposed. The estimator is based on orthogonal series and it is shown to be optimal in the sense that its mean integrated square error achieves the minimax rate on some specific smoothness spaces. Moreover, the estimator can be adapted to the pile-up model, when the target distribution is an infinite exponential mixture.; Cette thèse étudie le modèle d’empilement et propose des estimateurs appropriés. Une observation de ce modèle est le minimum d’un nombre aléatoire de variables de la loi initiale. La distribution du modèle d’empilement est le résultat d’une distorsion non linéaire de la loi initiale. L’objectif est d’identifier la loi initiale à partir des observations du modèle d’empilement. Le modèle est motivé par l’application TCSPC en fluorescence, où l’ampleur de la distorsion est déterminée par un paramètre de réglage sélectionné par l’utilisateur. Une étude de la borne de Cramér-Rao fournit la meilleure valeur de ce paramètre. Des simulations avec un échantillonneur de Gibbs confirment les résultats théoriques sur une réduction significative de la variance en comparaison avec la pratique habituelle. Un autre estimateur est proposé par une approche de maximum de vraisemblance basé sur un nouveau contraste et dont le temps de calcul est satisfaisant. Dans des nombreux cas, l’estimateur peut se calculer par un algorithme de type E. M. Par ailleurs, la consistance ainsi que la loi limite de cet estimateur sont établies. Une comparaison avec la pratique actuelle en fluorescence montre qu’une réduction du temps d’acquisition d’un facteur 10 est envisageable. Finalement, un estimateur non paramétrique de la densité mélangeante d’un mélange infini de lois exponentielles est proposé. Celui-ci est basé sur des séries orthogonales et se montre optimal dans le sens que son erreur quadratique atteint la vitesse minimax dans des espaces de régularité bien choisis. Cet estimateur est aussi adapté au modèle d’empilement, lorsque la loi initiale est un mélange infini de lois exponentielles.
- Published
- 2009
40. Item response prediction for incomplete response matrix using the EM-type item response theory with application to adaptive online ability evaluation system.
- Author
-
Hirose, Hideo and Sakumura, Takenori
- Abstract
The item response theory (IRT) gives us the valuable information about the difficulties of problems as well as the abilities of students, whereas the classical test method provides only the abilities of students with pre-determined scores to each problem. To enhance the use of the IRT, we have developed a concise IRT evaluation Web system via the drag-and-drop Excel file in which 0/1 scores of the test result are stored. In addition, we have introduced an online adaptive IRT system to assess the students' abilities more accurately with fewer problems. In such a system, the item bank is pre-stored and the problem difficulties are determined in advance. However, as the number of online adaptive examinees becomes large, the calibration for parameters to problems, incorporating the new examinees' results for problem difficulties, may be needed. For the calibration, parameter estimation methods of problem difficulties and students' abilities for incomplete response matrices are required. In this paper, we propose a new method to estimate the problem difficulties and students' abilities for incomplete item response matrices via the LIRT, which is based on the item response theory and the EM-type algorithm. Then, we show a calibration procedure expressing the problem difficulties and students' abilities to some online adaptive system. We have found the estimates for discrimination parameters vary to some extent from the beginning to the end. However, the estimates for the difficulty parameters do not vary much, which corresponds to that the estimates for the ability parameters do not vary much. [ABSTRACT FROM PUBLISHER]
- Published
- 2012
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.