40 results on '"Notodiputro, Khairil A."'
Search Results
2. Small area estimation with multiple covariates under structural measurement error models
- Author
-
Wulandari, Ita, Kurnia, Anang, Notodiputro, Khairil Anwar, and Fitrianto, Anwar
- Published
- 2023
- Full Text
- View/download PDF
3. Small Area Estimation of Sub-District’s Per Capita Expenditure through Area Effects Selection using LASSO Method
- Author
-
Pusponegoro, Novi Hidayat, Kurnia, Anang, Notodiputro, Khairil Anwar, Soleh, Agus Mohamad, and Astuti, Erni Tri
- Published
- 2021
- Full Text
- View/download PDF
4. Density Estimation of Neonatal Mortality Rate Using Empirical Bayes Deconvolution in Central Java Province, Indonesia
- Author
-
Novkaniza, Fevi, Notodiputro, Khairil Anwar, Mangku, I Wayan, and Sadik, Kusman
- Published
- 2021
- Full Text
- View/download PDF
5. Spatial Empirical Best Predictor of Small Area Poverty Indicator.
- Author
-
Handayani, Dian, Notodiputro, Khairil Anwar, Saefuddin, Asep, Mangku, I Wayan, and Kurnia, Anang
- Subjects
POVERTY areas ,SAMPLE size (Statistics) ,POVERTY ,FORECASTING - Abstract
Information about some poverty indicators is important not only for the large administrative level but also for lower administrative level. This information can be obtained from many surveys. Unfortunately, many surveys are usually designed to satisfy accuracy for large populations. As a result, it is often encountered that the sample size from some sub-populations which can be obtained from a survey is too small to produce a reliable direct estimator. The sub-population which the selected sample from it is not large enough to produce a reliable direct estimator is also called a small area. In this paper, we propose the spatial empirical best predictor (SEBP) for some poverty indicators in some small areas. The SEBP is derived under a unit-level spatial lognormal mixed model which incorporates spatial dependence into the covariance structure. The mean square prediction error (MSPE) of the SEBP is estimated by the parametric bootstrap method. A simulation study was conducted to evaluate the performance of the SEBP compared to the direct estimates as well as the empirical best predictor (EBP). Further, the SEBP was also applied to obtain the estimates of some poverty indicators for some sub-districts in Bogor, Indonesia. The results showed that there is a substantial reduction in MSPE of the SEBP over the direct estimates and the EBP for almost all sub-districts. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Comparison of GARCH, LSTM, and Hybrid GARCH-LSTM Models for Analyzing Data Volatility.
- Author
-
Atul Mualifah, Laily Nissa, Soleh, Agus Mohamad, and Notodiputro, Khairil Anwar
- Subjects
ARTIFICIAL intelligence ,TIME series analysis ,GARCH model ,MACHINE learning ,STATISTICAL models - Abstract
Most statistical methods in time series analytics assume that the residuals are independently and identically distributed with zero mean and constant variance. In real cases, this assumption may be violated. Nowadays, data are dynamic and highly volatile, particularly in finance. The Generalized Autoregressive Conditional Heteroscedasticity (GARCH) model is a statistical method for nonconstant conditional variance that can capture the volatility data. Recently, artificial intelligence methods are gaining popularity and have promising performance, one of those is the Long Short-Term Memory (LSTM) method. However, due to the filtering process by forget gate in the LSTM cell some information is missing, which can decrease the prediction’s accuracy. This study proposes a method, namely Hybrid GARCH-LSTM, to overcome those limitation. The performance of the proposed method is evaluated in the simulation and empirical data and compared with GARCH and LSTM model. The results show that the Hybrid GARCH–LSTM model is able to recognize the volatility pattern of data well and outperforms all the other models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. BHF and copula models in small area estimation for household per capita expenditure in Bogor District.
- Author
-
BELINDA, NADIRA SRI, NOTODIPUTRO, KHAIRIL ANWAR, and SOLEH, AGUS MOHAMAD
- Subjects
- *
HOUSEHOLDS , *PER capita , *SKEWNESS (Probability theory) , *DATA analysis - Abstract
Small area statistics are required when the sample size is small to produce estimates with adequate precision. The assumptions underlying Battese-Harter-Fuller (BHF) unit-level models may often be unrealistic in some applications. Copula is an alternative approach when the assumptions are violated. This research discusses the performance of BHF and Copula in small area estimation (SAE) for estimating household per capita expenditure in sub-district levels. This study presents household per capita expenditure, which has a skewed distribution. Due to the fact that the data contains outliers, an appropriate method to handle outliers is also considered. In this research, the Gaussian and the Clayton Copulas are used. The results showed that the performance of BHF was better than Gaussian and Clayton Copulas, as indicated by small root mean square error (RMSE) with an average of 1.14, while the average RMSE of Gaussian copula was 2.71 and Clayton copula was 2.63. Furthermore, the coefficient of variation (CV) of BHF was also smaller compared to Gaussian and Clayton Copulas, and the resulting estimates can be categorized as reliable based on the CV of less than 25%. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. Generalized Linear Mixed Model and missing values handling using imputation methods on longitudinal data with Poisson distribution response.
- Author
-
Zubedi, Fahrezal, Notodiputro, Khairil Anwar, and Sartono, Bagus
- Subjects
- *
MISSING data (Statistics) , *POISSON distribution , *MULTIPLE imputation (Statistics) , *LONGITUDINAL method , *MEDIAN (Mathematics) , *CITIES & towns , *POVERTY areas - Abstract
Longitudinal data consists of repeated observations made from time to time on each individual. The correlation between observations within the same unit in longitudinal data makes the Generalized Linear Mixed Model (GLMM) an appropriate method for the analysis of longitudinal data. GLMM will not have a good estimation if the data contains missing values. To solve this problem, the imputation method is used. This paper discusses two imputation methods, namely mean and median row. The missing value is filled with the average or median value of all the values in the subject. It is known that poverty cases in districts/cities in Eastern Indonesia contain the missing value. Thus, poverty cases in these areas can be used as case studies of the data used. The purpose of the study is to form a GLMM model longitudinal with the Poisson response variable on poverty data in Eastern Indonesia to obtain factors that positively and negatively affect poverty in eastern Indonesia and obtain the right method of median row and mean row in the imputation methods to handle missing values based on the AIC value of the resulting estimate using GLMM. A mean row method is recommended to impute the missing values in Longitudinal data of poverty cases in Eastern Indonesia's districts/cities. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. Flood disaster study in Indonesia with generalized linear mixed model tree approach.
- Author
-
Al Mahkya, Dani, Notodiputro, Khairil Anwar, and Sartono, Bagus
- Subjects
- *
DISASTER victims , *FLOOD warning systems , *FLOODS , *DISASTERS , *RANDOM variables , *SECONDARY analysis - Abstract
Some of the collected data may have a non-normal distribution. One approach that can be used to model the phenomenon of data having non-normal response variables and random effects is the Generalized Linear Mixed Model (GLMM). In its development, the GLMM model can be combined with a decision tree-based method approach called the GLMM tree. Flooding is one of the problems faced by all local governments in Indonesia. Based on data from bnpb.go.id, in 2021 there were 724 flood disasters spread across almost all provinces in Indonesia. And the number of victims reached 4,682,923 people and 99,841 buildings were affected. This study aims to model the phenomenon of flooding that occurred in Indonesia with the GLMM tree approach. The focus of the research that will be discussed is related to flood victims and flood disasters that occurred in Indonesia. The data used in this study is secondary data obtained from several related institutions' websites. This research is expected to be input in terms of decision making and so on. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
10. Selection of variables based on nonconcave penalized likelihood using lasso, elastic net, and SCAD method.
- Author
-
Diwidian, Femmy, Notodiputro, Khairil Anwar, and Sartono, Bagus
- Subjects
- *
LOW birth weight , *LEAST squares , *DEMOGRAPHIC surveys , *REGRESSION analysis , *LINEAR statistical models - Abstract
Variable selection is essential in linear regression analysis to improve predictability and select significant variables. Estimating the regression coefficient on high-dimensional data cannot be done using the least squares method, so it requires specific analytical techniques. Approaches that can take on high-dimensional data include SCAD, LASSO, and Elastic Net. This research will analyze the most crucial method between SCAD, LASSO, and Elastic Net on Low Birth Weight (LBW) data in East Nusa Tenggara (NTT). Two methods are used in this study, first, comparing the SCAD, LASSO, and Elastic Net methods using simulation data, and second, applying the logistic regression method to actual data. The data used in this study is the LBW data by fertile women in NTT from the 2017 IDHS (Indonesian Demographic and Health Survey) data. The analysis shows that the results obtained through simulation and data reveal, based on the value of the AIC model goodness test, the SCAD is better than the other methods with the smallest AIC value of 17. 58878, smaller than the AIC LASSO value of 17.90169 and Elnet of 17.88728. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. Evaluation of naïve and covariance algorithms in variable selection methods.
- Author
-
Nurfadilah, Khalilah, Notodiputro, Khairil Anwar, Sartono, Bagus, Santi, Vera Maya, and Warti, Rini
- Subjects
- *
MALE models , *ALGORITHMS , *MATHEMATICAL variables - Abstract
One of the essential aspects of modeling is the model's simplicity. Several methods can simplify the model, including Ridge regression, LASSO, and Elastic Net. We developed an algorithm for selecting the variables in these models, namely Naïve and Covariance. Previous research revealed that the Covariance algorithm is superior in terms of time compared to the Naïve algorithm. It is evaluated by applying models and algorithms to male sex behavior data with the criteria of goodness, namely the simplicity of the model and the minimum AIC value. Based on the study's results, it found that the Covariance algorithm still outperformed the Naïve algorithm in all three models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. Linear Mixed Model for Analyzing Longitudinal Data: A Simulation Study of Children Growth Differences
- Author
-
Pusponegoro, Novi Hidayat, Rachmawati, Ro’fah Nur, Notodiputro, Khairil Anwar, and Sartono, Bagus
- Published
- 2017
- Full Text
- View/download PDF
13. Group LASSO for Rainfall Data Modeling in Indramayu District, West Java, Indonesia
- Author
-
Rachmawati, Ro’fah Nur, Pusponegoro, Novi Hidayat, Muslim, Agus, Notodiputro, Khairil Anwar, and Sartono, Bagus
- Published
- 2017
- Full Text
- View/download PDF
14. Hierarchical Bayesian Models for Small Area Estimation under Overdispersed Count Data.
- Author
-
Wulandari, Ita, Notodiputro, Khairil Anwar, Fitrianto, Anwar, and Kurnia, Anang
- Subjects
- *
MARKOV chain Monte Carlo , *BAYESIAN analysis - Abstract
Bayesian analysis was applied to small area models with overdispersed response variables. The benefits of implementing this strategy by Markov Chain Monte Carlo methods make inference straightforward and computationally feasible. In this paper, we apply the strategy into area-level modeling to predict the under-five mortality rate at the district level in Java Island, the most populated region in Indonesia. The result shows that the zero-inflated negative binomial model yields the reduced relative standard error and relative mean squared error when compared to district estimates, the zeroinflated generalized Poisson and Poisson models. [ABSTRACT FROM AUTHOR]
- Published
- 2023
15. The spatial empirical Bayes predictor of the small area mean for a lognormal variable of interest and spatially correlated random effects
- Author
-
Handayani, Dian, Folmer, Henk, Kurnia, Anang, and Notodiputro, Khairil Anwar
- Published
- 2018
- Full Text
- View/download PDF
16. Classification of household poverty in West Java using the generalized mixed-effects trees model.
- Author
-
RAHMAWATI, FARDILLA, NOTODIPUTRO, KHAIRIL ANWAR, and SADIK, KUSMAN
- Subjects
- *
STATISTICAL models , *MACHINE learning , *RANDOM effects model , *FIXED effects model , *REGRESSION analysis - Abstract
Dealing with fixed effects and random effects can be accomplished by combining statistical modeling and machine learning techniques. This paper discusses the modeling of fixed effects and random effects using a statistical machine-learning approach. We used the generalized mixed-effects trees (GMET), a tree-based mixed-effect model for dealing with response variables that belong to the exponential family of distributions. In this study, both simulation and actual/empirical data utilized the GMET method to discover data conditions that were appropriate for employing this approach. The simulation data was generated using different response variable generations, as well as different values of the variance of random effect and fixed effect coefficients. The findings indicated that the GMET performs similarly for different response variable generation scenarios. However, it performed better when the fixed effect value and the variance of random effects were large. When applied to the empirical data, the GMET method describes fixed effects and random effects and classifies household poverty status quite well based on the area under curve (AUC) value. It has also revealed that important variables for poverty classification are the number of household members, owning land, the type of main fuel used for cooking, and the main source of water used for drinking. In order to address the socioeconomic disparity that leads to poverty, the government may become concerned about these factors. In addition to that information, the use of regional typology as a random effect in the model has also contributed to the variation of household poverty status. Based on research, the fixed effects in mixed models do not need to be linear and GMET may be employed in grouped data structures, giving the GMET technique the ability to compete with other approaches/methods. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
17. Performance of copula and nested error regression models in estimating per capita expenditure of sub-district in Pidie Regency.
- Author
-
HASANAH, NUR, NOTODIPUTRO, KHAIRIL ANWAR, and SARTONO, BAGUS
- Subjects
- *
CONSUMPTION (Economics) , *PER capita , *COPULA functions , *ERROR analysis in mathematics - Abstract
In unit-level small area estimation (SAE), the commonly used nested error regression (NER) model assumes normality which is not always the case. To handle non-normal data, researchers in statistics have developed a novel approach using exchangeable and extendible copula called the multivariate exchangeable copula (MEC) model. This study compares the performance of parametric MEC and NER models in estimating the sub-district average of per capita expenditure (PCE) in Pidie Regency, Aceh Province. This study presents PCE, which has a skewed distribution of the three-parameter skew-normal. The parametric MEC model uses a Gaussian copula from the Elliptical family and an empirical best unbiased prediction (EBUP) estimator. Meanwhile, the NER model uses an empirical best linear unbiased prediction (EBLUP) estimator. The results reveal that at a 95% confidence level, the parametric MEC model outperforms the NER model with a smaller root of mean squared error (RMSE) and provides a more precise estimate of the sub-district average of PCE. This study highlights the importance of considering the parametric MEC model as an alternative method for skewed data in unit-level SAE. The results of this study have the potential to support the achievement of Goal 1 (to end poverty) and Goal 10 (to reduce inequality) of the sustainable development goals (SDGs) by providing average PCE estimates at the sub-district level. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
18. Modelling of infant mortality in West Sumatra using generalized linear mixed model.
- Author
-
Sari, Fitri Mudia, Notodiputro, Khairil Anwar, and Sartono, Bagus
- Subjects
- *
INFANT mortality , *LOW birth weight , *SANITATION , *POOR people , *LIFE expectancy - Abstract
The infant Mortality rate (IMR) is an indicator normally used as an index of economic improvement, an indicator of the nice of existence, and the principle aspect figuring out the life expectancy of a society. IMR can be observed as longitudinal data, a combination of cross-section data and time-series data, where the same cross-section unit is observed at different times. When the data is repeatedly observed with a specific interval of time, then the time will correlate or not mutually exclusive. To overcome this, time is used as a random effect, so the appropriate method used is the Generalized Linear Mixed Model (GLMM). This study pursuits to version infant mortality data the use of the GLMM and have a look at the variables that affect the quantity of infant deaths in West Sumatra. Based on the results of the analysis, the quantity of low birth weight babies, the percentage of births assisted by non-medical personnel, the percentage of households that have access to proper sanitation services, the percentage of households that have adequate drinking water sources, the percentage of poor people, the quantity of health workers, and the quantity of health facilities has a very significant influence on infant mortality in the province of West Sumatra. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
19. Modeling the influence of climatic factors on the number of dengue hemorrahagic fever (DHF) patients in DKI Jakarta 2017-2020 using generalized linear mixed model.
- Author
-
Sundari, Marta, Notodiputro, Khairil Anwar, and Sartono, Bagus
- Subjects
- *
DENGUE , *TIME measurements , *POISSON regression , *LINEAR statistical models - Abstract
The number of DBD patients in DKI Jakarta in 2017-2020 is counted data so Poisson regression can be used to modellling the relationship between climatic factors and the number of DBD patients. However, in its application, this model has violated the overdispersion assumption so that handling is carried out using the Generalized Linear Mixed Model (GLMM) with Poisson regression and Negative Binom regression. The GLMM model was used to accommodate the random effects of measurement time and measurement location. For both models, the Autoregressive 1 (AR1) variance matrix is used because there is a strong correlation between observations and previous observations. The GLMM model with Negative Binom regression is considered the best model because it has a lower AIC value than the GLMM model AIC with Poisson regression. In this model, only the variables of average temperature per month and average humidity per month have a significant effect on the number of DBD patients in DKI Jakarta in 2017-2020 at the 5% significance level. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
20. Selection of variables in logistic linear mixed model with L1-penalty (Case study: Low birth weight in Indonesia).
- Author
-
Kurniawati, Yenni, Notodiputro, Khairil Anwar, Sartono, Bagus, Afendi, Farit M., and Raharjo, Mulianto
- Subjects
- *
LOW birth weight , *MULTIPLE birth , *GOODNESS-of-fit tests , *MOTHERS - Abstract
The generalized Mixed Linear Model is applicable in Indonesia's Low Birth Weight (LBW) case. During 2003-2028, this case happened 2.6% to 8.9% of total birth. The lowest low birth weight case was in Jambi province, and the highest was in Central Sulawesi. The forms of LBW models include census blocks as random influences, and the fixed effects used are parents' background, household socio-economic conditions, maternal conditions. The link function used was logit because the response is binary. Therefore, this research aims to estimate the GLMM of each province and then do the selection using the glmmLasso model by L1-norm. The results indicate that the results of the selection method using the glmmLasso method are simpler than the GLMM for both Jambi and Central Sulawesi provinces. This result is supported by the goodness of fit (BIC and AIC), where the BIC for glmmLasso for Jambi and Central Sulawesi provinces are 135.488 and 214.783, respectively. Equivalent to the goodness of the model, estimation of the random effect influences for the glmmLasso model is also smaller than GLMM. The variable with a significant effect on LBW in Jambi province is the mother's age, and two variables are shrunk toward zero. However, the glmmLasso LWB's model of Central Sulawesi has four variables that have a significant effect. Those variables are the father's occupation, mother's education level, level of the wealth index, and multiple births, and just one variable is reduced. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
21. Multi-input transfer function model for Covid-19 incidences in Jakarta.
- Author
-
Yohansa, Meicheil, Notodiputro, Khairil Anwar, Erfiani, Afendi, Farit M., and Raharjo, Mulianto
- Subjects
- *
TRANSFER functions , *COVID-19 , *COVID-19 pandemic , *TIME series analysis , *TASK forces - Abstract
Indonesia is currently facing the Covid-19 pandemic, which all countries in the world have also faced in the past year. The contribution of daily cases from each province is the key to the high and low number of positive cases of Covid- 19 in.Indonesia. Based on data from the Covid-19 handling task force for Indonesia, it was noted that Jakarta is the province with the highest spread of Covid-19 cases in Indonesia. Daily positive cases in Jakarta contributed 25% to national cases. For this reason, this study aims to analyze the incidence of positive cases of Covid-19 in.Jakarta through a prediction model for time series data. The multi-input transfer function model used is one of the multivariate time series models to predict Covid-19 cases in Jakarta based on several external variables. The external variables or input series used are the Covid-19 cases in Bodetabek, close contact data, and death case data. All variables used are time-series data from April 2020 to April 2021. The analysis results show that the three input variables significantly correlate in predicting Covid-19 cases in Jakarta. The multi-input transfer function model that has been formed has a pretty good performance in predicting Covid-19 cases in Jakarta with a MAPE of 10%. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
22. Modeling weekly COVID-19 new cases in Jakarta with growth curve time series models.
- Author
-
Hidayati, Putri Azizatun, Notodiputro, Khairil Anwar, Kurnia, Anang, Afendi, Farit M., and Raharjo, Mulianto
- Subjects
- *
TIME series analysis , *COVID-19 pandemic , *GROWTH curves (Statistics) , *CONFIDENCE intervals - Abstract
Growth curves are widely used in modeling the growth process. These curves can capture the pattern of the growth process to explain the characteristic of the growth process. In this research, growth curves are modelled to time series form models in the growth rate form by allowing lagged in the model. These growth time series models are applied in weekly new cases of positive COVID-19 in.Jakarta to forecast the number of new cases in the five weeks ahead in the testing data. Three schemes are made to be modelled. The growth curves that employed in this research are Logistic and Richard growth curves. Evaluations value of the Richard and Logistic models are evaluated by RMSEP of the forecasting result in testing data and RMSE of the prediction in training data to determine which model is better to fit in the weekly new cases of positive COVID-19 in.Jakarta. Time series model based on Richard growth curve seems to have better performance than Logistic growth curve in forecasting weekly new cases of COVID-19 in.Jakarta, because Richard time series models have smaller RMSE and RMSEP in almost all scheme than Logistic models. Other than that more testing data fall into the 95% confidence interval of weekly new cases in Richard time series model than Logistic time series model in all scheme. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
23. A comparison of fixed effect and mixed effect models in analyzing telecommunication products.
- Author
-
Rahmawati, Fardilla, Notodiputro, Khairil Anwar, Rahman, La Ode Abdul, Afendi, Farit M., and Raharjo, Mulianto
- Subjects
- *
TELECOMMUNICATION , *INTERNET sales , *TIME management - Abstract
Mixed model is a model that combines fixed factors and random factors while fixed model is a model that only contains fixed factors. Observations made over time with the same object being observed are called repeated measurement. This research was conducted to determine the determinant factors of internet data quota sales which are influenced by SA (Sales Area), MC (Mutual Check), PC (Product Category), and time factors using a nested linear mixed model with repeated measurement and fixed model with repeated measurement. SA, PC, and time factors as fixed factors while the MC factor nested in SA as a random factor. The results showed that in nested linear mixed model with repeated measurement, the interaction effect between three fixed factors, namely between SA, PC, and time have a significant effect on the sales volume of internet data quota. In fixed model the analysis used the average value of internet data quota sales for each MC, so there is no interaction effect between three fixed factors in the fixed model. This shows that the fixed model is simpler than the mixed model. The nested mixed model with repeated measurement can better explain the effect of MC in SA because it includes random factors, namely the MC factor nested in SA. The fixed model with repeated measurement can better explain the effect of SA because SA is a fixed factor which is the average of MC. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
24. Comparison between binomial generalized linear mixmodels (binomial GLMM) and Beta-Binomial hierarchical generalized linear model (Beta- BinomialHGLM) for modeling poverty data in West Java.
- Author
-
Rohimah, Siti Rohmah, Notodiputro, Khairil Anwar, Sartono, Bagus, Afendi, Farit M., and Raharjo, Mulianto
- Subjects
- *
FIXED effects model , *RANDOM effects model , *POOR people , *SCHOOL rankings , *RANDOM variables , *DATA modeling - Abstract
One of the problems faced by the Province of West Java is poverty. The response data used in this study is the percentage of the poor who have a non-normal distribution. Fixed effects variables consist of life expectancy, school year expectancy, and school enrollment rates aged 16-18 years. Years and clusters variables will be included as independent variables but are random effects. Therefore, for this data case, the appropriate model to be developed is Binomial GLMM and Beta-Binomial HGLM. The modeling performed on the percentage of poor population data gives relatively the same results between the model approaches using Binomial GLMM and Beta-Binomial HGLM with a random effect of years. The random effect in both models shows that there is a variation in the percentage of poor people in years, but the variabilitybetween years cannot be shown by the data. Likewise, the modeling carried out on the percentage of poor population data gives relatively the same results between the model approaches using Binomial GLMM and Beta-Binomial HGLM with clusters random effect. Fixed effects variables and random effects have a significant effect on the percentage of poor people for all models. The model obtained by involving all fixed effects and random effects (clusters) is the best model because itprovides the smallest CAIC value compared to models using random effects (Years). [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
25. Method generalized linear model and generalized linear mixed model for panel data Human Development Index (HDI) in Indonesia.
- Author
-
Adnyani, Luh Putu Widya, Notodiputro, Khairil Anwar, Sartono, Bagus, Afendi, Farit M., and Raharjo, Mulianto
- Subjects
- *
HUMAN Development Index , *PANEL analysis , *JUNIOR high school students , *ELEMENTARY school teachers , *HIGH school teachers - Abstract
The Generalized Linear Model method and the Generalized Linear Mixed Model method are methods that delete the assumption of data normality for the response variables. The two methods are compared and applied to the Human Development Index (HDI) panel data to see the best model and determine what factors affect HDI. The predictor variables used include: GRDP, ratio of elementary school teachers to elementary students, ratio of junior high school teachers to junior high school students, ratio of high school teachers to high school students, number of health workers, population and poverty ratio. The data source comes from BPS by taking three provinces classified into provinces with very high HDI represented by DKI Jakarta province, Province with high HDI represented by West Java province, and provinces with moderate HDI represented by NTB province. The results showed that neither the GLM method nor the GLMM method could be used to model HDI data. The factors that influence the HDI for the data for the provinces of Jakarta and NTB are the GRDP factor, the ratio of elementary school teachers to elementary students, the ratio of junior high school teachers to junior high school students, the number of health workers, and the total population. In the data for the province of Jakarta, an anomaly in economic theory is found which states that an increase in GRDP causes a decrease in the HDI value. This indicates that the GRDP factor for the DKI Jakarta province isnot sufficient to represent the economic dimension as one of the indicators in forming the HDI value. The factors that influence the HDI value in the province of West Java are the GRDP factor and the poverty ratio, while other factors have no significant effect on the HDI value. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
26. Generalized linear mixed models: Application for consumer price index in Indonesia.
- Author
-
Anggara, Dimas, Notodiputro, Khairil Anwar, Sartono, Bagus, Afendi, Farit M., and Raharjo, Mulianto
- Subjects
- *
CONSUMER price indexes , *PANEL analysis , *GAMMA distributions , *LINEAR statistical models - Abstract
The objective of this study is to model the Generalized Linear Mixed Model (GLMM) on the Consumer Price Index (CPI) data for 34 provinces as panel data in Indonesia. This modeling aims to predict and to see the causes of the CPI movement. This paper contains the formulation, interpretation and inference of the formed GLMM model. The modeling results show that the CPI response variable is assumed to follow the Gamma distribution with a regional random effect better than other models in this paper, this is indicated by the smaller AIC value of 70.2. This study provides a theoretical contribution to dealing with nonnormal distributed data in the CPI as panel data in Indonesia (in this case, we assume that the CPI have Gamma and Lognormal Distributed). In addition, this research is also practically useful for determining the most suitable modeling for CPI data as panel data in Indonesia. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
27. Comparison of GEE and GLMM methods for longitudinal data (Case study: Determinants of the percentage of poor people in Indonesia, 2015-2019).
- Author
-
Sihombing, Pardomuan Robinson, Notodiputro, Khairil A., and Sartono, Bagus
- Subjects
- *
POOR people , *PANEL analysis , *LONGITUDINAL method , *GENERALIZED estimating equations , *PERCENTILES , *EXPONENTIAL families (Statistics) , *FOREIGN investments - Abstract
The development model of the GLM for longitudinal data that has not normally distributed (but still in the exponential family) and correlates with response variables is the Generalized Estimating Equations (GEE) and Generalized Linear Mixed-effects Model (GLMM) models. This study compares the GEE model with the GLMM on longitudinal data in modeling poor people in Indonesia in 2015-2019. The data source used is from the publication of the Central Statistics Agency. Based on the smaller RMSE and AIC criteria, the GLMM model is better than the GEE model in modeling the percentage of poor people in Indonesia. The Gini ratio, the rate of Households in Slums, and the percentage of Informal Workers have a significant positive effect on the percentage of poor people. Meanwhile, the percentage of households having access to HDI, economic growth, domestic and foreign investment value have a significant negative impact on poor people. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
28. A study of fixed-b asymptotic distribution models for analysing determinants of drop-out rates in Central Java.
- Author
-
Setyowati, Indah Rini, Notodiputro, Khairil Anwar, Kurnia, Anang, Purnama, Budi, Nugraha, Dewanta Arya, and Anwar, Fuad
- Subjects
- *
ASYMPTOTIC distribution , *JUNIOR high school students , *HETEROSCEDASTICITY , *AUTOCORRELATION (Statistics) , *PANEL analysis - Abstract
This paper discusses fixed-b asymptotic distribution models which have been developed for panel data suffered by the problems of autocorrelation and heteroscedasticity. The fixed-b asymptotic distribution models can be used to infer the parameter of interest. For the purpose of inference the Bartlett Kernel Fixed-b Critical Values have been utilized. This technique is applied to analyze drop-out rates for the elementary and junior high school students in Central Java. The results showed that the drop-out rates in Central Java Province are significantly affected by morbidity rates, GRDP, and gross enrolment rates. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
29. Small Area Estimation of Sub-District's Per Capita Expenditure through Area Effects Selection using LASSO Method.
- Author
-
Pusponegoro, Novi Hidayat, Kurnia, Anang, Notodiputro, Khairil Anwar, Soleh, Agus Mohamad, and Astuti, Erni Tri
- Subjects
SAMPLE size (Statistics) ,PER capita - Abstract
This paper discusses a small area estimation (SAE) problem when the number of small areas is relatively small compared to size of the observations. This problem is known as a sparsity problem which can caused slow convergence in obtaining the parameter estimates. The sparsity problem on small area can be imposed by assigning zero for i- th area with adequate sample size, whereas it preserve the nonzero value for i-t h small area. The sparsity of area specific effects vector brings heavy tails if the SAE method cannot properly handle this complexity of specific area effect characteristic. Thus, the aim of this study is to investigate the sparsity issue by developing small area estimation model using the LASSO method to shrinkage the parameter estimates and select the area specific effects properly. The simulation results showed that the LASSO method produced the smallest mean square error (MSE) while the precision of the prediction were not significantly different when compared to other methods. The LASSO method was also applied to estimate the mean of per capita expenditure of sub-district levels in Kepulauan Bangka Belitung Province and produced smaller MSE when compared to other methods. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
30. Density Estimation of Neonatal Mortality Rate Using Empirical Bayes Deconvolution in Central Java Province, Indonesia.
- Author
-
Novkaniza, Fevi, Notodiputro, Khairil Anwar, Mangku, I Wayan, and Sadik, Kusman
- Subjects
NEONATAL mortality ,DEATH rate ,POISSON distribution ,NEONATAL death ,GAMMA distributions ,INFANT mortality - Abstract
This article is concerned with the density estimation of Neonatal Mortality Rate (NMR) in Central Java Province, Indonesia. Neonatal deaths contribute to 73% of infant deaths in Central Java Province. The number of neonatal deaths for 35 districts/municipalities in Central Java Province is considered as Poisson distributed surrogate with NMR as the rate of Poisson distribution. It is assumed that each number of neonatal deaths by district/municipality in Central Java Province were realizations of unobserved NMR, which come from unknown prior density. We applied the Empirical Bayes Deconvolution (EBD) method for estimating the unknown prior density of NMR based on Poisson distributed surrogate. We used secondary data from the Health Profiles of Central Java Province, Indonesia, in 2018. The density estimation of NMR by the EBD method showed that the resulting prior estimate is relatively close to the Gamma distribution based on Poisson surrogate. This is implying that the suitability of the obtained prior density estimation as a conjugate prior for Poisson distribution. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
31. Multivariate Fay-Herriot models for small area estimation with application to household consumption per capita expenditure in Indonesia.
- Author
-
Ubaidillah, Azka, Notodiputro, Khairil Anwar, Kurnia, Anang, and Mangku, I. Wayan
- Subjects
- *
PER capita , *HOUSEHOLDS , *ESTIMATES - Abstract
Multivariate Fay-Herriot (MFH) models become popular methods to produce reliable parameter estimates of some related multiple characteristics of interest that are commonly produced from many surveys. This article studies the application of MFH models for estimating household consumption per capita expenditure (HCPE) on food and HCPE of non-food. Both of those associated direct estimates, which are obtained from the National Socioeconomic Surveys conducted regularly by Statistics Indonesia, have a strong correlation. The effects of correlation in MFH models are evaluated by employing a simulation study. The simulation showed that the strength of correlation between variables of interest, instead of the number of domains, plays a prominent role in MFH models. The application showed that MFH models have more efficient than univariate models in terms of standard errors of regression parameter estimates. The roots of mean squared errors (RMSEs) of the estimates obtained from the empirical best linear unbiased prediction (EBLUP) estimators of MFH models are smaller than RMSEs obtained from the direct estimators. Based on MFH model, the HCPE estimates of food by districts in Central Java, Indonesia, are higher than the HCPE estimates of non-food. The average of HCPE estimates of food and non-food in Central Java, Indonesia in 2015 are IDR 383,100.6 and IDR 280,653.6, respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
32. A Comparative Study of Approximation Methods for Maximum Likelihood Estimation in Generalized Linear Mixed Models (GLMM).
- Author
-
Handayani, Dian, Notodiputro, Khairil Anwar, Sadik, Kusman, and Kurnia, Anang
- Subjects
- *
APPROXIMATION theory , *MAXIMUM likelihood statistics , *INTEGRALS , *LINEAR statistical models , *MEAN square algorithms - Abstract
Maximum likelihood estimates in GLMM are often difficult to be obtained since the calculation involves high dimensional integrals. It is not easy to find analytical solutions for the integral so that the approximation approach is needed. In this paper, we discuss several approximation methods to solve high dimension integrals including the Laplace, Penalized Quasi likelihood (PQL) and Adaptive Gaussian Quadrature (AGQ) approximations. The performance of these methods was evaluated through simulation studies. The 'true' parameter in the simulation was set to be similar with parameter estimates obtained by analyzing a real data, particularly salamander data (McCullagh & Nelder, 1989). The simulation results showed that the Laplace approximation produced better estimates when compared to PQL and AGQ approximations in terms of their relative biases and mean square errors. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
33. Small Area Estimation of Per Capita Expenditures Using Robust Empirical Best Linear Unbiased Prediction (REBLUP).
- Author
-
Salma, Admi, Sadik, Kusman, and Notodiputro, Khairil Anwar
- Subjects
SMALL area statistics ,PARAMETER estimation ,PREDICTION models ,OUTLIERS (Statistics) ,PER capita - Abstract
A small area is an area with small sample size to estimate parameters in survey sampling. The direct estimation will produce inaccurate estimation since the sample size is not enough to produce estimation with acceptable precision. Small Area Estimation (SAE) is a solution to obtain more precise estimation in a small area. A well-known method in SAE is an empirical best linear unbiased prediction (EBLUP). EBLUP is the estimator of small area means. It will provide an accurate estimation under normality assumptions but it can be sensitive when the data are contaminated by outliers. In this article, we discussed a resistant method in SAE, i.e. robust empirical best linear unbiased prediction (REBLUP). We apply REBLUP from unit-level models to the data obtained from the National Socio-economic Survey (SUSENAS). The means of per capita expenditures are calculated for all small areas. We compare the estimates of per capita expenditure in the small area using direct estimation, EBLUP and REBLUP methods using data that contain outliers. The result shows that REBLUP estimation has produced more accurate estimates when compared to the other methods. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
34. Locally weighted scatter‐plot smoothing for analysing temperature changes and patterns in Australia.
- Author
-
Wanishsakpong, Wandee and Notodiputro, Khairil Anwar
- Subjects
- *
GLOBAL temperature changes , *METEOROLOGY , *REGRESSION analysis , *CLUSTER analysis (Statistics) , *TEMPERATURE measurements - Abstract
ABSTRACT: The mean maximum monthly temperature data were recorded at 112 stations in Australia. The data (1990–2015) were downloaded from the Australian Bureau of Meteorology (BOM) website. Missing values were imputed using regression models based on information from the nearest stations, as well as the time periods. The data were deseasonalized to remove seasonal variations and then cluster analysis techniques were used to group the stations into six clusters. For each cluster, locally weighted scatter‐plot smoothing (LOESS) and double exponential smoothing (DES) were used to analyse temperature changes and patterns. The results showed that LOESS produced better fits as well as smoother curves compared with the DES. The trends in temperature were increasing in all clusters, whereas the patterns showed periodicity of the temperatures. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
35. A Study Of Area Clustering Using Factor Analysis in Small Area Estimation (An Analysis of Per Capita Expenditures of Subdistricts Level in Regency and Municipality of Bogor).
- Author
-
Wahyudi, Notodiputro, Khairil Anwar, Kurnia, Anang, and Anisa, Rahma
- Subjects
- *
CLUSTER analysis (Statistics) , *FACTOR analysis , *SMALL area statistics , *PER capita , *LINEAR statistical models , *MATHEMATICAL variables - Abstract
Empirical Best Linear Unbiased Prediction (EBLUP) is one of indirect estimating methods which used to estimate parameters of small areas. EBLUP methods works in using auxiliary variables of area while adding the area random effects. In estimating non-sampled area, the standard EBLUP can no longer be used due to no information of area random effects. To obtain more proper estimation methods for non sampled area, the standard EBLUP model has to be modified by adding cluster information. The aim of this research was to study clustering methods using factor analysis by means of simulation, provide better cluster information. The criteria used to evaluate the goodness of fit of the methods in the simulation study were the mean percentage of clustering accuracy. The results of the simulation study showed the use of factor analysis in clustering has increased the average percentage of accuracy particularly when using Ward method. The method was taken into account to estimate the per capita expenditures based on Small Area Estimation (SAE) techniques. The method was eventually used to estimate the per capita expenditures from SUSENAS and the quality of the estimates was measured by RMSE. This research has shown that the standard-modified EBLUP model provided with factor analysis better estimates when compared with standard EBLUP model and the standard-modified EBLUP without the factor analysis. Moreover, it was also shown that the clustering information is important in estimating non sampled area. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
36. Estimation of Unemployment Rates Using Small Area Estimation Model by Combining Time Series and Cross-Sectional Data.
- Author
-
Muchlisoh, Siti, Kurnia, Anang, Notodiputro, Khairil Anwar, and Mangku, I. Wayan
- Subjects
ESTIMATION theory ,UNEMPLOYMENT statistics ,TIME series analysis ,CROSS-sectional method ,LABOR supply statistics - Abstract
Labor force surveys conducted over time by the rotating panel design have been carried out in many countries, including Indonesia. Labor force survey in Indonesia is regularly conducted by Statistics Indonesia (Badan Pusat Statistik- BPS) and has been known as the National Labor Force Survey (Sakernas). The main purpose of Sakernas is to obtain information about unemployment rates and its changes over time. Sakernas is a quarterly survey. The quarterly survey is designed only for estimating the parameters at the provincial level. The quarterly unemployment rate published by BPS (official statistics) is calculated based on only cross-sectional methods, despite the fact that the data is collected under rotating panel design. The study purpose to estimate a quarterly unemployment rate at the district level used small area estimation (SAE) model by combining time series and cross-sectional data. The study focused on the application and comparison between the Rao-Yu model and dynamic model in context estimating the unemployment rate based on a rotating panel survey. The goodness of fit of both models was almost similar. Both models produced an almost similar estimation and better than direct estimation, but the dynamic model was more capable than the Rao-Yu model to capture a heterogeneity across area, although it was reduced over time. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
37. Trend and pattern classification of surface air emperature change in the Arctic region.
- Author
-
Wanishsakpong, Wandee, McNeil, Nittaya, and Notodiputro, Khairil A.
- Subjects
EARTH temperature ,CLIMATE change research ,TIME series analysis ,REGRESSION analysis - Abstract
Monthly seasonally adjusted temperatures above latitude 45ºN were investigated from January 1973 to November 2013. The study area was divided into 69 sub-regions of similar size each in the shape of an igloo brick. The data were filtered with a second-order autoregressive process to remove autocorrelation. Two sub-regions did not have sufficient data due to substantial numbers of missing values. Factor analysis was then applied to the remaining 67 sub-regions and was used to classify regions with similar temperature changes. As a result, 63 sub-regions could be classified based on 12 factors but 4 sub-regions could not be grouped due to uniqueness. The temperatures for each group of sub-regions were found to increase during 1973-2013. The largest temperature increases of 0.19 ºC/decade were found in northern and southern Siberia and part of the Arctic Ocean. In northern Canada, Alaska, the northern Pacific Ocean and eastern Siberia the temperatures increased by at least 0.16 ºC/decade. In Iceland, Norway, Sweden and part of the Pacific and Arctic Oceans the temperature increased by around 0.15 ºC/decade. In northeastern Canada, Greenland and its surrounding Atlantic Ocean and the Arctic Ocean the temperature increased by about 0.15 ºC/decade. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
38. Nested generalized linear mixed model with ordinal response: Simulation and application on poverty data in Java Island.
- Author
-
Widyaningsih, Yekti, Saefuddin, Asep, Notodiputro, Khairil A., and Wigena, Aji H.
- Subjects
JAVA programming language ,GENERALIZATION ,LINEAR systems ,MATHEMATICAL models ,SIMULATION methods & models ,PARAMETER estimation ,ALGORITHMS - Abstract
The objective of this research is to build a nested generalized linear mixed model using an ordinal response variable with some covariates. There are three main jobs in this paper, i.e. parameters estimation procedure, simulation, and implementation of the model for the real data. At the part of parameters estimation procedure, concepts of threshold, nested random effect, and computational algorithm are described. The simulations data are built for 3 conditions to know the effect of different parameter values of random effect distributions. The last job is the implementation of the model for the data about poverty in 9 districts of Java Island. The districts are Kuningan, Karawang, and Majalengka chose randomly in West Java; Temanggung, Boyolali, and Cilacap from Central Java; and Blitar, Ngawi, and Jember from East Java. The covariates in this model are province, number of bad nutrition cases, number of farmer families, and number of health personnel. In this modeling, all covariates are grouped as ordinal scale. Unit observation in this research is sub-district (kecamatan) nested in district, and districts (kabupaten) are nested in province. For the result of simulation, ARB (Absolute Relative Bias) and RRMSE (Relative Root of mean square errors) scale is used. They show that prov parameters have the highest bias, but more stable RRMSE in all conditions. The simulation design needs to be improved by adding other condition, such as higher correlation between covariates. Furthermore, as the result of the model implementation for the data, only number of farmer family and number of medical personnel have significant contributions to the level of poverty in Central Java and East Java province, and only district 2 (Karawang) of province 1 (West Java) has different random effect from the others. The source of the data is PODES (Potensi Desa) 2008 from BPS (Badan Pusat Statistik). [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
39. Estimating the Variance of Estimator of the Latent Factor Linear Mixed Model Using Supplemented Expectation-Maximization Algorithm.
- Author
-
Angraini, Yenni, Notodiputro, Khairil Anwar, Folmer, Henk, Saefuddin, Asep, and Toharudin, Toni
- Subjects
- *
ALGORITHMS , *EXPECTATION-maximization algorithms , *POLITICAL attitudes , *GAUSSIAN distribution - Abstract
This paper deals with symmetrical data that can be modelled based on Gaussian distribution, such as linear mixed models for longitudinal data. The latent factor linear mixed model (LFLMM) is a method generally used for analysing changes in high-dimensional longitudinal data. It is usual that the model estimates are based on the expectation-maximization (EM) algorithm, but unfortunately, the algorithm does not produce the standard errors of the regression coefficients, which then hampers testing procedures. To fill in the gap, the Supplemented EM (SEM) algorithm for the case of fixed variables is proposed in this paper. The computational aspects of the SEM algorithm have been investigated by means of simulation. We also calculate the variance matrix of beta using the second moment as a benchmark to compare with the asymptotic variance matrix of beta of SEM. Both the second moment and SEM produce symmetrical results, the variance estimates of beta are getting smaller when number of subjects in the simulation increases. In addition, the practical usefulness of this work was illustrated using real data on political attitudes and behaviour in Flanders-Belgium. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
40. The use of small scale prototypes in image reconstruction from projections.
- Author
-
Barnett, Glen, Crowe, Susan, Hudson, Malcolm, Leung, Pui-Lam, Notodiputro, Khairil, Proudfoot, Richard, and Sims, John
- Abstract
We introduce a class of small scale simulation models—‘prototypes'—which reproduce many of the known properties of maximum likelihood and related reconstruction methods used in emission tomography, and greatly simplify the development of new methods. We introduce an iterative Fisher-scoring algorithm and demonstrate, by use of the prototype models, its superior speed of convergence when compared with the standard EM algorithm. [ABSTRACT FROM PUBLISHER]
- Published
- 1989
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.