Published

2017-01-16

Robust mixture regression based on the skew t distribution

Mixtura robusta de modelos de regresión basada en la distribución t asimétrica

Keywords:

Mixture regression models, robust regression, maximum likelihood, EM algorithm, skew t distribution (en)
Algoritmo EM, máxima verosimilitud, mixtura de regresiones, distribución t asimétrica. (es)

Downloads

Authors

  • Fatma Zehra Doğru Ankara University
  • Olcay Arslan Ankara University
In this study, we propose a robust mixture regression procedure based on the skew t distribution to model heavy-tailed and/or skewed errors in a mixture regression setting. Using the scale mixture representation of the skew  t distribution, we give an Expectation Maximization (EM) algorithm to compute the maximum likelihood (ML) estimates for the paramaters of interest. The performance of proposed estimators is demonstrated by a simulation study and a real data example.

En este estudio se explora una mixtura robusta de modelos de regresión basada en la distribución t asimétrica, con el propósito de modelar colas pesadas o asimétricas en los errores, en un escenario de mixtura de regresiones. Se usa un algoritmo EM para obtener los estimadores máximo verosímiles empleando una mixtura de escala de la distribución t asimétrica. El comportamiento de los estimadores propuestos se ilustra a través de une estudio de simulación y de un ejemplo con datos reales.

References

Akaike, H. (1973), Information theory and an extension of the maximum likelihood principle, in B. N. Petrov & F. Caski, eds, ‘Proceeding of the Second International Symposium on Information Theory’, Akademiai Kiado, Budapest, pp. 267–281.

Azzalini, A. (1986), ‘Further results on a class of distributions which includes the normal ones’, Statistica 46, 199–208.

Azzalini, A. & Capitaino, A. (2003), ‘Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t distribution’, Journal of the Royal Statistical Society: Series B 65, 367–389.

Bai, X. (2010), Robust mixture of regression models, Master’s thesis, Kansas State University.

Bai, X., Yao, W. & Boyer, J. E. (2012), ‘Robust fitting of mixture regression models’, Computational Statistics and Data Analysis 56, 2347–2359.

Basford, K. E., Greenway, D. R., McLachlan, G. J. & Peel, D. (1997), ‘Standard errors of fitted means under normal mixture’, Computational Statistics 12, 1– 17.

Bashir, S. & Carter, E. (2012), ‘Robust mixture of linear regression models’, Communications in Statistics-Theory and Methods 41, 3371–3388.

Bozdogan, H. (1993), Choosing the number of component clusters in the mixture model using a new informational complexity criterion of the inverse-fisher information matrix, in ‘Information and Classification’, Springer Berlin Heidelberg, pp. 40–54.

Cohen, A. C. (1984), ‘Some effects of inharmonic partials on interval perception’, Music Perception 1, 323–349.

Dempster, A. P., Laird, N. M. & Rubin, D. B. (1977), ‘Maximum likelihood from incomplete data via the E-M algorithm’, Journal of the Royal Statistical Society: Series B 39, 1–38.

Dias, J. G. & Wedel, M. (2004), ‘An empirical comparison of em, sem and mcmc performance for problematic gaussian mixture likelihoods’, Statistics and Computing 14, 323–332.

Dogru, F. Z. (2015), Robust Parameter Estimation in Mixture Regression Models, PhD thesis, Ankara University.

Dogru, F. Z. & Arslan, O. (2014), Robust mixture regression modelling based on the skew t distribution, in ‘International Conference on Robust Statistics (ICORS14)’, Martin-Luther-University Halle-Wittenberg/Germany.

Gupta, A. (2003), ‘Multivariate skew t distribution’, Statistics 37, 359–363.

Fatma Zehra Dogru & Olcay Arslan Gupta, A., Chang, F. & Huang, W. (2002), ‘Some skew symmetric models’, Random

Henning, C. (2013), fpc: Flexible procedure for clustering. R Package Version 2.1-5.

Henze, N. (1986), ‘A probabilistic representation of the skew-normal distribution’, Scandinavian Journal of Statistics 13, 271–275.

Lange, K. L., Little, J. A. & Taylor, M. G. J. (1989), ‘Robust statistical modeling using the t distribution’, Journal of the American Statistical Association 84, 881–896.

Lin, T. I., Lee, J. C. & Hsieh, W. J. (2007), ‘Robust mixture modeling using the skew t distribution’, Statistics and Computing 17, 81–92.

Liu, M. & Lin, T. I. (2014), ‘A skew-normal mixture regression model’, Educational and Psychological Measurement 74(1), 139–162.

Lucas, A. (1997), ‘Robustness of the student t based m-estimator’, Communications in Statistics: Theory and Methods 26, 1165–1182.

Markatou, M. (2000), ‘Mixture models, robustness, and the weighted likelihood methodology’, Biometrics 56, 483–486.

Peel, D. & McLachlan, G. J. (2000), ‘Robust mixture modelling using the t distribution’, Statistics and Computing 10(4), 339–348.

Pereira, J. R., Marques, L. A. & da Costa, J. M. (2012), ‘An empirical comparison of EM initialization methods and model choice criteria for mixtures of skew normal distributions’, Revista Colombiana de Estadistica 35(3), 457–478.

Quandt, R. E. (1972), ‘A new approach to estimating switching regressions’, Journal of the American Statistical Association 67, 306–310.

Quandt, R. E. & Ramsey, J. B. (1978), ‘Estimating mixtures of normal distributions and switching regressions’, Journal of the American Statistical Association 73, 730–752.

Schwarz, G. (1978), ‘Estimating the dimension of a model’, Annals of Statistics 6(2), 461–464.

Shen, H., Yang, J. & Wang, S. (2004), Outlier detecting in fuzzy switching regression models, in ‘International Conference on Artificial Intelligence: Methodology, Systems, and Applications’, Springer, pp. 208–215.

Song, W., Yao, W. & Xing, Y. (2014), ‘Robust mixture regression model fitting by laplace distribution’, Computational Statistics and Data Analysis 71, 128– 137.

Wei, Y. (2012), Robust mixture regression models using t-distribution, Master’s thesis, Kansas State University.

Yao, W., Wei, Y. & Yu, C. (2014), ‘Robust mixture regression using the tdistribution’, Computational Statistics and Data Analysis 71, 116–127.

Zeller, C. B., Cabral, C. R. B. & Lachos, V. H. (2016), ‘Robust mixture regression modeling based on scale mixtures of skew normal distributions’, Test 25, 375–396.

Zhang, J. (2013), Robust mixture regression modeling with pearson type vii distribution, Master’s thesis, Kansas State University.