Model Selection for independent not identically distributed observations based on Rényi's pseudodistances

dc.contributor.author: Felipe Ortega, Ángel
dc.contributor.author: Jaenada Malagón, María
dc.contributor.author: Miranda Menéndez, Pedro
dc.contributor.author: Pardo Llorente, Leandro
dc.date.accessioned: 2023-06-22T12:52:32Z
dc.date.available: 2023-06-22T12:52:32Z
dc.date.issued: 2023-04-11
dc.description.abstract: Model selection criteria are rules used to select the best statistical model among a set of candidate models, striking a trade-off between goodness of fit and model complexity. The most popular model selection criteria measure goodness of fit through the model log-likelihood function, which yields non-robust criteria. This paper presents a new family of robust model selection criteria for independent but not identically distributed observations (i.n.i.d.o.) based on Rényi's pseudodistance (RP). The RP-based model selection criterion is indexed by a tuning parameter α that controls the trade-off between efficiency and robustness. Some theoretical results about the RP criterion are derived, and the theory is applied to the multiple linear regression model, yielding explicit expressions for the model selection criterion. Moreover, restricted models are considered, and explicit expressions are derived for nested models under the multiple linear regression model. Finally, a simulation study empirically illustrates the robustness advantage of the method.
dc.description.department: Depto. de Estadística e Investigación Operativa
dc.description.faculty: Fac. de Ciencias Matemáticas
dc.description.refereed: TRUE
dc.description.sponsorship: Ministerio de Ciencia e Innovación (España)
dc.description.status: pub
dc.eprint.id: https://eprints.ucm.es/id/eprint/77633
dc.identifier.citation: Felipe A, Jaenada M, Miranda P, Pardo L. Model Selection for independent not identically distributed observations based on Rényi’s pseudodistances. Journal of Computational and Applied Mathematics 2024; 440: 115630. [DOI: 10.1016/j.cam.2023.115630]
dc.identifier.officialurl: https://doi.org/10.1016/j.cam.2023.11563
dc.identifier.uri: https://hdl.handle.net/20.500.14352/73280
dc.language.iso: eng
dc.relation.projectID: PID2021-124933NB-I00
dc.rights.accessRights: open access
dc.rights.uri: http://creativecommons.org/licenses/by-nc/4.0/
dc.subject.cdu: 519.22
dc.subject.keyword: Rényi’s pseudodistance
dc.subject.keyword: Robustness
dc.subject.keyword: Restricted model
dc.subject.keyword: Multiple linear regression model
dc.subject.ucm: Mathematical statistics (Mathematics)
dc.subject.unesco: 1209 Statistics
dc.title: Model Selection for independent not identically distributed observations based on Rényi's pseudodistances
dc.type: journal article
dspace.entity.type: Publication

Download

Original bundle

Name: felipe_model.pdf
Size: 273.46 KB
Format: Adobe Portable Document Format

Name: Model_Selection_Vers_Publicada.pdf
Size: 523.58 KB
Format: Adobe Portable Document Format
