Estandarización de la Imputación en la Encuesta de
Transporte de Viajeros
Loading...
Official URL
Full text at PDC
Publication date
2021
Authors
Advisors (or tutors)
Editors
Journal Title
Journal ISSN
Volume Title
Publisher
Citation
Abstract
La imputación consiste en estimar los valores perdidos o missings recurriendo a otros datos aportados por la unidad o a datos de otras unidades semejantes. Su importancia radica en el hecho de que aumenta la calidad de las estimaciones y se confirma por ser una fase incluida en el estándar de estadística oficial Generic Statistical Business Process Model (GSBPM). Este estándar nace con la necesidad de estandarizar los procesos estadísticos entre organismos del mismo país y de diferentes países. Teniendo esto en cuenta, el objetivo de este Trabajo de Fin de Máster (TFM) es usar la Encuesta de Transporte de Viajeros (publicada mensualmente por el Instituto Nacional de Estadística) para desarrollar un sistema estandarizado de imputación que sustituya al que se emplea actualmente y que pueda ser aplicable a otras operaciones estadísticas. Para ello, se ha recurrido a dos fases: una de clasificación de unidades en imputables y no imputables mediante el paquete ranger del software libre R y otra de imputación propiamente dicha con el paquete simputation del mismo software.
The imputation process consists on estimating missing values using other data provided by the unit or data by other similar units. Its importance lies in the fact that it improves the quality of the estimations and is confirmed due to the inclusion into the official statistical standard Generic Statistical Business Process Model (GSBPM). This standard was born because of the need of standardize statistical processes among agencies in the same country and in different countries. Taking this into account, the goal of this project is to use the Traveler’s Transport Survey (published monthly by INE Spain) to develop an imputation system to replace the current one and that can be applicable to other statistical operations. Two phases have been used for this purpose: a classification of imputable and non-imputable units by means of the ranger package of the free software R and another phase of imputation with the simputation package of the mentioned software.
The imputation process consists on estimating missing values using other data provided by the unit or data by other similar units. Its importance lies in the fact that it improves the quality of the estimations and is confirmed due to the inclusion into the official statistical standard Generic Statistical Business Process Model (GSBPM). This standard was born because of the need of standardize statistical processes among agencies in the same country and in different countries. Taking this into account, the goal of this project is to use the Traveler’s Transport Survey (published monthly by INE Spain) to develop an imputation system to replace the current one and that can be applicable to other statistical operations. Two phases have been used for this purpose: a classification of imputable and non-imputable units by means of the ranger package of the free software R and another phase of imputation with the simputation package of the mentioned software.
Description
Calificación: 9.5