A study on the effect of imbalanced data in tourism recommendation models
dc.contributor.author | Fernández Muñoz, Juan José | |
dc.contributor.author | Moguerza, Javier | |
dc.contributor.author | Martín Duque, Clara | |
dc.contributor.author | Gómez Bruna, Diana | |
dc.date.accessioned | 2024-02-07T09:16:06Z | |
dc.date.available | 2024-02-07T09:16:06Z | |
dc.date.issued | 2019 | |
dc.description.abstract | Purpose – This paper studies the effect of imbalanced data in tourism quality models. It demonstrates that this imbalance strongly affects the accuracy of tourism prediction models for hotel recommendation. Design/methodology/approach – A questionnaire was used to survey 83,740 clients of hotels ranging from five stars to two stars or fewer, and the responses were analysed with a binary logistic model. The data correspond to a sample of 87 hotels from all around the world (120 countries from America, Africa, Asia, Europe and Australia). Findings – The results suggest that the imbalance in the data affects the prediction accuracy of the models used, especially for the recommendation provided by unsatisfied clients, who tend to be classified as satisfied customers. Practical implications – Special attention should be given to unsatisfied clients or, at least, the models should include safeguards against the effect of data imbalance. Social implications – In the tourism industry, the strong imbalance between satisfied and unsatisfied customers produces misleading prediction results. This could affect the quality policy of hoteliers. Originality/value – Focusing on tourism data, this work shows that the imbalance strongly affects the prediction accuracy of the models used, especially for the recommendation provided by unsatisfied customers, who tend to be classified as satisfied customers. A methodological approach based on balancing the data set used to build the models is proposed to improve the accuracy of the prediction for unsatisfied customers provided by traditional service quality models. | en |
dc.description.department | Depto. de Ciencia Política y de la Administración | |
dc.description.department | Depto. de Organización de Empresas | |
dc.description.faculty | Fac. de Comercio y Turismo | |
dc.description.refereed | TRUE | |
dc.description.status | pub | |
dc.identifier.citation | Fernández-Muñoz JJ, Moguerza JM, Martín Duque C, Gómez Bruna D. A study on the effect of imbalanced data in tourism recommendation models. International Journal of Quality and Service Sciences. 2019;11(3):346-56. | |
dc.identifier.doi | 10.1108/IJQSS-05-2018-0050 | |
dc.identifier.issn | 1756-669X | |
dc.identifier.officialurl | https://www.doi.org/10.1108/IJQSS-05-2018-0050 | |
dc.identifier.relatedurl | https://www.emerald.com/insight/publication/issn/1756-669X | |
dc.identifier.uri | https://hdl.handle.net/20.500.14352/99796 | |
dc.issue.number | 3 | |
dc.journal.title | International Journal of Quality and Service Sciences | |
dc.language.iso | eng | |
dc.page.final | 356 | |
dc.page.initial | 346 | |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | en |
dc.rights.accessRights | metadata only access | |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | |
dc.subject.cdu | 640.41 | |
dc.subject.keyword | Hotels | |
dc.subject.keyword | Data | |
dc.subject.keyword | Sampling | |
dc.subject.keyword | Quality perception | |
dc.subject.keyword | Quality management | |
dc.subject.ucm | Ciencias Sociales | |
dc.subject.ucm | Política | |
dc.subject.ucm | Turismo | |
dc.subject.ucm | Economía | |
dc.subject.unesco | 59 Ciencia Política | |
dc.subject.unesco | 5902.99 Otras | |
dc.subject.unesco | 5311 Organización y Dirección de Empresas | |
dc.subject.unesco | 5312.90 Economía Sectorial: Turismo | |
dc.title | A study on the effect of imbalanced data in tourism recommendation models | en |
dc.title.alternative | Estudio sobre el efecto de los datos desequilibrados en los modelos de recomendación turística | es |
dc.type | journal article | |
dc.volume.number | 11 | |
dspace.entity.type | Publication | |
relation.isAuthorOfPublication | 8158fa42-c840-4dbe-afce-82f82006c738 | |
relation.isAuthorOfPublication | 0202e3fb-9573-44c9-85f3-ee01d9c19c1a | |
relation.isAuthorOfPublication.latestForDiscovery | 0202e3fb-9573-44c9-85f3-ee01d9c19c1a |