A fair-multicluster approach to clustering of categorical data

Santos Mangudo, Carlos; Heras Martínez, Antonio José

A fair-multicluster approach to clustering of categorical data

Download

s10100-022-00824-2.pdf (516.43 KB)

Official URL

https://doi.org/10.1007/s10100-022-00824-2

Publication date

2022

Authors

Santos Mangudo, Carlos

Heras Martínez, Antonio José

Publisher

Springer Nature

Citations

Exportar

URI

https://hdl.handle.net/20.500.14352/72690

Abstract

In the last few years, the need of preventing classification biases due to race, gender, social status, etc. has increased the interest in designing fair clustering algorithms. The main idea is to ensure that the output of a cluster algorithm is not biased towards or against specific subgroups of the population. There is a growing specialized literature on this topic, dealing with the problem of clustering numerical data bases. Nevertheless, to our knowledge, there are no previous papers devoted to the problem of fair clustering of pure categorical attributes. In this paper, we show that the Multicluster methodology proposed by Santos and Heras (Interdiscip J Inf Knowl Manag 15:227–246, 2020. https://doi.org/10.28945/4643) for clustering categorical data, can be modified in order to increase the fairness of the clusters. Of course, there is a tradeoff between fairness and efficiency, so that an increase in the fairness objective usually leads to a loss of classification efficiency. Yet it is possible to reach a reasonable compromise between these goals, since the methodology proposed by Santos and Heras (2020) can be easily adapted in order to get homogeneous and fair clusters.

Description

CRUE-CSIC (Acuerdos Transformativos 2022)

UCM subjects

Estadística

Unesco subjects

1209 Estadística

Collections

Artículos

Full item page

A fair-multicluster approach to clustering of categorical data

Download

Official URL

Full text at PDC

Publication date

Authors

Advisors (or tutors)

Editors

Journal Title

Journal ISSN

Volume Title

Publisher

Citations

Exportar

URI

Citation

Abstract

Research Projects

Organizational Units

Journal Issue

Description

UCM subjects

Unesco subjects

Keywords

Collections