Back to Search Start Over

Automatic annotation of protected attributes to support fairness optimization

Authors :
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos
Universidad de Alicante. Instituto Universitario de Investigación Informática
Consuegra-Ayala, Juan Pablo
Gutiérrez, Yoan
Almeida-Cruz, Yudivian
Palomar, Manuel
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos
Universidad de Alicante. Instituto Universitario de Investigación Informática
Consuegra-Ayala, Juan Pablo
Gutiérrez, Yoan
Almeida-Cruz, Yudivian
Palomar, Manuel
Publication Year :
2024

Abstract

Recent research has shown that the unaware automation of high-risk decision-making tasks can result in unfair decisions being made. The most common approaches to address this problem adopt definitions of fairness based on protected attributes. Precise annotation of protected attributes enables the application of bias mitigation techniques to commonly unlabeled kinds of data (e.g., images, text, etc.). This paper proposes a framework to automatically annotate protected attributes in data collections. The framework focuses on providing a single interface to annotate protected attributes of different types (e.g., gender, race, etc.) and from different kinds of data. Internally, the framework coordinates multiple sensors to produce the final annotation. Several sensors for textual data are proposed. An optimization search technique is designed to tune the framework to specific domains. Additionally, a small dataset of movie reviews —annotated with gender and sentiment— was created. The evaluation in datasets of texts from diverse domains shows the quality of the annotations and their effectiveness to be used as a proxy to estimate fairness in datasets and machine learning models. The source code is available online for the research community.

Details

Database :
OAIster
Publication Type :
Electronic Resource
Accession number :
edsoai.on1427412544
Document Type :
Electronic Resource