Proxy non-discrimination in data-driven systems

Authors

Anupam Datta, Matt Fredrikson, Gihyuk Ko, Piotr Mardziel, Shayak Sen

Publication date

2017/7/25

Journal

arXiv preprint arXiv:1707.08120

Description

Machine-learnt systems inherit biases against protected classes (historically disparaged groups) from training data. Usually, these biases are not explicit; they rely on subtle correlations discovered by training algorithms and are therefore difficult to detect. We formalize proxy discrimination in data-driven systems, a class of properties indicative of bias, as the presence of protected class correlates that have causal influence on the system's output. We evaluate an implementation on a corpus of social datasets, demonstrating how to validate systems against these properties and to repair violations where they occur.
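The two-part condition in the description (a correlate of the protected class that also has causal influence on the output) can be illustrated with a minimal Python sketch. This is not the authors' implementation: it considers only raw input features as candidate proxies, uses absolute Pearson correlation as a stand-in association measure, and estimates influence by a simple intervention that swaps feature values between rows. The function names, the thresholds `eps` and `delta`, and the toy data are all hypothetical.

```python
import random

def association(xs, zs):
    """Absolute Pearson correlation (a stand-in association measure)."""
    n = len(xs)
    mx, mz = sum(xs) / n, sum(zs) / n
    cov = sum((x - mx) * (z - mz) for x, z in zip(xs, zs))
    vx = sum((x - mx) ** 2 for x in xs)
    vz = sum((z - mz) ** 2 for z in zs)
    if vx == 0 or vz == 0:
        return 0.0
    return abs(cov) / (vx * vz) ** 0.5

def influence(model, rows, feature, rng):
    """Fraction of rows whose output changes when `feature` is replaced
    by a value taken from a randomly chosen other row (an intervention)."""
    donors = [r[feature] for r in rows]
    rng.shuffle(donors)
    changed = 0
    for row, value in zip(rows, donors):
        intervened = {**row, feature: value}
        changed += model(row) != model(intervened)
    return changed / len(rows)

def flags_proxy(model, rows, feature, protected, rng, eps=0.5, delta=0.1):
    """Hypothetical check: the feature is flagged only if it is both
    associated with the protected attribute and influential on the output."""
    assoc = association([r[feature] for r in rows],
                        [r[protected] for r in rows])
    infl = influence(model, rows, feature, rng)
    return assoc >= eps and infl >= delta

# Toy data (invented): zip_group agrees with the protected attribute on
# 90% of rows, while income is nearly independent of it.
rows = []
for i in range(200):
    group = i % 2
    zip_group = 1 - group if i % 10 == 0 else group
    rows.append({"protected": group,
                 "zip_group": zip_group,
                 "income": (i * 37) % 100})

# Neither model reads the protected attribute directly.
model_zip = lambda r: r["zip_group"] == 0    # decides on the correlate
model_income = lambda r: r["income"] >= 50   # ignores the correlate

print(flags_proxy(model_zip, rows, "zip_group", "protected", random.Random(0)))
print(flags_proxy(model_income, rows, "zip_group", "protected", random.Random(0)))
```

In this sketch a correlated feature that the model never uses is not flagged, because intervening on it cannot change the output. Correlation alone does not satisfy the condition.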

Total citations

Cited by 67

Citations per year: 2017: 2; 2018: 9; 2019: 10; 2020: 14; 2021: 11; 2022: 9; 2023: 9; 2024: 2
