Fair and Diverse DPP-Based Data Summarization

资源分类

2020-03-16 |

80 |

44 |

Abstract

Sampling methods that choose a subset of the data proportional to its diversity in the feature space are popular for data summarization. However, recent studies have noted the occurrence of bias – e.g., under or over representation of a particular gender or ethnicity – in such data sum marization methods. In this paper we initiate a study of the problem of outputting a diverse and fair summary of a given dataset. We work with a well-studied determinantal measure of diversity and corresponding distributions (DPPs) and present a framework that allows us to incorporate a general class of fairness constraints into such distributions. Designing efficient algorithm to sample from these constrained determinantal distributions, however, suffers from a complexity barrier; we present a fast sampler that is provab good when the input vectors satisfy a natural pro erty. Our empirical results on both real-world an synthetic datasets show that the diversity of the samples produced by adding fairness constraints is not too far from the unconstrained case.

上一篇：Online Learning with Abstention

下一篇：Minimal I-MAP MCMC for Scalable Structure Discovery in Causal DAG Models

用户评价

全部评价

还没有评论，说两句吧！

热门资源

Stratified Strate...

In this paper we introduce Stratified Strategy ...
The Variational S...

Unlike traditional images which do not offer in...
Learning to learn...

The move from hand-designed features to learned...
A Mathematical Mo...

Direct democracy, where each voter casts one vo...
Learning to Predi...

Much of model-based reinforcement learning invo...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com