资源论文Fair and Diverse DPP-Based Data Summarization

Fair and Diverse DPP-Based Data Summarization

2020-03-16 | |  80 |   44 |   0

Abstract

Sampling methods that choose a subset of the data proportional to its diversity in the feature space are popular for data summarization. However, recent studies have noted the occurrence of bias – e.g., under or over representation of a particular gender or ethnicity – in such data sum marization methods. In this paper we initiate a study of the problem of outputting a diverse and fair summary of a given dataset. We work with a well-studied determinantal measure of diversity and corresponding distributions (DPPs) and present a framework that allows us to incorporate a general class of fairness constraints into such distributions. Designing efficient algorithm to sample from these constrained determinantal distributions, however, suffers from a complexity barrier; we present a fast sampler that is provab good when the input vectors satisfy a natural pro erty. Our empirical results on both real-world an synthetic datasets show that the diversity of the samples produced by adding fairness constraints is not too far from the unconstrained case.

上一篇:Online Learning with Abstention

下一篇:Minimal I-MAP MCMC for Scalable Structure Discovery in Causal DAG Models

用户评价
全部评价

热门资源

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to learn...

    The move from hand-designed features to learned...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...