The Everlasting Database: Statistical Validity at a Fair Price

2020-02-17

Abstract 

The problem of handling adaptivity in data analysis, intentional or not, permeates a variety of fields, including test-set overfitting in ML challenges and the accumulation of invalid scientific discoveries. We propose a mechanism for answering an arbitrarily long sequence of potentially adaptive statistical queries, by charging a price for each query and using the proceeds to collect additional samples. Crucially, we guarantee statistical validity without any assumptions on how the queries are generated. We also ensure with high probability that the cost for M non-adaptive queries is [formula missing in source], while the cost to a potentially adaptive user who makes M queries that do not depend on any others is [formula missing in source].
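
The core idea in the abstract, charging for each query and spending the proceeds on additional samples, can be illustrated with a toy sketch. This is not the paper's mechanism: the class name `PaidQueryDatabase`, the fixed `price_per_query`, the `cost_per_sample` parameter, and the rule of pooling all samples into one running average are assumptions made purely for illustration, and the sketch carries none of the statistical validity guarantees the abstract claims.

```python
import random
from typing import Callable, List


class PaidQueryDatabase:
    """Hypothetical toy: answers mean-value queries, charges a fixed price
    per query, and spends the accumulated proceeds on fresh samples.

    Assumption: a flat price and a single growing sample pool. The paper's
    actual pricing and sample management are not reproduced here.
    """

    def __init__(
        self,
        sampler: Callable[[], float],
        price_per_query: float = 1.0,
        cost_per_sample: float = 0.25,
        initial_samples: int = 100,
    ) -> None:
        self.sampler = sampler
        self.price_per_query = price_per_query
        self.cost_per_sample = cost_per_sample
        self.samples: List[float] = [sampler() for _ in range(initial_samples)]
        self.funds = 0.0    # money not yet spent on samples
        self.revenue = 0.0  # total amount charged so far

    def query(self, q: Callable[[float], float]) -> float:
        """Charge for the query, buy new samples, return the empirical mean of q."""
        self.funds += self.price_per_query
        self.revenue += self.price_per_query

        # Spend whatever the current funds allow on additional samples.
        n_new = int(self.funds // self.cost_per_sample)
        self.funds -= n_new * self.cost_per_sample
        self.samples.extend(self.sampler() for _ in range(n_new))

        return sum(q(x) for x in self.samples) / len(self.samples)


if __name__ == "__main__":
    rng = random.Random(0)
    db = PaidQueryDatabase(sampler=rng.random)
    answer = db.query(lambda x: 1.0 if x > 0.5 else 0.0)
    print(f"answer={answer:.3f}, samples={len(db.samples)}, revenue={db.revenue}")
```

With these illustrative defaults each query pays for four new samples, so the pool grows linearly in the number of queries. In the abstract, by contrast, the total cost depends on how adaptive the queries are.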


