资源论文Comparing distributions: geometry improves kernel two-sample testing

Comparing distributions: geometry improves kernel two-sample testing

2020-02-19 | |  36 |   33 |   0

Abstract

Are two sets of observations drawn from the same distribution? This problem is a two-sample test. Kernel methods lead to many appealing properties. Indeed state-of-the-art approaches use the 图片.png distance between kernel-based distribution representatives to derive their test statistics. Here, we show that 图片.png distances (with p 图片.png 1) between these distribution representatives give metrics on the space of distributions that are well-behaved to detect differences between distributions as they metrize the weak convergence. Moreover, for analytic kernels, we show that the 图片.png geometry gives improved testing power for scalable computational procedures. Specifically, we derive a finite dimensional approximation of the metric given as the 图片.png1 norm of a vector which captures differences of expectations of analytic functions evaluated at spatial locations or frequencies (i.e, features). The features can be chosen to maximize the differences of the distributions and give interpretable indications of how they differs. Using an图片.png1 norm gives better detection because differences between representatives are dense as we use analytic kernels (non-zero almost everywhere). The tests are consistent, while much faster than state-of-the-art quadratic-time kernel-based tests. Experiments on artificial and real-world problems demonstrate improved power/time tradeoff than the state of the art, based on图片.png2 norms, and in some cases, better outright power than even the most expensive quadratic-time tests.

上一篇:From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization

下一篇:Linear Stochastic Bandits Under Safety Constraints

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to learn...

    The move from hand-designed features to learned...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...