资源论文Binary Embedding: Fundamental Limits and Fast Algorithm

Binary Embedding: Fundamental Limits and Fast Algorithm

2020-03-05 | |  103 |   90 |   0

Abstract

Binary embedding is a nonlinear dimension reduction methodology where high dimensional data are embedded into the Hamming cube while preserving the structure of the original space. Specifically, for an arbitrary N distinct points in 图片.png our goal is to encode each point using m-dimensional binary strings such that we can reconstruct their geodesic distance up to δ uniform distortion. Existing binary embedding algorithms either lack theoretical guaran tees or suffer from running time O mp . We make three contributions: (1) we establish a lower bound that shows any binary embedding oblivious to the set of points requires m = 图片.png bits and a similar lower bound for non-oblivious embeddings into Hamming distance; (2) we propose a novel fast binary embedding algorithm with provably optimal bit complexity m = 图片.png and near linear running time O(p log p) whenever log 图片.png with a slightly worse running time for larger log N ; (3) we also provide an analytic result about embedding a general set of points K ⊆ 图片.png with even infinite size. Our theoretical findings are supported through experiments on both synthetic and real data sets.

上一篇:Cheap Bandits

下一篇:Convex Formulation for Learning from Positive and Unlabeled Data

用户评价
全部评价

热门资源

  • Regularizing RNNs...

    Recently, caption generation with an encoder-de...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Deep Cross-media ...

    Cross-media retrieval is a research hotspot in ...

  • Joint Pose and Ex...

    Facial expression recognition (FER) is a challe...

  • Supervised Descen...

    Many computer vision problems (e.