Unsupervised and Supervised Visual Codes with Restricted Boltzmann Machines

资源分类

2020-04-02 |

160 |

165 |

Abstract

Recently, the coding of local features (e.g. SIFT) for image categorization tasks has been extensively studied. Incorporated within the Bag of Words (BoW) framework, these techniques optimize the pro- jection of local features into the visual codebook, leading to state-of-the- art performances in many benchmark datasets. In this work, we propose a novel visual codebook learning approach using the restricted Boltzmann machine (RBM) as our generative model. Our contribution is three-fold. Firstly, we steer the unsupervised RBM learning using a regularization scheme, which decomposes into a combined prior for the sparsity of each feature’s representation as well as the selectivity for each codeword. The codewords are then fine-tuned to be discriminative through the super- vised learning from top-down labels. Secondly, we evaluate the proposed method with the Caltech-101 and 15-Scenes datasets, either matching or outperforming state-of-the-art results. The codebooks are compact and inference is fast. Finally, we introduce an original method to visualize the codebooks and decipher what each visual codeword encodes.

上一篇：Towards Optimal Design of Time and Color Multiplexing Codes

下一篇：Active Frame Selection for Label Propagation in Videos

用户评价

全部评价

还没有评论，说两句吧！

热门资源

Regularizing RNNs...

Recently, caption generation with an encoder-de...
Deep Cross-media ...

Cross-media retrieval is a research hotspot in ...
Compact MDDs for ...

Pseudo-Boolean (PB) constraints are usually en...
Supervised Descen...

Many computer vision problems (e.
Predictive-Correc...

While deep feature learning has revolutionized ...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com