资源论文Part-Stacked CNN for Fine-Grained Visual Categorization

Part-Stacked CNN for Fine-Grained Visual Categorization

2019-12-26 | |  55 |   48 |   0

Abstract

In the context of fine-grained visual categorization, theability to interpret models as human-understandable visualmanuals is sometimes as important as achieving high clas-sification accuracy. In this paper, we propose a novel Part-Stacked CNN architecture that explicitly explains the fine-grained recognition process by modeling subtle differences from object parts. Based on manually-labeled strong part annotations, the proposed architecture consists of a fullyconvolutional network to locate multiple object parts and a two-stream classification network that encodes object-level and part-level cues simultaneously. By adopting a set of sharing strategies between the computation of multiple object parts, the proposed architecture is very efficient running at 20 frames/sec during inference. Experimental re-sults on the CUB-200-2011 dataset reveal the effectivenessof the proposed architecture, from multiple perspectives of classification accuracy, model interpretability, and efficiency. Being able to provide interpretable recognition results in realtime, the proposed method is believed to be effective in practical applications.

上一篇:Visually Indicated Sounds

下一篇:Inverting Visual Representations with Convolutional Networks

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • Learning to learn...

    The move from hand-designed features to learned...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...