Modeling Local and Global Deformations in Deep Learning: Epitomic Convolution, Multiple Instance Learning, and Sliding Window Detection

2019-12-17 |

63 |

40 |

Abstract

Deep Convolutional Neural Networks (DCNNs) achieveinvariance to domain transformations (deformations) byusing multiple ‘max-pooling’ (MP) layers. In this workwe show that alternative methods of modeling deforma-tions can improve the accuracy and efficiency of DCNNs. First, we introduce epitomic convolution as an alternative to the common convolution-MP cascade of DCNNs, that comes with the same computational cost but favorable learning properties. Second, we introduce a Multiple Instance Learning algorithm to accommodate global translation and scaling in image classification, yielding an efficientalgorithm that trains and tests a DCNN in a consistent manner. Third we develop a DCNN sliding window detector thatexplicitly, but efficiently, searches over the object’s positioscale, and aspect ratio. We provide competitive image classification and localization results on the ImageNet dataset and object detection results on Pascal VOC2007.

上一篇：Understanding Tools: Task-Oriented Object Modeling, Learning and Recognition

下一篇：FaLRR: A Fast Low Rank Representation Solver

用户评价

全部评价

还没有评论，说两句吧！

热门资源

The Variational S...

Unlike traditional images which do not offer in...
Learning to Predi...

Much of model-based reinforcement learning invo...
Stratified Strate...

In this paper we introduce Stratified Strategy ...
Learning to learn...

The move from hand-designed features to learned...
A Mathematical Mo...

Direct democracy, where each voter casts one vo...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com