ESPNetv2: A Light-weight, Power Efficient, and General Purpose Convolutional Neural Network


2019-09-17


Abstract: We introduce a light-weight, power efficient, and general purpose convolutional neural network, ESPNetv2, for modeling visual and sequential data. Our network uses group point-wise and depth-wise dilated separable convolutions to learn representations from a large effective receptive field with fewer FLOPs and parameters. The performance of our network is evaluated on four different tasks: (1) object classification, (2) semantic segmentation, (3) object detection, and (4) language modeling. Experiments on these tasks, including image classification on the ImageNet dataset and language modeling on the Penn Treebank dataset, demonstrate the superior performance of our method over the state-of-the-art methods. Our network outperforms ESPNet by 4-5% and has 2-4× fewer FLOPs on the PASCAL VOC and the Cityscapes datasets. Compared to YOLOv2 on MS-COCO object detection, ESPNetv2 delivers 4.4% higher accuracy with 6× fewer FLOPs. Our experiments show that ESPNetv2 is much more power efficient than existing state-of-the-art efficient methods, including ShuffleNets and MobileNets. Our code is open-source and available at https://github.com/sacmehta/ESPNetv2.
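To give a rough sense of why group point-wise and depth-wise dilated separable convolutions need fewer parameters while covering a larger receptive field, the sketch below applies the standard textbook parameter-count formulas. It is not taken from the ESPNetv2 code base: the function names are hypothetical, biases are ignored, and the channel, kernel, group, and dilation values are arbitrary illustrative choices.

```python
# Illustrative arithmetic only; not the authors' implementation.
# All function names are hypothetical, and bias terms are ignored.

def standard_conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a standard k x k convolution."""
    return k * k * c_in * c_out


def depthwise_separable_params(c_in: int, c_out: int, k: int, groups: int = 1) -> int:
    """Weights in a k x k depth-wise conv followed by a (grouped) 1 x 1 point-wise conv."""
    if c_in % groups or c_out % groups:
        raise ValueError("channel counts must be divisible by groups")
    depthwise = k * k * c_in             # one k x k filter per input channel
    pointwise = (c_in // groups) * c_out  # each output channel sees c_in / groups inputs
    return depthwise + pointwise


def effective_kernel_size(k: int, dilation: int) -> int:
    """Spatial extent covered by a k x k kernel with the given dilation rate."""
    return (k - 1) * dilation + 1


if __name__ == "__main__":
    c_in, c_out, k = 64, 64, 3  # assumed example sizes
    print("standard:", standard_conv_params(c_in, c_out, k))
    print("depth-wise separable:", depthwise_separable_params(c_in, c_out, k))
    print("with 4 point-wise groups:", depthwise_separable_params(c_in, c_out, k, groups=4))
    print("effective size at dilation 4:", effective_kernel_size(k, 4))
```

Dilation changes only the spacing between kernel taps, so the weight count stays the same while the covered region grows; grouping the point-wise convolution divides its share of the weights by the number of groups.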
