retinanet_detection

2020-02-18

Vehicle Detection with RetinaNet

Vehicle and pedestrian detection and tracking play a vital role in autonomous driving. In a previous project, I implemented a vehicle detection and tracking pipeline based on traditional computer vision techniques. This project explores the application of RetinaNet to the vehicle detection task.

Dataset

Training and evaluation in this project are based on the Udacity annotated driving dataset, which covers driving in Mountain View, California, and neighboring cities in daylight conditions. I combined the two datasets and retained only the bounding-box annotations for car, truck, and pedestrian.
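The filtering step described above can be sketched as follows. This is a minimal illustration, not the project's actual preprocessing code: the record layout `(frame, xmin, ymin, xmax, ymax, label)`, the sample rows, and the function name `filter_annotations` are all hypothetical.

```python
# Hypothetical record layout: (frame, xmin, ymin, xmax, ymax, label).
KEEP = {"car", "truck", "pedestrian"}


def filter_annotations(rows, keep=KEEP):
    """Keep only boxes whose label (case-insensitive) is in `keep`."""
    kept = []
    for frame, xmin, ymin, xmax, ymax, label in rows:
        name = label.strip().lower()
        if name in keep:
            # Store the normalized label so both sources share one vocabulary.
            kept.append((frame, xmin, ymin, xmax, ymax, name))
    return kept


# Made-up rows standing in for the two annotation sources.
rows_a = [("a1.jpg", 10, 20, 110, 90, "Car"),
          ("a1.jpg", 5, 5, 30, 60, "Pedestrian")]
rows_b = [("b1.jpg", 0, 0, 50, 40, "truck"),
          ("b1.jpg", 60, 10, 70, 30, "trafficLight")]

combined = filter_annotations(rows_a + rows_b)
print(len(combined))
```

Normalizing label case before the comparison is a design choice made here so that annotations from different sources end up with identical class names.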

Here's an overview of the dataset:

[Image: dataset overview]

Model evaluation

In this project, I'm interested in the detection accuracy of the models as well as their inference speed. The goal is to find a model that can detect vehicles with good accuracy in real time.

Model accuracy is primarily evaluated by mean Average Precision (mAP) and mean Average Recall (mAR) at an IoU threshold of 0.5.
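To make the metric concrete, here is a minimal sketch of IoU and of average precision for a single class on a single image. It is not the project's evaluation code. It assumes boxes in `(xmin, ymin, xmax, ymax)` form, greedy matching of each detection to the best still-unmatched ground-truth box, and all-point interpolation of the precision/recall curve. mAP would then be the mean of the per-class AP values.

```python
def iou(a, b):
    """Intersection over union of two (xmin, ymin, xmax, ymax) boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0


def average_precision(detections, ground_truth, iou_thresh=0.5):
    """AP for one class. detections: list of (score, box)."""
    if not ground_truth:
        return 0.0
    matched, hits = set(), []
    # Visit detections from most to least confident.
    for _, box in sorted(detections, key=lambda d: d[0], reverse=True):
        best, best_j = 0.0, None
        for j, gt in enumerate(ground_truth):
            overlap = iou(box, gt)
            if j not in matched and overlap > best:
                best, best_j = overlap, j
        if best_j is not None and best >= iou_thresh:
            matched.add(best_j)
            hits.append(True)   # true positive
        else:
            hits.append(False)  # false positive
    precisions, recalls, tp = [], [], 0
    for k, hit in enumerate(hits, start=1):
        tp += hit
        precisions.append(tp / k)
        recalls.append(tp / len(ground_truth))
    # Make precision non-increasing in recall (the precision envelope).
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])
    ap, prev_recall = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap


gts = [(0, 0, 10, 10), (20, 20, 30, 30)]
dets = [(0.9, (0, 0, 10, 10)), (0.8, (50, 50, 60, 60)), (0.7, (20, 20, 30, 31))]
ap = average_precision(dets, gts)
print(round(ap, 3))
```

In this toy example the first and third detections match a ground-truth box and the second is a false positive.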

The models being benchmarked are:

  1. Sliding-window method based on HOG features and a linear classifier

  2. RetinaNet with ResNet50 backbone, pre-trained on COCO

  3. RetinaNet with ResNet18 backbone, trained on driving dataset

  4. RetinaNet with MobileNet backbone, trained on driving dataset
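For the first baseline, the sliding-window idea can be sketched as below. The window size and stride are hypothetical values chosen for illustration, not the settings used in this project. In a HOG-based detector, each window would be described by HOG features and scored by the linear classifier.

```python
def sliding_windows(img_w, img_h, win=64, stride=32):
    """Yield (xmin, ymin, xmax, ymax) windows that fit inside the image."""
    for y in range(0, img_h - win + 1, stride):
        for x in range(0, img_w - win + 1, stride):
            yield (x, y, x + win, y + win)


# A 256x128 image scanned with 64x64 windows and 50% overlap.
windows = list(sliding_windows(256, 128))
print(len(windows), windows[0], windows[-1])
```

The number of windows grows quickly as the stride shrinks or more scales are added, which is one reason this kind of detector tends to be slow per frame.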

Main results

Benchmark

| Model | AP50 (car) | AP50 (truck) | AP50 (pedestrian) | # of parameters (millions) | CPU inference (s/frame) | GPU inference (s/frame) |
| HOG | 24.6 | - | - | - | 6.9 | - |
| RetinaNet-ResNet50, pre-trained on COCO | 71.8 | 53.4 | 32.4 | 37.4 | 2.0 | 0.14 |
| RetinaNet-ResNet18-64 | 66.7 | 54.1 | 27.2 | 12.0 | 1.4 | 0.1 |
| RetinaNet-ResNet18-48 | 66.1 | 51.0 | 18.8 | 7.0 | 1.2 | 0.09 |
| RetinaNet-ResNet18-32 | 71.9 | 55.2 | 34.7 | 3.4 | 0.97 | 0.09 |
| RetinaNet-MobileNet-1 | 73.3 | 54.6 | 42.4 | 4.4 | 1.1 | 0.1 |
| RetinaNet-MobileNet-0.75 | 67.6 | 57.2 | 29.6 | 2.8 | 1.0 | 0.07 |
| RetinaNet-MobileNet-0.5 | 65.3 | 55.2 | 36.3 | 1.6 | 0.77 | 0.055 |
| RetinaNet-MobileNet-0.25 | 67.6 | 54.1 | 38.2 | 0.84 | 0.54 | 0.05 |
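Since the stated goal is real-time detection, it can help to read the GPU column as frames per second. The short sketch below converts two of the benchmark entries; the dictionary and function names are illustrative only, and what counts as "real time" depends on the camera frame rate of the application.

```python
def frames_per_second(seconds_per_frame):
    """Convert a per-frame latency into throughput."""
    return 1.0 / seconds_per_frame


# GPU latencies (s/frame) taken from the benchmark above.
gpu_latency = {
    "RetinaNet-ResNet50 (COCO)": 0.14,
    "RetinaNet-MobileNet-0.25": 0.05,
}

fps = {name: frames_per_second(s) for name, s in gpu_latency.items()}
for name, value in fps.items():
    print(f"{name}: {value:.1f} frames/s")
```

By this measure the smallest MobileNet variant runs at roughly 20 frames per second on GPU, against about 7 for the COCO-pretrained ResNet50 model.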

Example detection result

[Image: example detection result]

Vehicle tracking on video

Here's the result of running RetinaNet-ResNet50-COCO on a dash camera video:

video_out_coco.gif

Here's the result of running RetinaNet-MobileNet-0.25 on a dash camera video:

video_out_mobilenet_0.25.gif

Appendix

The following graph shows the structure of the feature pyramid network (FPN) built on top of a ResNet backbone.

[Image: FPN structure on a ResNet backbone]
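As a rough illustration of what the pyramid produces, the sketch below computes the feature-map size at each level and the resulting number of anchors. It assumes the defaults from the RetinaNet paper (levels P3 to P7 with strides 8 to 128 and 9 anchors per location) and rounds sizes up, as "same" padding would. The configuration used in this project may differ.

```python
import math


def pyramid_shapes(img_h, img_w, strides=(8, 16, 32, 64, 128)):
    """Feature-map (height, width) per pyramid level, P3..P7 by default."""
    return [(math.ceil(img_h / s), math.ceil(img_w / s)) for s in strides]


def total_anchors(shapes, anchors_per_location=9):
    """Number of anchor boxes summed over all pyramid levels."""
    return sum(h * w for h, w in shapes) * anchors_per_location


shapes = pyramid_shapes(512, 512)
print(shapes)
print(total_anchors(shapes))
```

Most of the anchors come from the finest level, since each coarser level has a quarter as many locations as the one before it.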

The following graph shows the structure of the regression and classification subnets.

[Image: regression and classification subnets]
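The size of the two subnets can be estimated with some simple arithmetic. The sketch below follows the layout described in the RetinaNet paper: four 3x3 convolutions of a fixed width followed by a 3x3 output convolution, with K*A output channels for classification and 4*A for box regression. The width of 256, the 9 anchors per location, and the function name are assumptions for illustration; the subnets in this project may be configured differently.

```python
def head_summary(num_classes, width=256, anchors_per_location=9, depth=4):
    """Output channels and parameter count for each RetinaNet-style subnet."""

    def conv3x3_params(c_in, c_out):
        # 3x3 kernel weights plus one bias per output channel.
        return 3 * 3 * c_in * c_out + c_out

    tower = depth * conv3x3_params(width, width)
    outputs = {
        "classification": num_classes * anchors_per_location,
        "regression": 4 * anchors_per_location,
    }
    return {
        name: {"out_channels": c, "params": tower + conv3x3_params(width, c)}
        for name, c in outputs.items()
    }


# Three classes: car, truck, pedestrian.
summary = head_summary(num_classes=3)
for name, info in summary.items():
    print(name, info)
```

Because the tower cost scales with the square of the width, narrowing the subnets is an effective way to cut parameters.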
