Pedestrian detection based on faster R-CNN in nighttime by fusing deep convolutional features of successive images

By Jong Hyun Kim, Ganbayar Batchuluun, Kang Ryoung Park

Introduction

This code is relative to paper.

In the paper, training is done using Caltech and KAIST DB seperately, total 6 stages. However, this implementation trains network using both DB at the same time, total 4 stages, for simplicity, resulting a similar performance.

Also, this code is written based on the MATLAB implementations of ShaoqingRen/faster_rcnn and zhangliliang/RPN_BF.

This code has been tested on Windows7 with MATLAB 2017a.

Requirements

Caffe
MATLAB
GPU: Geforce GTX 1070, etc

Preparation for Training

download videos and toolboxes from KAIST Multispectral Pedestrian Detection Benchmark and Caltech Pedestrian Detection Benchmark.
extract visible camera images and annotations using each toolbox
place KAIST(set00-05, skip=10), Caltech(set00-10, skip=30) training images and annotations in ./datasets/train/
place KAIST(set06-11, skip=20) testing images and annotations in ./datasets/test/
place the toolbox folders (KAIST, Caltech) in ./external/, and name as toolbox(kaist) and toolbox(caltech), respectively
run fetch_data/fetch_caffe_mex_cuda65.m to download a compiled Caffe mex (for Windows only).
download ImageNet-pre-trained VGG16(reduced for 7x3 ROI pooling) model(depicted below) from GoogleDrive and place it to ./models/pre_trained_models/vgg_16layers