资源算法MobileNet-in-FPGA

MobileNet-in-FPGA

2020-02-27 | |  47 |   0 |   0

MobileNet in FPGA

Generator of verilog description for FPGA MobileNet implementation. There are several pre-trained models available for frequent tasks like detection of people, cars and animals. You can train your own model easily on your dataset using code from this repository and have the same very fast detector on FPGA working in real time for your own task.

Software requirements

Python 3.*, keras 2.2.4, tensorflow, kito

Hardware requirements

  1. TFT-screen ILI9341 Size: 2.8", Resolution: 240x320, Interface: SPI

  2. Camera OV5640. Active array size: 2592 x 1944

  3. OpenVINO Starter Kit. Cyclone V (301K LE, 13,917 Kbits embedded memory)

Demo

图片.png

How to run

  1. python3 r01_prepare_open_images_dataset.py - it will create training files using Open Images Dataset (OID).

  2. python3 r02_train_mobilenet.py - run training process. Will create weights for model and output accuracy of model.

  3. python3 r03_mobilenet_v1_reduce_and_scale_model.py - batchnorm fusion and rescale model on range (0, 1) instead of (0, 6). Returns new rescaled model

Note: You can skip part 1, 2 and 3 if you use our pretrained weight files below

  1. python3 r04_find_optimal_bit_for_weights.py - code to find optimal bit for feature maps, weights and biases, also returns maximum overflow for weights and biases over 1.0 value.

  2. python3 r05_gen_weights_in_verilog_format.py - generate weights in verliog format using optimal bits from previous step

  3. python3 r06_generate_debug_data.py - generate intermediate feature maps for each layer and details about first pixel calculation (can be used for debug)

  4. python3 r07_generate_verilog_for_mobilenet.py - generate verilog based on given model and parameters like number of convolution blocks

Updates

  • 2019.10.04 We greatly improved speed of image reading and preprocessing. Now it takes only 5% of total time instead of 77% earlier. Speed for 8 convolution version of device increased from ~10 FPS up to ~ 40 FPS.

Pre-trained models


People detector (128px)Cars detector (128px)Animals detector (128px)
Accuracy (%)84.4296.3189.67
Init model (can be used for training and fine-tuning)people.h5cars.h5animals.h5
Reduced and rescaled modelpeople.h5cars.h5animals.h5
Optimal bits found12, 11, 10, 7, 310, 9, 8, 7, 312, 11, 10, 7, 3
Quartus project (verilog)linklinklink

Connection of peripherals

图片.png

Writing weights in memory

See guide

Description of method

Innovate FPGA


上一篇:face-detection-ssd-mobilenet

下一篇: yolov3_mobilenet_caffe

用户评价
全部评价

热门资源

  • seetafaceJNI

    项目介绍 基于中科院seetaface2进行封装的JAVA...

  • spark-corenlp

    This package wraps Stanford CoreNLP annotators ...

  • Keras-ResNeXt

    Keras ResNeXt Implementation of ResNeXt models...

  • capsnet-with-caps...

    CapsNet with capsule-wise convolution Project ...

  • inferno-boilerplate

    This is a very basic boilerplate example for pe...