Abstract:Based on the excellent performance of YOLO series networks in the field of target detection, a high-efficient YOLO target detection system based on FPGA is proposed. The YOLOv5n network is optimized by layer fusion, and the network model is retrained and quantized using quantization aware training. The feature map and weight data are quantized to 8 bits, reducing hardware resource consumption. A hybrid flow configurable hardware accelerator architecture is designed to realize the forward inference of the network layer by configuring model parameters. The network layer is optimized in hardware module, and ping-pong dual cache and interlayer pipeline is adopted. The entire hardware acceleration system is implemented through coordinated software-hardware design, and hardware modules are reasonably scheduled to achieve efficient parallel work of soft-core processor and hardware accelerator. Through practical test on Xilinx VC707 FPGA development board, the system achieves the throughput of 27.15 GOPS at the operating frequency of 100 MHz, and the power consumption is only 2.88 W, achieving a high energy efficiency of 9.43 GOPS/W, balancing detection speed and power consumption, and meeting the requirements of target detection.