Abstract:Abstract: Addressing the challenges of real-time performance and energy efficiency ratio of target tracking algorithms in edge computing scenarios, this study investigates the multi-granularity target tracking algorithm optimization deployment on reconfigurable arrays, enhancing the hardware deployment efficiency of convolutional layers in the DeepSORT target tracking network. An analysis of the computational characteristics of reconfigurable arrays supporting three data granularities—Int16, Int8, and Int4—was conducted. Based on actual deployment requirements, dynamic adjustments were made to the data organization methods, switching the computational channels within the MAC units of the reconfigurable arrays and optimizing network parameters. These approaches significantly improved convolutional efficiency and achieved an effective balance between computational accuracy and execution speed. Hardware testing was performed on a Virtex UltraScale 440 FPGA development board. The experimental results show that, compared to the Int16 mode, the Int8 and Int4 modes reduced memory access cycles by an average of 45.9% and 72%, and computational cycles by an average of 45.6% and 73%, respectively. The deployment scheme achieved an average PE utilization rate of 95.8% with a power consumption of only 5.08 W, providing a more flexible and efficient solution for the edge-side deployment of target tracking algorithms.