Abstract:In the automatic feeding control system, in order to solve the problems that the traditional inductive sensor is easy to be disturbed by the external complex environment and needs to carry out tedious calibration work, a visual classification and recognition method of silo video based on improved C3D model was proposed. Based on the experimental requirements, a cooperative target was designed and a video dataset for silo identification was established. The initial C3D model is improved as the backbone network, and the convolutional layers of the 3rd, 4th, and 5th layers of the model are simplified, which greatly reduces the number of model parameters and is conducive to speeding up the inference speed. The SE attention mechanism can effectively find the salient area of the target in the video frame of complex scenes, and can efficiently extract features while taking into account the time series information to improve the recognition accuracy. The experimental results show that the accuracy of the SE-C3D recognition model reaches 99.61%, which is 2.48% higher than that of the initial C3D model, and the performance indicators are also significantly improved compared with other typical 3D convolution models, which is of great significance for the development of intelligent feeding system in the future.