Abstract:Dynamic gesture recognition is one of the most popular tasks in the field of computer vision, which has been widely concerned by researchers. Dynamic gesture recognition technology has shown high application potential in many fields such as automatic driving, virtual reality and human-computer interaction. Gestures are an intuitive and ideal way to exchange information with others in a virtual space, to direct a robot to perform a specific task in a hostile environment, or to interact with a computer; Some commonly used dynamic gesture data sets are investigated and summarized, and the modes, data volume and application scenarios of dynamic gesture data sets are summarized and analyzed. Starting from the types of networks used, this paper summarizes the research progress of vision-based dynamic gesture recognition technology, focuses on introducing and concluding the methods based on deep learning, and summarizes and compares the methods based on convolutional neural network, recurrent neural network and graph neural network. Finally, the research direction of dynamic gesture recognition based on vision is prospected.