Abstract:Considering the problems of subjectivity and energy-consuming visual diagnosis of traditional Chinese medicine tongue diagnosis, a detection model based improved YOLOv5 for tongue tooth mark and fissure features was proposed. The SimAM-CSP module was introduced into the backbone of YOLOv5 to enhance the feature extraction capability of the network ground. The Bottleneck Attention Module was added between the Neck layer and the Head layer to further focus critical information. The feature fusion structure of YOLOv5 was adapted to increase the ability to perceive image details and improve performance of network. The localization loss function GIoU was replaced with EIoU in original YOLOv5 algorithm for the purpose of simultaneously improving the training convergence speed and prediction regression accuracy. The initial anchor frames of YOLOv5 were adjusted by the K-Means algorithm to make the model more suitable for tongue tooth mark and fissure detection. The improved YOLOv5 model was trained in the self-built tongue image dataset, and the average detection accuracy reaches 79.5%, which is 6.3 percentage points higher than that of the original algorithm. Experimental results show that the improved YOLOv5 model can effectively improve the detection accuracy of tongue tooth mark and fissure, which is helpful for assisting doctors in diagnosis.