Abstract:The MYOLOv8 algorithm is proposed to address the problem that the current object detection methods make it hard to represent multi-scale context features. In order to improve the detection capability of YOLOv8 model for small, medium and large objects, a hierarchical multi-scale extraction module is proposed to perform hierarchical feature aggregation of spatial features to capture multi-scale spatial context information. In order to further improve the model's ability to extract spatial semantics, a self-adaptive channel attention mechanism is proposed, which promotes the model to focus on useful features and suppress useless features by adaptively learning the interdependencies between adjacent channels. In order to improve the localization ability of model for boundary difficult samples, a Slide Loss is proposed to deal with the sample imbalance problem in object target detection, which employs a strong weighting on the difficult samples to motivate the model to focus on optimizing the difficult samples. Experimental results on the MS COCO dataset shows that the proposed algorithm improves the mAP by 3.4% and 1.4% compared to YOLOv8-n and YOLOv8-s, respectively, while having similar the number of parameters and computational cost, as well as faster inference speed.