Abstract:Steel is widely used in manufacturing, and surface defects significantly affect its quality. The diversity of defect shapes and interference from the detection background present challenges to existing detection models. To address this, this paper proposes the AGD-YOLO algorithm for detecting surface defects in steel.First, we designed the Adaptive Multi-Scale Downsampling (AMSD) module, which utilizes dilated convolutions and multi-pooling operations to capture and integrate multi-scale feature information. This module is incorporated into the backbone network to enhance defect recognition capability. Next, we introduced the Enhanced Global Context-Space Pooling Pyramid (EGC-SPP) module, which combines dilated convolutions and spatial pyramid pooling to integrate global background and edge features.Finally, we designed a Dual-Stream Fusion Network (DSFN) to enhance feature representation, improving the algorithm"s contextual complementarity and the ability to recognize detailed features. Experimental results indicate that the improved algorithm achieves a 7.1% increase in mAP on the NEU- DET dataset, effectively addressing the challenges in detecting surface defects on steel.