Abstract:To address the problem of degraded detection accuracy caused by target blurring and low edge contrast in underwater environments, this paper proposes DR-YOLOv11, an enhanced underwater scene detection algorithm based on YOLOv11n. The algorithm introduces RFAConv to replace standard convolutions, dynamically generating feature weights within the receptive field to enhance local feature extraction capability. RFCAConv is introduced into the C3K2 module, integrating receptive-field attention with coordinate attention to enhance the robustness of feature extraction. Furthermore, a Context-Guided Local Position Attention mechanism (CGLPA) is designed, which constructs contextual priors and fuses channel feature information to guide adaptive weighting of neighborhood features, thereby improving the detection performance for small targets in blurred environments. Bi-directional Feature Pyramid Network (BiFPN) is adopted to reconstruct the neck structure, achieving feature fusion while reducing model complexity. Experiments on the DUO underwater object detection dataset show that DR-YOLOv11 achieves 86.0 % mAP@0.5 and 66.5 % mAP@0.5:0.95 with only 1.8 M parameters, surpassing YOLOv11n by 3.5 and 3.3 percentage points, respectively, while reducing the parameter count by 31 %.