Abstract:Accurate segmentation of metallographic images of heavy-duty bearings is an important approach for evaluating their microstructures and predicting service life. Aiming at the segmentation accuracy issues caused by low edge definition, large size variations and small particle sizes of carbide microstructures in heavy-duty bearing metallographic images, an ACSA-U-net model for carbide segmentation in metallographic images is proposed based on adaptive multi-scale feature fusion and attention mechanism. By constructing a multi-scale feature extraction network, the CASAB module is embedded into the skip connections of the encoder path to enhance the quality of features transmitted via skip connections, highlight effective channels and spatial regions, suppress irrelevant background interference, and improve the ability to capture feature information of carbide particles in metallographic images. An Adaptive Feature Enhancement Module (AFEM) is designed in the sampling process of the decoder path, which adopts an adaptive gating mechanism to realize adaptive fusion and enhancement of multi-scale features. The CBAM module is embedded therein to adaptively optimize the feature maps in both channel and spatial dimensions. Validation results based on a metallographic image dataset of a certain type of heavy-duty bearing show that the proposed ACSA-U-net model outperforms the traditional U-net++ and its improved variants in terms of Dice coefficient, precision, recall and intersection over union (IoU). It achieves stable segmentation of carbides in metallographic images and provides support for the research on service performance evaluation of heavy-duty bearings.