Journal of Tropical Oceanography ›› 2026, Vol. 45 ›› Issue (3): 95-108.doi: 10.11978/2025098CSTR: 32234.14.2025098

Previous Articles     Next Articles

An improved model for epibenthic organism target detection based on RT-DETR

DENG Jianzhi1(), TANG Zhenghao2,3, LI Yun4()   

  1. 1 College of Physics and Electronic Information Engineering, Guilin University of Technology, Guilin 541006, China
    2 Engineering Research Center of Optoelectronic Information and Intelligent Communication Technology, Guilin University of Technology, Guilin 541006, China
    3 Key Laboratory of Low-dimensional Structural Physics and Application, Education Department of Guangxi Zhuang Autonomous Region, Guilin University of Technology, Guilin 541006, China
    4 School of Physics and Electronic Information, Guangxi Minzu University, Nanning 530006, China
  • Received:2025-07-07 Revised:2025-11-04 Online:2026-05-10 Published:2026-05-28
  • Contact: LI Yun. email: 20240165@gxmzu.edu.cn
  • Supported by:
    Guangxi Science and Technology Major Project (Guike AA23062035-2); Guangxi Science and Technology Major Project (Guike AA24263038)

Abstract:

Epibenthic organism target detection is of great significance in marine ecological monitoring and resource management. However, due to the low illumination, blurriness and complex background of underwater images, the performance of traditional detection algorithms is limited. This paper proposes the MEIE-RTDETR (multiscale edge information enhance-real time detection transformer) model. By designing a multi-scale edge information enhancement module to strengthen feature extraction, adopting an adaptive sparse self-attention (ASSA) mechanism to reduce computational redundancy, and introducing an intensity enhance layer (IEL) module to improve the feature pyramid for better small target detection, the model enhances its overall capability. Additionally, a powerful-IoU (P-IoU) + normalized Wasserstein distance (NWD) loss function is designed to improve performance in scenarios with blurred bounding boxes and multi-scale target detection. Experiments on the DUO (detecting underwater objects) and RUOD (rethinking general underwater object detection) datasets show that the improved model outperforms Faster R-CNN (region-based convolutional neural networks), the YOLO (you only look once) series, and the original RT-DETR (real time-detection transformer). It achieves mAP50 scores of 85.0% and 85.4% on the respective datasets with significantly reduced number of parameters and computational complexity, providing an effective solution for lightweight and high-precision epibenthic target detection.

Key words: epibenthos, target detection, RT-DETR, multi-scale feature enhancement, adaptive sparse self-attention (ASSA), lightweight model

CLC Number: 

  • TP391.41