热带海洋学报 ›› 2026, Vol. 45 ›› Issue (3): 95-108.doi: 10.11978/2025098CSTR: 32234.14.2025098

• 海洋生物学 • 上一篇    下一篇

基于RT-DETR的浅海底栖生物目标检测改进模型

邓健志1(), 唐政豪2,3, 李云4()   

  1. 1 桂林理工大学, 物理与电子信息工程学院, 广西 桂林 541006
    2 桂林理工大学, 光电信息与智能通信技术工程研究中心, 广西 桂林 541006
    3 桂林理工大学, 广西高校低维结构物理与应用重点实验室, 广西 桂林 541006
    4 广西民族大学, 物理与电子信息学院, 广西 南宁 530006
  • 收稿日期:2025-07-07 修回日期:2025-11-04 出版日期:2026-05-10 发布日期:2026-05-28
  • 通讯作者: 李云。email: 20240165@gxmzu.edu.cn
  • 作者简介:

    邓健志(1982—), 男, 广东省肇庆市人, 教授, 博士生导师, 主要从事可见光通信、嵌入式技术、物联网等方面的研究。email:

  • 基金资助:
    广西科技重大专项(桂科AA23062035-2); 广西科技重大专项(桂科AA24263038)

An improved model for epibenthic organism target detection based on RT-DETR

DENG Jianzhi1(), TANG Zhenghao2,3, LI Yun4()   

  1. 1 College of Physics and Electronic Information Engineering, Guilin University of Technology, Guilin 541006, China
    2 Engineering Research Center of Optoelectronic Information and Intelligent Communication Technology, Guilin University of Technology, Guilin 541006, China
    3 Key Laboratory of Low-dimensional Structural Physics and Application, Education Department of Guangxi Zhuang Autonomous Region, Guilin University of Technology, Guilin 541006, China
    4 School of Physics and Electronic Information, Guangxi Minzu University, Nanning 530006, China
  • Received:2025-07-07 Revised:2025-11-04 Online:2026-05-10 Published:2026-05-28
  • Contact: LI Yun. email: 20240165@gxmzu.edu.cn
  • Supported by:
    Guangxi Science and Technology Major Project (Guike AA23062035-2); Guangxi Science and Technology Major Project (Guike AA24263038)

摘要:

浅海底栖生物目标检测在海洋生态监测和资源管理中具有重要意义, 但受限于水下图像的低光照、模糊及复杂背景, 传统检测算法性能不佳。本文提出了MEIE-RTDETR(multi-scale edge information enhancement real-time detection transformer)模型, 通过设计多尺度边缘信息增强模块强化特征提取, 采用自适应稀疏自注意力(adaptive sparse self-attention, ASSA)降低计算冗余, 并引入亮度信息增强IEL(intensity enhancement layer)模块改进特征金字塔提升小目标检测能力, 最后设计P-IoU(powerful-IoU)+NWD(normalized Wasserstein distance)损失函数增强对边界框模糊和多尺度目标检测的性能。在DUO(detecting underwater objects)和RUOD(rethinking general underwater object detection)数据集上的实验表明, 改进后的模型在参数量和计算量显著降低的同时, mAP50分别达到85.0%和85.4%, 优于Faster R-CNN(region-based convolutional neural networks)、YOLO(you only look once)系列及原始RT-DETR(real-time detection transformer), 为水下轻量化高精度检测提供了有效解决方案。

关键词: 浅海底栖生物, 目标检测, RT-DETR, 多尺度特征增强, 自适应稀疏自注意力, 轻量化模型

Abstract:

Epibenthic organism target detection is of great significance in marine ecological monitoring and resource management. However, due to the low illumination, blurriness and complex background of underwater images, the performance of traditional detection algorithms is limited. This paper proposes the MEIE-RTDETR (multiscale edge information enhance-real time detection transformer) model. By designing a multi-scale edge information enhancement module to strengthen feature extraction, adopting an adaptive sparse self-attention (ASSA) mechanism to reduce computational redundancy, and introducing an intensity enhance layer (IEL) module to improve the feature pyramid for better small target detection, the model enhances its overall capability. Additionally, a powerful-IoU (P-IoU) + normalized Wasserstein distance (NWD) loss function is designed to improve performance in scenarios with blurred bounding boxes and multi-scale target detection. Experiments on the DUO (detecting underwater objects) and RUOD (rethinking general underwater object detection) datasets show that the improved model outperforms Faster R-CNN (region-based convolutional neural networks), the YOLO (you only look once) series, and the original RT-DETR (real time-detection transformer). It achieves mAP50 scores of 85.0% and 85.4% on the respective datasets with significantly reduced number of parameters and computational complexity, providing an effective solution for lightweight and high-precision epibenthic target detection.

Key words: epibenthos, target detection, RT-DETR, multi-scale feature enhancement, adaptive sparse self-attention (ASSA), lightweight model

中图分类号: 

  • TP391.41