热带海洋学报

• • 上一篇    

基于全局注意力感知和局部可变形卷积的浅海水深测绘方法研究

陈颖盈1 ,钱硕1 ,王微1 ,张高唯1 ,王祎1 ,郝增周2 ,徐凌宇3 ,石绥祥4
  

  1. 1, 北京邮电大学, 北京 100876;

    2, 自然资源部第二海洋研究所卫星海洋环境动力学国家重点实验室, 浙江 杭州 310012;

    3, 上海大学, 上海 200444;

    4, 国家海洋信息中心数字海洋实验室, 天津 300171



  • 收稿日期:2025-08-31 修回日期:2025-12-08 接受日期:2025-12-26
  • 通讯作者: 王微
  • 基金资助:

    国家自然科学基金项目(62076232,62172049)

Shallow-Sea Bathymetric Mapping Based on Global Attention Awareness and Local Deformable Convolution

CHEN Yingying1, QIAN Shuo1, WANG Wei1, ZHANG Gaowei1, WANG Yi1, HAO Zengzhou2, XU Lingyu3, SHI Suixiang4   

  1. , Beijing University of Posts and Telecommunications 100876, China
    State Key Laboratory of Satellite Ocean Environment Dynamics, Second Institute of Oceanography, Ministry of Natural Resources 310012, China
    , Shanghai University 200444, China
    Digital Ocean Laboratory, National Marine Data and Information Service 300171, China
  • Received:2025-08-31 Revised:2025-12-08 Accepted:2025-12-26
  • Supported by:

    National Natural Science Foundation of China (62076232,62172049)

摘要: 可靠的浅海水深数据对海洋生态研究、海洋过程分析及人类海洋活动支撑具有关键意义。传统测深技术如声学测深仪、激光雷达等存在成本高、可达性受限的问题,合成孔径雷达(SAR)测深易受环境因素干扰,而基于卫星多光谱影像的水深反演(satellite derived bathymetry,SDB)方法虽成本效益更优,但现有 SDB 模型仍存在明显局限:物理模型依赖高质量数据且求解复杂,传统机器学习经验模型受限于实测数据获取难度,卷积神经网络(convolutional neural networks,CNNs)与vision transformer(ViT)等深度学习模型难以同时兼顾浅滩礁石等局部细节的捕捉与远距离全局趋势的刻画。此外,目前的多数算法缺乏岛屿之间的迁移泛化性验证,制约实际应用。针对上述问题,本文提出一种融合了可变形卷积和多尺度特征的SDB模型:以Internimage 模型作为主干模型,利用其可变形卷积层自适应调整感受野以精准捕捉复杂地形局部空间异质性,并借助交互注意力机制建模长距离依赖以优化全局趋势刻画;在 Internimage 提取特征后,采用改进 Upernet 模型融合多尺度特征,构建下游水深反演头以进一步平衡局部精细预测与全局趋势一致性感知。在数据预处理阶段,对多时相 Sentinel-2 L2A SR数据进行云掩膜,并结合实测高程数据作为标签。本实验选取全球六个典型研究区(夏威夷瓦胡岛、考艾岛、圣克罗伊岛、维耶克斯岛、塞班岛与天宁岛)开展独立拟合测试,并通过 “预训练+少样本二阶段训练”的模式验证模型的跨岛屿泛化能力。结果表明,本文提出的模型反演精度显著优于其他基线模型,且在新岛屿上通过少样本微调即可展现出良好的泛化性,为实际浅海水域水深测绘提供了更高效、可靠的技术路径。

关键词: 水深测绘, Internimage, Upernet, satellite derived bathymetry, 泛化性, 深度学习

Abstract: Reliable shallow-sea bathymetry data plays a crucial role in marine ecological research, marine process analysis, and the support of human marine activities. Traditional bathymetric technologies, such as acoustic sounders and lidar, suffer from high costs and limited accessibility. Synthetic Aperture Radar - based bathymetry is susceptible to interference from environmental factors. Although the satellite-derived bathymetry (SDB) method using satellite multispectral imagery offers better cost-effectiveness, existing SDB models still have significant limitations: physical models rely on high-quality data and involve complex solving processes; traditional machine learning empirical models are constrained by the difficulty in acquiring in-situ measured data; and deep learning models such as convolutional neural networks (CNNs) and vision transformers (ViTs) struggle to simultaneously capture local details (e.g., shoals and reefs) and characterize long-range global trends. Additionally, most current algorithms lack verification of cross-island transfer generalization, which restricts their practical applications.To address the above issues, this study proposes an SDB model integrating deformable convolution and multi-scale features. The Internimage model is adopted as the backbone: its deformable convolution layers adaptively adjust the receptive field to accurately capture the local spatial heterogeneity of complex terrains, while the cross-attention mechanism is utilized to model long-range dependencies for optimizing global trend characterization. After feature extraction by Internimage, an improved Upernet model is employed to fuse multi-scale features, and a downstream bathymetry inversion head is constructed to further balance the perception of local fine-grained prediction and global trend consistency.In the data preprocessing stage, cloud masking is performed on multi-temporal Sentinel-2 L2A surface reflectance data, and in-situ measured elevation data are used as labels. Experiments were conducted in six typical global study areas (Oahu Island and Kauai Island in Hawaii, St. Croix Island, Vieques Island, Saipan Island, and Tinian Island) for independent fitting tests. Moreover, a "pre-training and few-shot two-stage training" paradigm was used to verify the cross-island generalization capability of the proposed model. The results demonstrate that the proposed model achieves significantly higher inversion accuracy than other baseline models. Furthermore, it exhibits excellent generalization performance on new islands through few-shot fine-tuning, providing a more efficient and reliable technical approach for practical bathymetric mapping in shallow-sea areas.

Key words: Bathymetric Mapping, Internimage, Upernet, satellite derived bathymetry, Generalization, Deep Learning