Journal of Tropical Oceanography ›› 2024, Vol. 43 ›› Issue (5): 190-202.doi: 10.11978/2023172CSTR: 32234.14.2023172

• Oceanographic Research and Observation • Previous Articles    

Rule set and multilayer perceptron based quality control method for Argo temperature data*

QI Huandong1,2(), ZHU Cheng2, LI Xuchun2, JING Xindi2, SONG Derui2,3()   

  1. 1. College of Information Technology (Shanghai Ocean University), Shanghai 201306, China
    2. National Marine Environmental Monitoring Center, Dalian 116023, China
    3. School of Geographical Sciences (Liaoning Normal University), Dalian 116029, China
  • Received:2023-11-21 Revised:2024-01-08 Online:2024-09-10 Published:2024-10-10
  • Supported by:
    National Key Research and Development Program of China(2021YFF0704000); National Key Research and Development Program of China(2022YFC3106100)

Abstract:

The ocean temperature data plays a crucial role in global ocean observation and climate research. Quality control is essential to ensure the reliability of these data. However, the current recall rate of anomalous data in large datasets is unsatisfactory. This paper proposes a quality control method based on a rule set and multilayer perceptron (RS-MLP), using Argo temperature data. Initially, thirteen machine learning models are compared and analyzed to select the optimal model. Subsequently, a rule set consisting of six rule-based quality control check modules is designed. Finally, the RS-MLP method is constructed by integrating the rule set with the optimal machine learning model, and its performance is evaluated using Argo data from the South China Sea region. The results show that the RS-MLP achieves good performance with true positive rate (TPR), true negative rate (TNR), and area under the receiver operating characteristic (ROC) curve (AUC) reaching 94%, 96%, and 95% respectively in a test set of 351746 temperature data points. The recall rate of anomalous data at different depth levels is stable, demonstrating excellent quality control performance.

Key words: Argo, temperature, machine learning, quality control