基于土壤温度与地表净辐射特征的浅层土壤水分动态机器学习预测模型
DOI:
CSTR:
作者:
作者单位:

中国科学院南京土壤研究所

作者简介:

通讯作者:

中图分类号:

S271;S273.1

基金项目:


A Machine Learning Prediction Model for Shallow Soil Moisture Dynamics Based on Soil Temperature and Surface Net Radiation Characteristics
Author:
Affiliation:

Institute of Soil Science,Chinese Academy of Sciences

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    精准获取土壤水分动态对水文过程模拟、精准农业实施及气候变化响应研究具有重要意义。本研究以5 cm深度土壤含水量为预测目标,基于0-10 cm多个深度(0、1、3、5、10 cm)的土壤温度、地表净辐射的高频原位监测数据,结合谐波分析与统计分析,提取相位、振幅、日均值、极值等具有物理意义的特征参数,构建多维特征数据集;系统性的对比了岭回归、Lasso 回归、支持向量机、随机森林、梯度提升树及极端梯度提升(XGBoost)6种机器学习模型的预测性能,并采用以时间顺序划分数据集的方式评估模型泛化能力。结果表明:在预测精度上,树集成方法整体优于线性模型与核函数方法,其中XGBoost模型表现最佳,其独立测试集决定系数(R2)达0.565;进一步比较不同输入数据集发现,基于物理过程提取的日尺度特征参数数据集在预测精度上显著优于使用半小时分辨率观测值的数据集,表明特征提取可有效过滤噪声、再现水热过程耦合机制;特征重要性分析显示,表层土壤(0-3 cm)的温度动态特征对模型预测贡献较大,其重要性高于目标层(5 cm),印证了土壤水热耦合的垂向传递规律;仅依赖时间特征的模型预测完全失效,表明季节性背景须与反映日际波动的物理特征协同才能实现有效预测。本研究提出的“物理特征提取与XGBoost相结合”的方案,提升了土壤水分时序预测的泛化能力与机理解释性,为利用易观测物理量反演浅层土壤水分提供了方法参考。

    Abstract:

    Accurate acquisition of soil moisture dynamics is crucial for hydrological process modeling, precision agriculture implementation, and climate change impact studies. This study aimed to predict soil water content at a 5 cm depth. Utilizing high-frequency in-situ monitoring data of layered soil temperature within 0-10 cm depths (0, 1, 3, 5, 10 cm) and surface net radiation, a multi-dimensional feature dataset was constructed by extracting physical features such as phase, amplitude, and daily temperature range through harmonic analysis and statistical analysis. A systematic comparison of the predictive performance of six machine learning models: Ridge Regression, Lasso Regression, Support Vector Machines, Random Forests, Gradient Boosting Decision Tree, and eXtreme Gradient Boosting (XGBoost). Model generalization capabilities were evaluated by dividing the dataset into time-series segments. The results indicated that the XGBoost model performed relatively well, achieving a coefficient of determination (R2) of 0.565 on the independent test set, outperforming linear models and the Support Vector Machine. Further comparison of different input datasets reveals that the dataset of daily-scale feature parameters extracted from physical processes significantly outperforms the dataset using half-hourly resolution observations in terms of prediction accuracy, indicating that feature extraction can effectively filter noise and focus on the core mechanism of hydrothermal coupling. Feature importance analysis revealed that thermal dynamic features from the surface soil layers (0-3 cm) were key drivers for the model, with higher importance than those from the target layer (5 cm), confirming the vertical transmission pattern of soil hydrothermal coupling. Models relying solely on temporal features failed completely, demonstrating that the seasonal background provided by time information must work in synergy with physical features reflecting diurnal fluctuations to achieve effective prediction. The proposed approach of combining "physical feature extraction and XGBoost" in this study enhances the generalizability and interpretability of soil moisture time series prediction, providing a methodological reference for deriving shallow soil moisture from easily observable physical quantities.

    参考文献
    相似文献
    引证文献
引用本文
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2026-01-14
  • 最后修改日期:2026-02-02
  • 录用日期:2026-02-12
  • 在线发布日期:
  • 出版日期:
文章二维码