首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于优化随机森林回归模型的土壤盐渍化反演
引用本文:杨练兵,陈春波,郑宏伟,罗格平,尚白军,Olaf Hellwich.基于优化随机森林回归模型的土壤盐渍化反演[J].地球信息科学,2021,23(9):1662-1674.
作者姓名:杨练兵  陈春波  郑宏伟  罗格平  尚白军  Olaf Hellwich
作者单位:1. 中国科学院新疆生态与地理研究所荒漠与绿洲生态国家重点实验室,乌鲁木齐 8300112. 新疆维吾尔自治区遥感与地理信息系统应用重点实验室,乌鲁木齐 8300113. 中国科学院大学,北京 1000494. 中国科学院中亚生态与环境研究中心, 乌鲁木齐 8300115. 德国柏林工业大学计算机视觉和遥感研究所,柏林 10623
基金项目:国家自然科学基金项目(41877012);中国科学院一带一路项目(2018-YDYLTD-002);中国科学院特色研究所项目(TSS-2015-014-FW-1-3)
摘    要:当前应用于土壤盐分含量(Soil Salinity Content, SSC)反演的随机森林回归(Random Forests Regression, RFR)较少关注对模型精度影响较大的反演参数子集和模型参数的同步优化。本研究选择渭-库绿洲和奇台绿洲为实验区,基于Landsat-5 TM、SRTM、MOD11A2.006遥感数据构建反演参数。首先,利用弹性网络(Elastic Net, EN)筛选出反演参数子集,然后利用遗传算法(Genetic Algorithm, GA)和贝叶斯优化算法(Bayesian Optimization Algorithm, BOA)分别优化随机森林回归(Random Forests Regression, RFR)参数,建立反演参数子集和模型参数分步优化的RFR模型(EN-GA-RFR、EN-BOA-RFR)。建立利用GA和BOA分别同步优化反演参数子集和模型参数的RFR模型(GA-RFR、BOA-RFR)。在每个实验区,对比EN-GA-RFR、EN-BOA-RFR、GA-RFR、BOA-RFR的预测精度。最后分析每个实验区各类盐渍土的空间分布,并对2个实验区的反演参数进行对比分析。结果表明:每个实验区模型预测精度由高到低的排序均为BOA-RFR>GA-RFR>EN-BOA-RFR=EN-GA-RFR,整体上BOA的优化性能均好于GA;渭-库绿洲和奇台绿洲面积占比最大的盐渍土类型分别为盐渍土和中度盐渍土;反演参数对SSC的表征能力存在空间分异性。

关 键 词:土壤盐分含量  同步优化  随机森林回归  贝叶斯优化算法  遗传算法  弹性网络  反演参数子集  模型参数  
收稿时间:2020-11-26

Retrieval of Soil Salinity Content based on Random Forests Regression Optimized by Bayesian Optimization Algorithm and Genetic Algorithm
YANG Lianbing,CHEN Chunbo,ZHENG Hongwei,LUO Geping,SHANG Baijun,Olaf Hellwich.Retrieval of Soil Salinity Content based on Random Forests Regression Optimized by Bayesian Optimization Algorithm and Genetic Algorithm[J].Geo-information Science,2021,23(9):1662-1674.
Authors:YANG Lianbing  CHEN Chunbo  ZHENG Hongwei  LUO Geping  SHANG Baijun  Olaf Hellwich
Abstract:Random Forests Regression (RFR) is often used to inverse Soil Salinity Content (SSC)nowadays. However, the most important impact factors on the model accuracy such as the synchronization optimization of the inversion parameters subset and the model parameters have not been studied carefully in the applications of RFR. In this study, we selected Weiku Oasis and Qitai Oasis as experiment areas. The inversion parameters were constructed based on remote sensing data, including Landsat-5 TM, SRTM, and MOD11A2.006. Firstly, we applied Elastic Net (EN) to select a subset of the inversion parameters, developed Genetic Algorithm (GA) and Bayesian Optimization Algorithm (BOA) to optimize RFR, and established RFR models (EN-GA-RFR, EN-BOA-RFR) for stepwise optimization of inversion parameters subset and model parameters. Then we used GA and BOA to simultaneously optimize the inversion parameters subset and model parameters based on the combination methods of RFR, including GA-RFR and BOA-RFR methods. Furthermore, in each experiment area, we compared the prediction accuracy of EN-GA-RFR, EN-BOA-RFR, GA-RFR, and BOA-RFR. In this way, the spatial distributions of various saline soils in each experiment area were analyzed. The inversion parameters of the two experiment areas were also compared and analyzed. The results show that the order of model prediction accuracy in each study area from high to low is BOA-RFR>GA-RFR>EN-BOA-RFR=EN-GA-RFR. Overall, BOA had a better optimization performance than GA. Finally, the results show that the types of saline soils with the largest area in Ku Oasis and Qitai Oasis are saline soil and moderate saline soil, respectively. The inversion parameters have spatial differentiation in the characterization ability of SSC.
Keywords:soil salinity content  synchronization optimization  Random Forests Regression  Bayesian Optimization Algorithm  Genetic Algorithm  Elastic Net  inversion parameters subset  model parameters  
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《地球信息科学》浏览原始摘要信息
点击此处可从《地球信息科学》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号