首页 | 本学科首页   官方微博 | 高级检索  
     

基于随机森林算法的土壤有机质含量高光谱检测
引用本文:包青岭,丁建丽,王敬哲,蔡亮红. 基于随机森林算法的土壤有机质含量高光谱检测[J]. 干旱区地理, 2019, 42(6): 1404-1414. DOI: 10.12118/j.issn.1000-6060.2019.06.20
作者姓名:包青岭  丁建丽  王敬哲  蔡亮红
作者单位:新疆大学资源与环境科学学院智慧城市与环境建模自治区普通高校重点实验室,新疆乌鲁木齐830046;绿洲生态教育部重点实验室,新疆乌鲁木齐830046
基金项目:新疆自治区重点实验室专项基金(2016D03001);新疆自治区科技支疆项目(201591101)
摘    要:为了探讨既能保留光谱信息又能准确对土壤有机质含量进行快速检测。以新疆南部渭干河—库车绿洲内部73个土壤样点及其对应的高光谱数据为研究对象,采用小波变换与数学变换进行光谱数据预处理,分析各小波分解重构光谱在不同有机质含量与不同土壤类型下光谱曲线差异,通过相关分析确定最大小波分解层并筛选敏感波段,结合灰色关联分析与随机森林预测分类模型对各小波分解特征光谱进行重要性分析,最后基于最优特征光谱建立多元线性预测模型并进行分析。结果表明:(1) 耕作土壤与林地土壤光谱曲线波段相较盐渍土壤和荒漠土壤光谱曲线变化较为平缓,同时在水分吸收波段处,盐渍土壤光谱曲线吸收谷最深。(2) 小波变换分解光谱与土壤有机质含量的相关性随着分解层数增加呈现先减后增趋势,在第6层中,特征光谱曲线与敏感波段数量变化趋于稳定,确定为小波变换最大分解层。(3) 随机森林模型相比灰色关联分析对于各小波分解层因子的筛选符合预期,按照对土壤有机质含量影响从高到低排序为L3-(1/LgR)′、L4-(1/LgR)′、L6-(1/LgR)′、L5-(1/LgR)′、L2-(1/LgR)′、L0-1/LgRL1-1/LgR。(4)在小波分解光谱中,中频范围特征光谱对干旱区土壤有机质含量的估测能力优于高频与低频范围特征光谱,同时基于L-MC建立的模型精度最高。研究表明:基于机器学习分类方法结合小波分解的土壤光谱有机质含量监测,可以有效的减少噪声波段干扰,并提高特征波段的分类预测精度。

关 键 词:高光谱  土壤有机质含量  小波变换  随机森林
收稿时间:2019-03-20

Hyperspectraldetection of soil organic matter content based on random forest algorithm
BAO Qing-ling,DING Jian-li,WANG Jing-zhe,CAI Liang-hong. Hyperspectraldetection of soil organic matter content based on random forest algorithm[J]. Arid Land Geography, 2019, 42(6): 1404-1414. DOI: 10.12118/j.issn.1000-6060.2019.06.20
Authors:BAO Qing-ling  DING Jian-li  WANG Jing-zhe  CAI Liang-hong
Affiliation:1 Key Laboratory of WisdomCity and Environmental Modeling Department of Education,XinjiangUniversity,Urumqi 830046, Xinjiang,China;2.KeyLaboratory of Oasis Ecology,Xinjiang University,Urumqi 830046,Xinjiang,China
Abstract:Inorder to explore how to retain the spectral information and accurately detectthe soil organic matter content, this paper investigated the possibility ofusing spectral processing techniques such as wavelet decomposition and randomforest method to estimate the soil organic matter content and analyze thespectral curves of different wavelet decomposition reconstruction spectra indifferent soil types using spectroscopy data. This study took the soil samplesas the study objects which were collected in Weigan River Oasis of Kuqa County,a typical arid area oasis at north-central of the Tarim Basin in Xinjiang, China.The soil organic matter content of these samples was determined. The ASD FieldSpec FR was used to measure the soil samples’spectrum,and the spectral data were preprocessed by wavelet decomposition andmathematical transformation. Discrete wavelet transform (DWT) has the functionof multi-scale analysis, which can transform multi-scale wavelet decompositionof soil near infrared spectroscopy data to analyze the spectral curves ofdifferent wavelet decomposition reconstruction spectra in different organicmatter content and different soil types. The correlation analysis was used todetermine the maximum wavelet decomposition layer and filter sensitive bands. Finally,a multi-variant linear prediction model about soil organic matter content wasestablished based on the optimal characteristic spectrum produced by combininggrey correlation analysis, random forest method to analyze the significance ofdifferent wavelet decomposition characteristic spectra. The results showed asfollows: (1) The spectral reflectance of each wavelet decomposed is decreasedwith the increase of organic matter content. At the same time, the spectralcurve of cultivated soil and forest soil shows a more gradual change than thatof the saline soil and desert soil. (2) The correlation between thedecomposition spectrum of the wavelet transform and the soil organic mattercontent is decreased first and then increased with the increase of thedecomposition layer. In the sixth layer, the characteristic spectral curve andthe number of sensitive bands tend to be stable, which helps to determine thislayer as the largest decomposition layer of wavelet transform. (3) Comparedwith the gray correlation analysis, the random forest model is in line with theexpectation for screening the factors of wavelet decomposition at each layer, andit comes a list of descending order according to the impact on soil organicmatter content as follows: L3-(1/LgR)′、L4-(1/LgR)′、L6-(1/LgR)′、L5-(1/LgR)′、L2-(1/LgR)′、L0-1/LgRL1-1/LgR. (4) Combining all SOM estimation models for statisticalanalysis,the model based on L-MC has the highest accuracy. Theresearch shows that the monitoring of soil spectral organic matter contentbased on machine learning classification method combined with waveletdecomposition can effectively reduce noise band interference and improve theclassification prediction accuracy of feature bands. The random forestprediction classification model has significant advantages over the traditionallinear prediction classification model, such as gray correlation analysis. Therandom forest model not only outperforms the grey correlation analysis instatistical results, but also shows better reliability and stability inpredicting ability. The results could provide scientific reference and supportfor the study of soil nutrients in the arid zone and local precisionagriculture.
Keywords:
本文献已被 万方数据 等数据库收录!
点击此处可从《干旱区地理》浏览原始摘要信息
点击此处可从《干旱区地理》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号