首页 | 本学科首页   官方微博 | 高级检索  
     


A novel index to evaluate discretization methods:A case study of flood susceptibility assessment based on random forest
Authors:Xianzhe Tang  Takashi Machimura  Wei Liu  Jiufeng Li  Haoyuan Hong
Abstract:
The selection of a suitable discretization method(DM)to discretize spatially continuous variables(SCVs)is critical in ML-based natural hazard susceptibility assessment.However,few studies start to consider the influence due to the selected DMs and how to efficiently select a suitable DM for each SCV.These issues were well addressed in this study.The information loss rate(ILR),an index based on the informa-tion entropy,seems can be used to select optimal DM for each SCV.However,the ILR fails to show the actual influence of discretization because such index only considers the total amount of information of the discretized variables departing from the original SCV.Facing this issue,we propose an index,infor-mation change rate(ICR),that focuses on the changed amount of information due to the discretization based on each cell,enabling the identification of the optimal DM.We develop a case study with Random Forest(training/testing ratio of 7:3)to assess flood susceptibility in Wanan County,China.The area under the curve-based and susceptibility maps-based approaches were presented to compare the ILR and ICR.The results show the ICR-based optimal DMs are more rational than the ILR-based ones in both cases.Moreover,we observed the ILR values are unnaturally small(<1%),whereas the ICR values are obviously more in line with general recognition(usually 10%-30%).The above results all demonstrate the superiority of the ICR.We consider this study fills up the existing research gaps,improving the ML-based natural hazard susceptibility assessments.
Keywords:Machine learning  Natural hazards  Information change rate  Discretization method
本文献已被 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号