Application of Machine Learning Classification Algorithms in Rainfall-induced Landslide Forecasting
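Cite this article: Liu Haizhi, Xu Hui, Bao Hongjun, Xu Wei, Yan Xufeng, Lu Heng, Xu Chengpeng. Application of Machine Learning Classification Algorithms in Rainfall-induced Landslide Forecasting[J]. Journal of Applied Meteorological Science (应用气象学报), 2022, 33(3): 282-292.
Authors: Liu Haizhi (刘海知), Xu Hui (徐辉), Bao Hongjun (包红军), Xu Wei (徐为), Yan Xufeng (闫旭峰), Lu Heng (鲁恒), Xu Chengpeng (徐成鹏)
Affiliation: 1. National Meteorological Center, Beijing 100081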
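Abstract: To meet the operational need in meteorological disaster early warning for an objective description of the uncertainty of rainfall-induced landslide occurrence, national landslide data from 2014 to 2020 and multi-source merged precipitation analysis data are used to build a regional probability forecasting model for rainfall-induced landslides based on machine learning classification algorithms. The model is built through four key steps (sample construction, model training, parameter optimization, and forecast output) and is used to explore whether different types of machine learning classification algorithms can identify the rainfall processes that trigger landslides. The results show that, in the algorithm evaluation, the linear discriminant analysis algorithm has the highest accuracy and the best generalization ability, followed by the logistic regression algorithm and then the nearest-neighbor algorithm. In the forecast experiments, the linear discriminant analysis, logistic regression, and nearest-neighbor algorithms can extract and learn the conditional features of rainfall-induced landslides and show some ability to identify landslide-triggering rainfall processes. The nearest-neighbor and logistic regression algorithms produce relatively large areas of high forecast probability, which tends to cause false alarms. The linear discriminant analysis algorithm captures local rainfall information better, but it outputs low-probability forecasts over too large an area outside the rainfall center.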

Keywords: landslide; influencing factors; machine learning; classification algorithms
Received: 2022-01-25

Application of Machine Learning Classification Algorithm to Precipitation-induced Landslides Forecasting
Affiliation: 1. National Meteorological Center, Beijing 100081; 2. CMA-HHU Joint Laboratory for Hydro Meteorological Studies, Beijing 100081; 3. China Institute of Geo-Environmental Monitoring, Beijing 100081; 4. College of Water Resource and Hydropower, Sichuan University, Chengdu 610065
Abstract: To address the practical need for an objective description of the uncertainty of rainfall-induced landslides, and the existing problems of single warning indicators and subjective forecasting methods in meteorological disaster early warning operations, landslide disaster data from 2014 to 2020 and multi-source merged precipitation analysis data are used to construct a regional probability forecasting model for rainfall-induced landslides. Machine learning classification algorithms are implemented through key steps such as sample construction, model training, parameter optimization and forecast output, to explore the feasibility of different types of algorithms in identifying landslide-triggering rainfall processes. The training sample set is built from the positive samples, with negative samples obtained by sampling under spatial-temporal constraints. Evaluation of the machine learning classification algorithms on this sample set shows that the linear discriminant analysis algorithm has the highest accuracy (0.863) and the best generalization ability (area under the receiver operating characteristic curve of 0.886) without over-fitting, followed by the logistic regression algorithm and the K-nearest neighbor algorithm. In the probabilistic forecasting test on cases of rainfall-induced landslides in 2021, all three algorithms can extract and learn the conditional features and have some ability to identify the rainfall processes that induce landslides. The K-nearest neighbor and logistic regression algorithms produce relatively large high-value areas in their probability forecasts, which makes them prone to false alarms. The probability forecast of the linear discriminant analysis algorithm has a more tightly confined high-value area and extracts local rainfall information better, but it outputs unnecessary low-value probability forecasts outside the rainfall center.
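The abstract does not spell out the spatial-temporal limitation used to draw negative samples. One plausible reading, sketched here purely as an assumption, is rejection sampling: a random (location, date) candidate is kept as a negative only if it is not close to any recorded landslide in both space and time. The thresholds `min_km` and `min_days`, the function names, the bounding box and the example positive samples are all hypothetical and are not taken from the paper.

```python
import math
import random
from datetime import date, timedelta


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points, in kilometres."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * 6371.0 * math.asin(math.sqrt(a))


def is_valid_negative(cand, positives, min_km, min_days):
    """Assumed rule: reject a candidate that is close to a recorded
    landslide in both space and time."""
    lat, lon, day = cand
    for plat, plon, pday in positives:
        near_space = haversine_km(lat, lon, plat, plon) < min_km
        near_time = abs((day - pday).days) < min_days
        if near_space and near_time:
            return False
    return True


def sample_negatives(positives, n, bbox, start, end,
                     min_km=20.0, min_days=10, seed=0):
    """Draw up to n negative (lat, lon, date) samples by rejection sampling."""
    rng = random.Random(seed)
    lat_min, lat_max, lon_min, lon_max = bbox
    span_days = (end - start).days
    negatives = []
    for _ in range(100 * n):  # cap attempts so the loop always terminates
        if len(negatives) == n:
            break
        cand = (rng.uniform(lat_min, lat_max),
                rng.uniform(lon_min, lon_max),
                start + timedelta(days=rng.randint(0, span_days)))
        if is_valid_negative(cand, positives, min_km, min_days):
            negatives.append(cand)
    return negatives


# Hypothetical positive samples: (latitude, longitude, date of landslide).
positives = [(30.0, 103.0, date(2020, 7, 15)),
             (26.5, 106.7, date(2019, 6, 10))]
negatives = sample_negatives(positives, n=50,
                             bbox=(22.0, 34.0, 98.0, 110.0),
                             start=date(2014, 1, 1), end=date(2020, 12, 31))
print(len(negatives), negatives[0])
```

A stricter variant could reject candidates that are near a landslide in either space or time; which rule the authors applied cannot be determined from the abstract.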
The rainfall-induced landslide probability forecasting model based on machine learning classification algorithms comprehensively considers the coupling effect of underlying surface factors and rainfall factors, and is therefore better than the commonly used critical threshold model, which assumes that the occurrence of landslides in the forecast area is related only to rainfall. The application results show that the machine learning classification model makes up for the shortcoming of existing forecasting models, which rarely reflect the influence of the surface environment, so it is an important way to improve the performance of landslide forecasting and warning.
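The contrast between a critical threshold model and a classifier that couples rainfall with an underlying surface factor can be illustrated with a small sketch. It uses logistic regression, one of the three algorithms named in the abstract, fitted by plain gradient descent on synthetic data. The two scaled features (`rain`, `susc`), the synthetic ground truth, the critical value 0.6 and all function names are assumptions for illustration only; nothing here reproduces the authors' features, data or results.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def make_samples(n, seed=0):
    """Synthetic samples: (rain, susceptibility, label), features in [0, 1]."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        rain = rng.random()
        susc = rng.random()
        # Assumed ground truth: landslide odds rise with both rainfall
        # and surface susceptibility.
        p_true = sigmoid(6.0 * rain + 6.0 * susc - 7.0)
        samples.append((rain, susc, 1 if rng.random() < p_true else 0))
    return samples


def fit_logistic(samples, lr=0.5, epochs=1000):
    """Fit logistic regression by batch gradient descent on the log loss."""
    w_rain = w_susc = bias = 0.0
    n = len(samples)
    for _ in range(epochs):
        g_rain = g_susc = g_bias = 0.0
        for rain, susc, y in samples:
            err = sigmoid(w_rain * rain + w_susc * susc + bias) - y
            g_rain += err * rain
            g_susc += err * susc
            g_bias += err
        w_rain -= lr * g_rain / n
        w_susc -= lr * g_susc / n
        bias -= lr * g_bias / n
    return w_rain, w_susc, bias


def landslide_probability(model, rain, susc):
    w_rain, w_susc, bias = model
    return sigmoid(w_rain * rain + w_susc * susc + bias)


def threshold_warning(rain, critical_rain=0.6):
    """Critical threshold model: the decision depends on rainfall only."""
    return rain >= critical_rain


model = fit_logistic(make_samples(300))

# Two grid cells with identical rainfall but different surface conditions.
p_steep = landslide_probability(model, rain=0.8, susc=0.9)
p_flat = landslide_probability(model, rain=0.8, susc=0.1)
print(threshold_warning(0.8), threshold_warning(0.8))  # same decision for both cells
print(round(p_steep, 3), round(p_flat, 3))
```

The threshold model cannot distinguish the two cells because it sees only rainfall, while the fitted classifier assigns the more susceptible cell a higher probability. In practice a library implementation would normally replace the hand-written gradient descent.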