首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于零膨胀贝叶斯时空建模的精细尺度伪基站垃圾短信分析方法
引用本文:史雨飞,陶海燕,卓莉.基于零膨胀贝叶斯时空建模的精细尺度伪基站垃圾短信分析方法[J].地球信息科学,2022,24(11):2089-2101.
作者姓名:史雨飞  陶海燕  卓莉
作者单位:中山大学地理科学与规划学院,广东省公共安全与灾害工程技术研究中心/广东省城市化与地理环境空间模拟重点实验室,广州 511400
基金项目:国家自然科学基金项目(41971372);广东省自然科学基金项目(2020A1515010680)
摘    要:伪基站垃圾短信活动存在显著的时空自相关和异质性现象,采用时空分析方法可以精准把握伪基站的移动规律和行为模式,为相关部门综合施策、探索长效管理机制提供科学的依据。然而,精细尺度下垃圾短信数据集中过多零数据导致的零膨胀问题,使当前的时空分析方法并不适用。为此,本文以2017年2月23日至2017年4月26日北京市色情服务类垃圾短信数据为例,构建零膨胀贝叶斯时空模型,不仅可以解决零膨胀问题,而且可以综合分析伪基站的空间、时间、时空效应以及外部影响因素,以识别伪基站活动的相对风险高值区、探究城市建成环境对其的影响。结果发现:在数据集中零值占比高达83.46%的情况下,基于零膨胀泊松分布的贝叶斯时空模型具有更好的拟合精度;色情服务类垃圾短信空间上的高风险区域主要聚集在北京市主城区的东部,风险值最高的区域属于朝阳区;周四、五、六风险趋势会相对增加,且18:00至次日02:00为高发时期;伪基站一般18:00从主城区的西南部开始向东北方向移动,凌晨01:00聚集在朝阳区西北部区域;商务住宅与住宿服务类城市环境与垃圾短信呈正相关,餐饮服务与派出所类城市环境呈负相关。研究表明,零膨胀贝叶斯时空模型为精细尺度的伪基站垃圾短信研究,提供了一个可以有效整合多个时间截面的分析数据、充分考虑伪基站的时空关系和外部影响因素并解决数据中存在零过多现象的方法,为发展和验证伪基站的环境犯罪学理论提供了一种重要的分析方法。

关 键 词:伪基站  贝叶斯时空模型  高风险区域  时空交互  精细尺度  零膨胀问题  色情服务类垃圾短信  
收稿时间:2022-04-19

Fine-scale Pseudo Base Station Spam Message Analysis Method based on Zero-inflated Bayesian Spatiotemporal Modeling
SHI Yufei,TAO Haiyan,ZHUO Li.Fine-scale Pseudo Base Station Spam Message Analysis Method based on Zero-inflated Bayesian Spatiotemporal Modeling[J].Geo-information Science,2022,24(11):2089-2101.
Authors:SHI Yufei  TAO Haiyan  ZHUO Li
Institution:School of Geography and Planning, Guangdong Provincial Engineering Research Center for Public Security and Disasters/Guangdong Provincial Key Laboratory of Urbanization and Spatial Simulation of Geographic Environment, Sun Yat-sen University, Guangzhou 511400, China
Abstract:There are significant spatiotemporal autocorrelation and heterogeneity in spam message activities of pseudo base stations. Using spatiotemporal analysis method can accurately grasp the movement law and behavior pattern of pseudo base stations, which provides a scientific basis for relevant departments to formulate comprehensive policies and explore long-term management mechanism. However, the problem of zero inflation caused by excessive zero data in the spam SMS data set at the fine scale makes the spatiotemporal analysis method not applicable. In this paper, using the Beijing municipal erotic service spam message data from February 23 to April 26, 2017 as an example. we constructed the zero inflation Bayesian spatiotemporal model, which can not only solve the problem of zero inflation, but also comprehensively analyze space, time, space and time effect, and external influence factors of pseudo base stations. Based on this, we further identified the high risk areas of pseudo base station activity and explored the influence of urban built environment. The results show that the Bayesian spatiotemporal model based on zero-inflation Poisson distribution has a higher fitting accuracy when the ratio of zero values in the dataset is 83.46%. The high risk areas of pornographic service spam messages are mainly concentrated in the eastern part of the main urban area of Beijing, and the Chaoyang District has the highest risk value. The risk increases relatively on Thursday, Friday, and Saturday, and the high-risk period is from 6 pm one day to 2 pm the next. The pseudo base station generally starts moving from the southwest to the northeast of the main city at 6 pm and gathers in the northwest of Chaoyang District at 1 am. There is a positive correlation between the urban environment of commercial residence and accommodation service and the spam message, while there is a negative correlation between the urban environment of catering service and police stations. The zero-inflation Bayesian spatiotemporal model for analyzing fine scale pseudo base station spam messages can effectively integrate multiple time cross section data, take into account the external factors and the relationship between time and space of pseudo base stations, and solve the problem of too much zero data in the dataset. Our study provides an important analysis method for the development and validation of pseudo base station environmental criminology theory.
Keywords:pseudo base station  Bayesian spatiotemporal model  high risk area  spatiotemporal interactions  fine scale  zero-inflation problem  pornographic service spam messages  
点击此处可从《地球信息科学》浏览原始摘要信息
点击此处可从《地球信息科学》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号