首页 | 本学科首页   官方微博 | 高级检索  
     

基于图像语义分割的动态场景下的单目SLAM算法
引用本文:盛超,潘树国,赵涛,曽攀,黄砺枭. 基于图像语义分割的动态场景下的单目SLAM算法[J]. 测绘通报, 2020, 0(1): 40-44. DOI: 10.13474/j.cnki.11-2246.2020.0009
作者姓名:盛超  潘树国  赵涛  曽攀  黄砺枭
作者单位:东南大学仪器科学与工程学院, 江苏 南京 210096
基金项目:江苏省测绘地理信息科研项目(JSCHKY201808);国家重点研发计划(2016YFB0502101);国家自然科学基金(41574026;41774027)
摘    要:在恢复场景信息和相机运动时,传统的SLAM算法是基于静态环境假设的。场景中的动态物体会降低算法的稳健性和最终的定位精度。本文提出将基于深度学习的图像语义分割技术与传统的视觉SLAM算法结合,以减少动态物体对定位结果的干扰。首先,构建有监督的卷积神经网络对输入图像中的动态物体进行分割,获得语义图像;然后,从原始图像中提取特征点,并根据语义图像剔除动态物体特征点,保留静态物体特征点;最后,利用静态物体特征点采用基于特征点的单目视觉SLAM算法对相机运动进行跟踪。在ApolloScape自动驾驶数据集上的试验表明,与传统方法相比,本文算法在动态场景中定位精度提升约17%。

关 键 词:单目视觉SLAM  动态物体  卷积神经网络  语义分割  深度学习  
收稿时间:2019-05-12
修稿时间:2019-07-02

Monocular SLAM system in dynamic scenes based on image semantic segmentation
SHENG Chao,PAN Shuguo,ZHAO Tao,ZEN Pan,HUANG Lixiao. Monocular SLAM system in dynamic scenes based on image semantic segmentation[J]. Bulletin of Surveying and Mapping, 2020, 0(1): 40-44. DOI: 10.13474/j.cnki.11-2246.2020.0009
Authors:SHENG Chao  PAN Shuguo  ZHAO Tao  ZEN Pan  HUANG Lixiao
Affiliation:School of Instrument Science and Engineering, Southeast University, Nanjing 210096, China
Abstract:The traditional visual SLAM algorithm is based on the static environment assumption when recovering scene information and camera motion. The dynamic objects in the scene will reduce the robustness of the algorithm and the final positioning accuracy. In this paper, we propose to combine the image semantic segmentation based on deep learning method with the traditional visual SLAM to reduce the interference of dynamic objects on the positioning results. Firstly, a supervised convolutional neural network (CNN) is used to segment the dynamic objects in the input image to obtain the semantic image. Secondly, after extracting feature points from the original image, the feature points of dynamic objects are eliminated object feature points according to the semantic image, and the feature points of static objects are saved. Finally, the monocular SLAM method with points is used to track the camera motion based on the saved feature points. Experiments on the ApolloScape datasets show that compared with the traditional method, the proposed method improves the positioning accuracy in dynamic scenes by about 17%.
Keywords:monocular visual SLAM  dynamic objects  CNN  semantic segmentation  deep learning method  
本文献已被 CNKI 等数据库收录!
点击此处可从《测绘通报》浏览原始摘要信息
点击此处可从《测绘通报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号