首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于MongoDB的矢量空间数据云存储与处理系统
引用本文:雷德龙,;郭殿升,;陈崇成,;巫建伟,;吴小竹.基于MongoDB的矢量空间数据云存储与处理系统[J].地球信息科学,2014(4):507-516.
作者姓名:雷德龙  ;郭殿升  ;陈崇成  ;巫建伟  ;吴小竹
作者单位:[1]福州大学福建省空间信息工程研究中心,空间数据挖掘与信息共享教育部重点实验室,福州350002; [2]南卡罗来纳大学,美国南卡罗来纳州SC29208
基金项目:国家科技支撑计划项目(2013BAH28F02); 福建省“百人工程”计划(033091); 福建省科技计划项目(2010I0008); 欧盟第七框架国际合作项目(247608)
摘    要:近年来,海量空间数据存储与处理日益成为地理信息科学领域的研究热点。其中,矢量空间数据更因其较高的复杂性,成为该类研究的重点问题。本文基于文档数据库,探究了多用户数据存储、矢量空间数据存储、海量矢量空间数据并行处理等问题,给出了存储和处理矢量空间数据的方法。在三层式云存储架构基础上,设计并实现了矢量空间数据云存储与处理系统VectorDB,达到了海量矢量空间数据的高效存储与处理要求。系统采用文档数据库MongoDB存储矢量空间数据,使用OGR库实现不同格式矢量空间数据的转换与存储,并用Hadoop对数据库中的数据进行并行计算,以及用mongo-hadoop作为MongoDB与Hadoop之间的连接器。通过实验对比了VectorDB与PostGIS的矢量空间数据读写性能,并分析了VectorDB与MongoDB在海量数据并行处理性能方面的差异。结果表明:VectorDB具有更好的读取性能和海量数据处理性能,适合多用户不同格式、不同属性矢量空间数据存储,对海量矢量数据存储与处理问题具有参考价值。

关 键 词:矢量数据  NoSQL数据库  MongoDB  云存储  Hadoop  多用户

Vector Spatial Data Cloud Storage and Processing Based on MongoDB
Institution:LEI Delong, GUO Diansheng, CHEN Chongcheng, WU Jianwei, WU Xiaozhu (1. Laboratory of Spatial Data Mining and Information Sharing of Ministry of Education, Spatial Information Research Centre of Fujian, Fuzhou University, Fuzhou 350002, China 2. University of South Carolina, South Carolina SC 29208, USA)
Abstract:With the rapidly growing volume and complexity of spatial data, efficient storage and processing of massive geo-spatial data have become urgent research problems in GIScience and related fields. Vector spatial data storage and processing is particularly challenging due to its complexity in data representation, access and analysis. In this paper, we present an approach and a system for cloud-based vector data storage and analysis,with the ability to support multi-user access and parallel processing. Our system, named VectorDB, extends MongoDB(a document-oriented NoSQL database system) and integrates the Hadoop framework for parallel spatial data processing and analysis. With a three-layered browser-server architecture, the system consists of a suite of modules for data storage, conversion, query, and analysis. The OGR Simple Features Library is integrated to perform data conversions between MongoDB and various formats of vector spatial data. We use the MongoDB Connector for Hadoop(mongo-hadoop) to transfer data between MongoDB and Hadoop. An experiment is carried out using five physical servers to compare the performance of VectorDB and PostGIS for vector data reading,writing, and query. Preliminary results indicate that, although VectorDB is slightly slower in data writing, it gains significant power for data access and spatial query over PostGIS. We also compared VectorDB and MongoDB for massive vector data processing. Results show that VectorDB has a better performance than MongoDB in massive vector data processing. VectorDB is different from the traditional relational spatial database, and it can support dynamic schema and thus is much more flexible and effective for storing, accessing, and analyzing various vector spatial data models and data formats. Our approach and implemented system will be useful for a variety of applications that need to store and access vector spatial data in a cloud environment.
Keywords:vector  NoSQL Database  MongoDB  cloud storage  Hadoop  multiple-user
本文献已被 CNKI 维普 等数据库收录!
点击此处可从《地球信息科学》浏览原始摘要信息
点击此处可从《地球信息科学》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号