首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种基于主题爬行模式的地理信息分布式检索方法
引用本文:王小康,邓硕,吴博,李景文.一种基于主题爬行模式的地理信息分布式检索方法[J].测绘与空间地理信息,2015(4):96-97,101.
作者姓名:王小康  邓硕  吴博  李景文
作者单位:1. 桂林理工大学测绘地理信息学院,广西桂林,541004;2. 桂林理工大学测绘地理信息学院,广西桂林541004; 桂林理工大学广西空间信息与测绘重点实验室,广西桂林541004
基金项目:广西自然科学基金重点项目(2014GXNSFDA118032)资助
摘    要:当前网络中地理信息以几何形式递增,为了高效地从海量网络信息中检索出高质量的地理信息,本文提出了一种基于主题爬行的地理信息分布式检索方法。本文采用面向对象的方法将网络地理数据按照四元组的要求进行分解和组织,对地物实体的主题文本特征、地理空间特征、时间维特征等相关信息进行封装,建立四元组实体对象,实现了地理信息数据的相互集成与组织。引入MapReduce模式的并行处理机制完成对网页中地理信息数据的优化存储与索引,并且通过分别计算网页文本、地理文本与查询关键词的主题相关性对爬取的网页进行有序的排列,从而提供快捷、高效的地理信息主题查询。

关 键 词:MapReduce  主题爬行  地理信息  主题相关度

AGeographic Information Retrieval Methods Based on the Mode of Distributed Crawling
WANG Xiao-kang , DENG Shuo , WU Bo , LI Jing-wen.AGeographic Information Retrieval Methods Based on the Mode of Distributed Crawling[J].Geomatics & Spatial Information Technology,2015(4):96-97,101.
Authors:WANG Xiao-kang  DENG Shuo  WU Bo  LI Jing-wen
Institution:WANG Xiao-kang;DENG Shuo;WU Bo;LI Jing-wen;School of Institute of Surveying and Mapping Geographic Information,Guilin University of Technology;Guangxi Key Laboratory of Spatial Information and Surveying,Guilin University of Technology;
Abstract:Current geographic information network in order to geometric form is increasing, in order to efficiently retrieve from the massive network information of high quality geographic information, this paper proposes a geographic information retrieval methods based on distributed crawling crawling body.In this paper, by using object oriented method the network geographic data are decom-posed and organization in accordance four tuples of objects, , geographic features, time Victor syndrome and other related information package, the establishment of four tuple entity object, realization of the geographic information data integration and organization, en-hance organizational efficiency index of the source data.Parallel processing mechanism into MapReduce mode to accomplish optimal storage of geographic information data in web and retrieval, and by calculating the page relevance text, geography text with the query keywords are orderly arranged for crawling web pages, so as to provide geographic information subject fast, efficient query.
Keywords:MapReduce  topic crawling  geographic information  topic relevance
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号