Similar articles
20 similar articles found (search time: 62 ms)
1.
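Social media check-in data contain a wealth of information about user activities. Understanding the activity and behaviour types of social media users is important for exploring human mobility and behaviour patterns. This paper proposes a user activity classification method for Sina Weibo (hereafter Weibo) that combines image representation with spatiotemporal data classification techniques to identify the types of user activity represented by Weibo check-in data. First, based on point-of-interest (POI) attribute information, the user activities represented by Weibo check-in data are divided into six categories: dining, life services, campus, outdoor, entertainment, and travel. Then, based on a convolutional neural network and the K-nearest-neighbour classification method, the image scene information and spatiotemporal information in the check-in data are fused to classify Weibo users' activities. Experimental results show that the proposed method significantly improves the accuracy of Weibo user activity type recognition and provides more effective data support for the precise study of human behaviour.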

2.
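Understanding the state of urban pollution is the first step in controlling it. Social media contain data that reflect people's perceptions of their surroundings, which can help give a more direct picture of urban pollution. Taking Sina Weibo as an example, this paper proposes a scheme for obtaining urban-pollution-related data from social media, designs a social-media-based classification scheme and visualization method for urban pollution information, and, in the experimental section, analyses and discusses the results of the method using Beijing as a case study.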

3.
Least-squares collocation may be used to estimate spherical harmonic coefficients, their errors and their error correlations from GOCE data. Because of the extremely large number of data, this requires the method of Fast Spherical Collocation (FSC), which in turn requires that the data be gridded equidistantly on each parallel and have the same uncorrelated noise along each parallel. A consequence is that error covariances are zero except between coefficients of the same signed order (i.e., the same order and the same coefficient type, CC or SS). If the data distribution and the characteristics of the data noise are symmetric with respect to the equator, then, within a given order and coefficient type, the error covariances between coefficients whose degrees are of different parity also vanish. The deviation from this “ideal” pattern has been studied using data sets of second-order radial derivatives of the anomalous potential. Fewer than 17,000 points were used, with an equi-angular or an equal-area distribution, or located on a realistic GOCE orbit but close to the nodes of a grid. The data were considered with correlated or uncorrelated noise, and three different signal covariance functions were used. Grids both with and without data in the polar areas were used. Using the functionals associated with the data, error estimates of coefficients and error correlations between coefficients were calculated up to a maximal degree and order of 90. As expected, for the data distributions with no data in the polar areas the error estimates were larger than when the polar areas contained data. In all cases, only the error correlations between coefficients of the same order were significantly different from zero (up to 88%). Error correlations were significantly larger when the data had been regarded as having non-zero error correlations. The error correlations were also largest when the covariance function with the largest signal covariance distance was used. The main finding of this study was that correlated noise has a more pronounced impact on gridded data than on data distributed on a realistic GOCE orbit. This is useful information for methods using gridded data, such as FSC.
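
For orientation, the error covariances and error correlations mentioned in this abstract can be written in the standard textbook form of least-squares collocation. The notation below is an assumption chosen for illustration (it does not come from the abstract): s denotes the estimated coefficients, l the observations and D the noise covariance of the observations.

```latex
% Error covariance matrix of the estimated coefficients s
% C_{ss}: signal covariance of the coefficients
% C_{sl}: cross-covariance between coefficients and observations
% C_{ll}: signal covariance of the observations, D: noise covariance
\[
  E_{ss} = C_{ss} - C_{sl}\,\bigl(C_{ll} + D\bigr)^{-1} C_{sl}^{T}
\]

% Error correlation between coefficients i and j (a normalized covariance)
\[
  \rho_{ij} = \frac{E_{ij}}{\sqrt{E_{ii}\,E_{jj}}}, \qquad -1 \le \rho_{ij} \le 1
\]
```

In this form, an error correlation of "up to 88%" corresponds to a value of about 0.88 for the normalized quantity, and correlated noise enters only through the off-diagonal elements of D.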

4.
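Spatiotemporal big data from social networks are massive and highly dynamic, so effectively selecting spatiotemporal data for focused mining and analysis is crucial. Taking Weibo location check-in data as an example, this paper first studies the problem of effective selection for spatial clustering mining of spatiotemporal big data and, to address the uncertainty of social network spatiotemporal data, proposes an effective selection method for clustering mining. The method partitions spatiotemporal big data along dimensions such as space, time, or attributes. Exploratory spatial data analysis (ESDA) is then applied to the partitioned datasets to obtain those with clustering-mining potential. Finally, the proposed method is validated using hotspot detection of commercial districts from Weibo location check-in data in Wuhan. The results show that the effective selection method yields spatiotemporal datasets with higher mining efficiency and precision.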

5.
To evacuate residents in time, flood warning systems need rapid data processing algorithms that translate detailed numerical data into simple warnings and navigation aids. Although it is common to display warning messages by drawing red circles on a map embedded in a Web page to indicate the regions about to be inundated, this approach has some drawbacks. We propose an alternative that summarizes the warning messages based on landmarks, so that the messages can be short yet convey even more information. We have designed two approaches to output such messages: the nearest landmark approach and the threshold approach, which differ in how they determine which landmarks will be influenced by a given flooded region. Both methods are implemented, and we compare their performance on real and synthetic datasets. Experimental results show that the threshold approach usually takes less execution time than the nearest landmark approach. Its severity ranking of landmarks is also better suited to human behaviour.
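
The abstract does not define the two approaches in detail, so the following is only a minimal sketch of one plausible reading. It assumes that a flooded region is a circle in planar coordinates, that landmarks are points, and that "threshold" means a maximum distance from the region's edge; the function names and the toy coordinates are hypothetical.

```python
import math

def nearest_landmark(region_center, landmarks):
    """Return the name of the landmark closest to the flooded region's center."""
    return min(landmarks, key=lambda name: math.dist(region_center, landmarks[name]))

def landmarks_within_threshold(region_center, region_radius, landmarks, threshold):
    """Return every landmark within `threshold` of the region's edge, closest first."""
    hits = []
    for name, position in landmarks.items():
        # Distance from the landmark to the edge of the circle (0 if inside it).
        gap = max(0.0, math.dist(region_center, position) - region_radius)
        if gap <= threshold:
            hits.append((gap, name))
    return [name for gap, name in sorted(hits)]

# Hypothetical toy data in arbitrary planar units.
landmarks = {"school": (0.0, 0.0), "station": (3.0, 4.0), "hospital": (10.0, 0.0)}
flood_center, flood_radius = (3.0, 3.0), 1.0

closest = nearest_landmark(flood_center, landmarks)
affected = landmarks_within_threshold(flood_center, flood_radius, landmarks, threshold=3.5)
print(f"Flooding near {closest}; affected landmarks: {', '.join(affected)}")
```

Under these assumptions the first function names a single landmark per region, while the second yields a list ordered by proximity that could serve as a simple severity ranking.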

6.
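Traditional event detection methods designed for text data face challenges of efficiency and accuracy when processing social media data such as Weibo. Moreover, the rich location information in social media data often cannot be effectively identified and used, which inevitably affects event detection. Building on a summary of existing research, this paper defines a class of spatiotemporal hotspot events for Weibo check-in data and proposes a new Weibo spatiotemporal hotspot event detection method to identify them. Experiments on two real datasets show that the method can effectively mine hotspot events with spatiotemporal characteristics from massive Weibo data.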

7.
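The currency of POIs is crucial for location-based services, but traditional manual field surveys are inefficient and cannot keep POI data sufficiently current. Using the Weibo social network, which has a large number of participating users, as the data platform, this paper proposes a POI update method based on Weibo location check-in data. First, the check-in data are preprocessed to remove noise points whose semantics and spatial location are inconsistent; on this basis, a RANSAC-based georegistration method for check-in datasets is proposed to achieve reliable registration of check-in data with an existing geographic database. Then, spatial analysis and matching modelling are performed between the check-in dataset and the existing POI database; unmatched check-in records are validated, and valid new records are extracted and stored to update the POIs. Finally, a POI update experiment using Jiepang (街旁网) location check-in data for Wuhan shows that the method effectively detects new and vanished POIs, providing a new way to update POIs quickly and efficiently.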

8.
Simulating urban air quality trend surfaces using social media data   Total citations: 1 (self-citations: 0, citations by others: 1)
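In recent years, air pollution has become increasingly serious as cities have developed. At present, urban air quality monitoring in China relies mainly on air quality monitoring stations, but the number of stations is limited and air quality can vary considerably between different areas of a city, so stations alone cannot easily reveal air quality variations at all locations in a city. To address this, geotagged Sina Weibo data are used to analyse the correlation between Weibo posts on air-pollution-related topics and air quality index (AQI) data from monitoring stations, establish a functional relationship between the two, and propose a method for building an urban air quality trend surface. Experimental results show that the method not only expresses the relative air quality of different urban areas qualitatively but also presents urban air quality quantitatively and at fine granularity.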
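
The abstract does not say which functional form links the Weibo posts to the AQI readings. As a minimal sketch only, the code below assumes a simple linear relationship and uses made-up toy numbers; the variable names and values are hypothetical and are not results from the paper.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Hypothetical toy data: pollution-related posts near four stations and the
# AQI measured at those stations (illustrative values only).
post_counts = [2, 4, 6, 8]
station_aqi = [50, 90, 110, 150]

r = pearson_r(post_counts, station_aqi)
slope, intercept = fit_line(post_counts, station_aqi)

# Apply the fitted relationship to a location that has posts but no station.
estimated_aqi = slope * 5 + intercept
print(f"r = {r:.3f}, estimated AQI for 5 posts = {estimated_aqi:.0f}")
```

A trend surface would then be obtained by applying such a fitted relationship across many locations and interpolating between them; that step is not sketched here.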

9.
黄克标, 庞勇, 舒清态, 付甜. 《遥感学报》 (Journal of Remote Sensing), 2013, 17(1): 165-179
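Airborne and spaceborne LiDAR were combined to estimate forest aboveground biomass within GLAS (Geoscience Laser Altimeter System) footprints, and MODIS vegetation products and MERIS land cover products were used for continuous mapping of forest aboveground biomass in Yunnan Province. A total of 260 training samples from airborne LiDAR scans were used to build the spaceborne GLAS forest aboveground biomass estimation model, which had a coefficient of determination (R²) of 0.52 and a root mean square error (RMSE) of 31 Mg/ha. The results show that the total forest aboveground biomass of Yunnan Province is 1.272 billion t, with a mean of 94 Mg/ha. The estimated spatial distribution of forest aboveground biomass is consistent with actual conditions, and the total agrees with estimates based on forest inventory data, indicating that combining airborne LiDAR with spaceborne ICESat GLAS is a reliable way to estimate forest aboveground biomass over large regions.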
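
The two accuracy measures quoted above can be computed as in the sketch below. It assumes the common definition R² = 1 - SS_res / SS_tot, which may differ from the definition used in the paper, and the biomass values are made-up toy numbers rather than data from the study.

```python
import math

def rmse(observed, predicted):
    """Root mean square error, in the units of the observations."""
    squared_errors = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared_errors) / len(observed))

def r_squared(observed, predicted):
    """Coefficient of determination, defined as 1 - SS_res / SS_tot."""
    mean_observed = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_observed) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot

# Hypothetical toy biomass values in Mg/ha (illustrative only).
reference_biomass = [80.0, 100.0, 120.0, 140.0]
model_biomass = [90.0, 90.0, 130.0, 130.0]

print(f"RMSE = {rmse(reference_biomass, model_biomass):.1f} Mg/ha")
print(f"R2 = {r_squared(reference_biomass, model_biomass):.2f}")
```

Read this way, an R² of 0.52 means the model explains roughly half of the variance in the reference biomass, while the RMSE of 31 Mg/ha is about a third of the reported provincial mean of 94 Mg/ha.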

10.
ABSTRACT

Although Twitter is used for emergency management activities, the relevance of tweets during a hazard event is still open to debate. In this study, six different computational (i.e. Natural Language Processing) and spatiotemporal analytical approaches were implemented to assess the relevance of risk information extracted from tweets obtained during the 2013 Colorado flood event. Primarily, tweets containing information about the flooding event and its impacts were analysed. Examination of the relationships between tweet volume and content and precipitation amount, damage extent, and official reports revealed that relevant tweets provided information about the event and its impacts rather than any other risk information that the public expects to receive via alert messages. However, only 14% of the geo-tagged tweets and only 0.06% of the total firehose tweets were found to be relevant to the event. By providing insight into the quality of social media data and its usefulness to emergency management activities, this study contributes to the literature on the quality of big data. Future research in this area should focus on assessing the reliability of relevant tweets for disaster-related situational awareness.

11.
ABSTRACT

National spatial data infrastructures are key to achieving the Digital Earth vision. In many cases, national datasets are integrated from local datasets created and maintained by municipalities. Examples are address, building and topographic information. Integration of local datasets may result in a dataset satisfying the needs of users of national datasets, but is it productive for those who create and maintain the data? This article presents a stakeholder analysis of the Basisregistratie Adressen en Gebouwen (BAG), a collection of base information about addresses and buildings in the Netherlands. The information is captured and maintained by municipalities and integrated into a national base register by Kadaster, the Cadastre, Land Registry and Mapping Agency of the Netherlands. The stakeholder analysis identifies organisations involved in the BAG governance framework, describes their interests, rights, ownerships and responsibilities in the BAG, and maps the relationships between them. Analysis results indicate that Kadaster and the municipalities have the highest relative importance in the governance framework of the BAG. The study reveals challenges of setting up a governance framework that maintains the delicate balance between the interests of all stakeholders. The results provide guidance for SDI role players setting up governance frameworks for national or global datasets.

12.
Service area research is one of the pivotal topics in urban geography. This article first puts forward a model for urban population estimation, on the basis of which we measured the size and distribution of the population in downtown Shanghai, China. The population model agreed well with the traditional survey model. We then extracted a one-month real-time data set containing geolocations from Sina Weibo and generated a Voronoi diagram from these data, whose cells denote the service patches, and assigned a population to each patch. In the second part, we proposed a shortest-distance algorithm, a minimum-time algorithm and an improved p-median algorithm, and used these three methods to divide the service areas of metro stations based on the patches. Subsequently, we computed the service population in each service area. Finally, we took metro lines 1 and 2 as examples to examine in detail the relationship among the three location-allocation methods. The results showed that the population of the core city of Shanghai clearly declines from the centre to the periphery. All indicators (area, population, distance and time) changed little in the central city within the inner ring road compared with the region between the inner and outer ring roads. Nevertheless, the improved p-median algorithm has a certain optimizing effect: it offers citizens a rational travel scheme for choosing a better starting metro station at the lowest cost. The results should provide theoretical and technical support for the location-allocation of public service facilities.
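
Of the three allocation methods named above, the shortest-distance one is the simplest to illustrate. The sketch below assumes planar coordinates, straight-line distance, and patches represented by a centroid plus a population; the station names, coordinates and populations are hypothetical, and the minimum-time and improved p-median algorithms are not shown.

```python
import math
from collections import defaultdict

def assign_patches(patches, stations):
    """Assign each patch to its nearest station and total the population served."""
    served = defaultdict(int)
    for x, y, population in patches:
        nearest = min(stations, key=lambda name: math.dist((x, y), stations[name]))
        served[nearest] += population
    return dict(served)

# Hypothetical toy data: two stations and four patches given as
# (centroid x, centroid y, population) in arbitrary planar units.
stations = {"A": (0.0, 0.0), "B": (10.0, 0.0)}
patches = [(1.0, 1.0, 500), (4.0, 0.0, 300), (9.0, 2.0, 700), (6.0, 0.0, 200)]

service_population = assign_patches(patches, stations)
for name in sorted(service_population):
    print(name, service_population[name])
```

Replacing the straight-line distance with an estimated travel time would turn the same loop into a rough stand-in for a minimum-time allocation.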

13.
Global geospatial data from Earth observation: status and issues   Total citations: 1 (self-citations: 0, citations by others: 1)
ABSTRACT

Data covering the whole surface of the Earth in a homogeneous and reliable manner have been accumulating over many years. Such data became available from meteorological satellites in the 1960s and from Earth-observing satellites at small scale from the early 1970s, and have gradually accumulated at larger scales, so that today data covering many environmental themes are available at large scales. These data have been used to generate information that is presented in the form of global data sets. This paper gives a brief introduction to the development of Earth observation and to the organisations and sensors that collect data and produce global geospatial data sets. A section on the means of accessing global data sets sets out the types of data available. Digital elevation models are discussed in a separate section because of their importance in georeferencing image data as well as their application to the analysis of thematic data. The paper also examines issues of availability, accuracy, validation and reliability, and looks at future challenges.

14.
The survey of the gravity field of the Earth is interpreted as a process of communication. The information inferred from the data is represented in the form of geopotential models. The paper presents a quantitative analysis of this information for spherical harmonic expansions of the potential in terms of information measures, particularly the first Kullback-Leibler information number for continuous random vectors. Common degree variance models are used for the construction of prior information. The informational viewpoint is compared to the usual interpretation in terms of errors or error degree variances.
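
For readers unfamiliar with the measure, the Kullback-Leibler information number for two continuous densities, and its closed form when both are n-dimensional Gaussians, are given below. The Gaussian assumption and the choice of which density plays the role of prior or posterior are assumptions made here for illustration; the abstract does not state them.

```latex
% Kullback-Leibler information number of density f_1 relative to density f_2
\[
  I(1:2) = \int f_1(x)\,\ln\frac{f_1(x)}{f_2(x)}\,dx
\]

% Closed form for f_1 = N(\mu_1, \Sigma_1) and f_2 = N(\mu_2, \Sigma_2) in n dimensions
\[
  I(1:2) = \frac{1}{2}\left[
      \operatorname{tr}\!\left(\Sigma_2^{-1}\Sigma_1\right)
      + (\mu_2 - \mu_1)^{T}\,\Sigma_2^{-1}\,(\mu_2 - \mu_1)
      - n
      + \ln\frac{\det\Sigma_2}{\det\Sigma_1}
  \right]
\]
```

If the two covariance matrices were taken as diagonal, with prior degree variances in one and error degree variances in the other, the trace and determinant terms would reduce to sums over individual coefficients, which suggests how such a measure can be compared with error degree variances.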

15.
申鑫, 曹林, 佘光辉. 《遥感学报》 (Journal of Remote Sensing), 2016, 20(6): 1446-1460
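Accurate estimation of forest biomass is important for research on the global carbon balance and climate change. Taking subtropical natural secondary forest as the study object and using field-measured plot data, information extraction and data fusion were performed on hyperspectral and high-spatial-resolution data acquired simultaneously by the airborne LiCHy (LiDAR, CCD and Hyperspectral) sensor to model and retrieve forest biomass. First, individual tree crowns were extracted by object-based segmentation; then, spectral feature variables extracted from the hyperspectral data were fused with individual-tree crown statistical variables extracted from the high-spatial-resolution data to build multiple regression models estimating aboveground and belowground biomass; finally, model accuracy was evaluated by cross-validation against field-measured biomass. The results show that the combined model (R² of 0.54 to 0.62) was more accurate than the hyperspectral model (R² of 0.48 to 0.57); in the hyperspectral model, the aboveground biomass model (R² = 0.57) was more accurate than the belowground biomass model (R² = 0.48); in the combined model, the aboveground biomass model (R² = 0.62) was likewise more accurate than the belowground biomass model (R² = 0.54). Cross-validation shows that, compared with using hyperspectral data alone (a single data source), integrating hyperspectral and high-spatial-resolution data improves biomass retrieval and allows subtropical forest biomass to be estimated more effectively.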

16.
Geothermal data are published using different IT services, formats and content representations, and can refer to both regional and global scale information. Geothermal stakeholders search for information with different aims. E-Infrastructures are collaborative platforms that address this diversity of aims and data representations. In this paper, we present a prototype for a European Geothermal Information Platform that uses INSPIRE recommendations and an e-Infrastructure (D4Science) to collect, aggregate and share data sets from different European data contributors, thus enabling stakeholders to retrieve and process a large amount of data. Our system merges segmented and national realities into one common framework. We demonstrate our approach by describing a platform that collects data from Italian, French, Hungarian, Swiss and Icelandic geothermal data providers.

17.
18.
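Drawing on practical project experience, this paper briefly discusses the collection of basic and public geographic information data in the construction of police geographic information systems. High-resolution satellite imagery, combined with point information collected in the field, is used to collect police basic geographic information data and emergency public geographic information, creating spatial vector data that conform to the attribute data structure of police geographic information.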

19.
Abstract

The paper discusses the need for a high-level query language that allows analysts, geographers and, in general, non-programmers to easily cross-analyze multi-source VGI created by means of apps, crowd-sourced data from social networks and authoritative geo-referenced data, usually represented as JSON data sets (nowadays the de facto standard for data exported by social networks). Since an easy-to-use high-level language for querying and manipulating collections of possibly geo-tagged JSON objects is still unavailable, we propose a truly declarative language, named J-CO-QL, that is based on a well-defined execution model. A plug-in for a GIS makes it possible to visualize geo-tagged data sets stored in a NoSQL database such as MongoDB; the same plug-in can also be used to write and execute J-CO-QL queries on those databases. The paper introduces the language by exemplifying its operators in a real case study, the aim of which is to understand the mobility of people in the neighborhood of the city of Bergamo. Data about transportation networks and VGI from travelers are cross-analyzed by means of J-CO-QL, which can manipulate, transform, combine and join possibly geo-tagged JSON objects in order to produce new possibly geo-tagged JSON objects that satisfy users’ needs.

20.
A computer-efficient global data file, which contains digitized information that enables identification of a given latitude/longitude-defined point as over land or over water, was generated from a data base that defines the world's shoreline. The method used to generate this land-sea boundary data map and its data structure are discussed. The data file was originally generated on a Control Data Corporation (CDC) computer, but it has been transported to other computer systems, including IBM, DEC/VAX, UNIVAC and Cray computers. The land-sea boundary map also includes information on islands and inland lakes. The resolution of this map is 5′×5′, equivalent to 9 km square surface blocks at the equator. The software to access this data base is structured to be easily transportable to different computers. This data base was used in the generation of the Seasat Geophysical Data Record (GDR) to identify whether a spaceborne radar altimeter measurement was over land or over ocean.
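
The abstract gives the grid resolution but not the file layout, so the sketch below only illustrates the arithmetic behind a 5′×5′ global grid. The row and column ordering (row 0 at the South Pole, column 0 at longitude -180°) and the use of the WGS 84 equatorial radius are assumptions made for illustration, not details of the original data file.

```python
import math

CELL_ARCMIN = 5
CELLS_PER_DEGREE = 60 // CELL_ARCMIN        # 12 cells per degree
N_ROWS = 180 * CELLS_PER_DEGREE             # 2160 rows from pole to pole
N_COLS = 360 * CELLS_PER_DEGREE             # 4320 columns around the globe
EQUATORIAL_RADIUS_KM = 6378.137             # WGS 84 value, assumed here

def cell_index(lat_deg, lon_deg):
    """Map a latitude/longitude in degrees to a (row, col) grid cell.

    Assumed layout: row 0 starts at the South Pole, column 0 at longitude -180.
    """
    row = min(math.floor((lat_deg + 90.0) * CELLS_PER_DEGREE), N_ROWS - 1)
    col = math.floor(((lon_deg + 180.0) % 360.0) * CELLS_PER_DEGREE) % N_COLS
    return row, col

def cell_width_at_equator_km():
    """East-west extent of one 5-arc-minute cell along the equator."""
    return 2.0 * math.pi * EQUATORIAL_RADIUS_KM / N_COLS

print(cell_index(45.5, -73.6))
print(f"{cell_width_at_equator_km():.2f} km per cell at the equator")
```

With one flag per cell, a land-sea lookup then reduces to a single array access, and the computed cell width of a little over 9 km matches the block size quoted in the abstract.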
