Similar literature
1.
The implementation of social network applications on mobile platforms has significantly elevated the activity of mobile social networking. Mobile social networking offers a channel for recording an individual’s spatiotemporal behaviors when location-detecting capabilities of devices are enabled. It also facilitates the study of time geography on an individual level, which has previously suffered from a scarcity of georeferenced movement data. In this paper, we report on the use of georeferenced tweets to display and analyze the spatiotemporal patterns of daily user trajectories. For georeferenced tweets having both location information in longitude and latitude values and recorded creation time, we apply a space–time cube approach for visualization. Compared to the traditional methodologies for time geography studies such as the travel diary-based approach, the analytics using social media data present challenges broadly associated with those of Big Data, including the characteristics of high velocity, large volume, and heterogeneity. For this study, a batch processing system has been developed for extracting spatiotemporal information from each tweet and then creating trajectories of each individual mobile Twitter user. Using social media data in time geographic research has the benefits of study area flexibility, continuous observation and non-involvement with contributors. For example, during every 30-minute cycle, we collected tweets created by about 50,000 Twitter users living in a geographic region covering New York City to Washington, DC. Each tweet can indicate the exact location of its creator when the tweet was posted. Thus, the linked tweets show a Twitter user’s movement trajectory in space and time. This study explores using data intensive computing for processing Twitter data to generate spatiotemporal information that can recreate the space–time trajectories of their creators.
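The step of linking a user's georeferenced tweets into a space–time trajectory can be illustrated with a minimal sketch. It is not the batch processing system described in the abstract; the record layout `(user_id, iso_time, lon, lat)`, the function name `build_trajectories`, and the sample values are all assumptions made up for illustration.

```python
from collections import defaultdict
from datetime import datetime


def build_trajectories(tweets):
    """Group (user_id, iso_time, lon, lat) records into per-user trajectories.

    The record layout is a hypothetical one chosen for this sketch.
    """
    by_user = defaultdict(list)
    for user_id, iso_time, lon, lat in tweets:
        by_user[user_id].append((datetime.fromisoformat(iso_time), lon, lat))
    # Sorting by creation time makes consecutive points form a path, which is
    # what a space-time cube plots with time on the vertical axis.
    return {user: sorted(points) for user, points in by_user.items()}


# Made-up sample records, not data from the study.
sample = [
    ("u1", "2013-05-01T09:30:00", -74.00, 40.71),
    ("u2", "2013-05-01T08:10:00", -77.03, 38.90),
    ("u1", "2013-05-01T08:00:00", -73.98, 40.75),
]
trajectories = build_trajectories(sample)
```

Each value in `trajectories` is a time-ordered list of `(datetime, lon, lat)` points, so a plotting tool could draw one polyline per user with time as the third axis.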

2.
With the rapid growth and popularity of mobile devices and location‐aware technologies, online social networks such as Twitter have become an important data source for scientists to conduct geo‐social network research. Non‐personal accounts, spam users and junk tweets, however, pose severe problems to the extraction of meaningful information and the validation of any research findings on tweets or Twitter users. Therefore, the detection of such users is a critical and fundamental step for Twitter‐related geographic research. In this study, we develop a methodological framework to: (1) extract user characteristics based on geographic, graph‐based and content‐based features of tweets; (2) construct a training dataset by manually inspecting and labeling a large sample of Twitter users; and (3) derive reliable rules and knowledge for detecting non‐personal users with supervised classification methods. The extracted geographic characteristics of a user include maximum speed, mean speed, the number of different counties that the user has been to, and others. Content‐based characteristics for a user include the number of tweets per month, the percentage of tweets with URLs or Hashtags, and the percentage of tweets with emotions, detected with sentiment analysis. The extracted rules are theoretically interesting and practically useful. Specifically, the results show that geographic features, such as the average speed and frequency of county changes, can serve as important indicators of non‐personal users. For non‐spatial characteristics, the percentage of tweets with a high human factor index, the percentage of tweets with URLs, and the percentage of tweets with mentioned/replied users are the top three features in detecting non‐personal users.
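The geographic features named above (maximum and mean speed) could be derived from consecutive geotagged tweets roughly as follows. This is a sketch under stated assumptions: points are `(unix_seconds, lon, lat)` tuples, distance is great-circle (haversine) distance on a sphere of radius 6371 km, and "mean speed" is taken as the mean of per-segment speeds. The abstract does not give the authors' exact definitions, and the function names are hypothetical.

```python
import math


def haversine_km(lon1, lat1, lon2, lat2):
    """Great-circle distance in km, assuming a spherical Earth (r = 6371 km)."""
    r = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))


def speed_features(points):
    """Return (max_kmh, mean_kmh) for time-ordered (unix_seconds, lon, lat) points."""
    speeds = []
    for (t1, lon1, lat1), (t2, lon2, lat2) in zip(points, points[1:]):
        hours = (t2 - t1) / 3600.0
        if hours > 0:  # skip simultaneous posts to avoid division by zero
            speeds.append(haversine_km(lon1, lat1, lon2, lat2) / hours)
    if not speeds:
        return 0.0, 0.0
    return max(speeds), sum(speeds) / len(speeds)


# Made-up track: one degree of longitude along the equator in one hour, then idle.
max_kmh, mean_kmh = speed_features([(0, 0.0, 0.0), (3600, 1.0, 0.0), (7200, 1.0, 0.0)])
```

An account whose implied speed between tweets is physically implausible for a person would stand out on such a feature, which is consistent with the abstract's finding that speed helps flag non-personal users.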

3.
Social media messages, such as tweets, are frequently used by people during natural disasters to share real‐time information and to report incidents. Within these messages, geographic locations are often described. Accurate recognition and geolocation of these locations are critical for reaching those in need. This article focuses on the first part of this process, namely recognizing locations from social media messages. While general named entity recognition tools are often used to recognize locations, their performance is limited due to the various language irregularities associated with social media text, such as informal sentence structures, inconsistent letter cases, name abbreviations, and misspellings. We present NeuroTPR, which is a Neuro‐net ToPonym Recognition model designed specifically with these linguistic irregularities in mind. Our approach extends a general bidirectional recurrent neural network model with a number of features designed to address the task of location recognition in social media messages. We also propose an automatic workflow for generating annotated data sets from Wikipedia articles for training toponym recognition models. We demonstrate NeuroTPR by applying it to three test data sets, including a Twitter data set from Hurricane Harvey, and comparing its performance with those of six baseline models.

4.
The use of social media data in geographic studies has become common, yet the question of social media's validity in such contexts is often overlooked. Social media data suffers from a variety of biases and limitations; nevertheless, with a proper understanding of the drawbacks, these data can be powerful. As cities seek to become “smarter,” they can potentially use social media data to creatively address the needs of their most vulnerable groups, such as ethnic minorities. However, questions remain unanswered regarding who uses these social networking platforms, how people use these platforms, and how representative social media data is of users' everyday lives. Using several forms of regression, I explore the relationships between a conventional data source (the U.S. Census) and a subset of Twitter data potentially representative of minority groups: tweets created by users with an account language other than English. A considerable amount of non‐stationarity is uncovered, which should serve as a warning against sweeping statements regarding the demographics of users and where people prefer to post. Further, I find that precisely located Twitter data informs us more about the digital status of places and less about users' day‐to‐day travel patterns.

5.
Social media networks allow users to post what they are involved in with location information in a real‐time manner. It is therefore possible to collect large amounts of information related to local events from existing social networks. Mining this abundant information can feed users and organizations with situational awareness to make responsive plans for ongoing events. Despite the fact that a number of studies have been conducted to detect local events using social media data, the event content is not efficiently summarized and/or the correlation between abnormal neighboring regions is not investigated. This article presents a spatial‐temporal‐semantic approach to local event detection using geo‐social media data. Geographical regularities are first measured to extract spatio‐temporal outliers, of which the corresponding tweet content is automatically summarized using the topic modeling method. The correlation between outliers is subsequently examined by investigating their spatial adjacency and semantic similarity. A case study on the 2014 Toronto International Film Festival (TIFF) is conducted using Twitter data to evaluate our approach. This reveals that up to 87% of the events detected are correctly identified compared with the official TIFF schedule. This work is beneficial for authorities to keep track of urban dynamics and helps build smart cities by providing new ways of detecting what is happening in them.

6.
Although Twitter is used for emergency management activities, the relevance of tweets during a hazard event is still open to debate. In this study, six different computational (i.e. Natural Language Processing) and spatiotemporal analytical approaches were implemented to assess the relevance of risk information extracted from tweets obtained during the 2013 Colorado flood event. Primarily, tweets containing information about the flooding event and its impacts were analysed. Examination of the relationships between tweet volume and its content with precipitation amount, damage extent, and official reports revealed that relevant tweets provided information about the event and its impacts rather than any other risk information that the public expects to receive via alert messages. However, only 14% of the geo-tagged tweets and only 0.06% of the total firehose tweets were found to be relevant to the event. By providing insight into the quality of social media data and its usefulness to emergency management activities, this study contributes to the literature on quality of big data. Future research in this area would focus on assessing the reliability of relevant tweets for disaster related situational awareness.

7.
Many different methods are used to disaggregate census data and predict population densities to construct finer scale, gridded population data sets. These methods often involve a range of high resolution geospatial covariate datasets on aspects such as urban areas, infrastructure, land cover and topography; such covariates, however, are not directly indicative of the presence of people. Here we tested the potential of geo‐located tweets from the social media application, Twitter, as a covariate in the production of population maps. The density of geo‐located tweets in 1x1 km grid cells over a 2‐month period across Indonesia, a country with one of the highest Twitter usage rates in the world, was input as a covariate into a previously published random forests‐based census disaggregation method. Comparison of internal measures of accuracy and external assessments between models built with and without the geotweets showed that increases in population mapping accuracy could be obtained using the geotweet densities as a covariate layer. The work highlights the potential for such social media‐derived data in improving our understanding of population distributions and offers promise for more dynamic mapping with such data being continually produced and freely available.

8.
Natural disasters, such as wildfires, earthquakes, landslides, or floods, lead to an increase in topical information shared on social media and in increased mapping activities in volunteered geographic information (VGI) platforms. Using earthquakes in Nepal and Central Italy as case studies, this research analyzes the effects of natural disasters on short-term (weeks) and longer-term (half year) changes in OpenStreetMap (OSM) mapping behavior and tweet activities in the affected regions. An increase of activities in OSM during the events can be partially attributed to focused OSM mapping campaigns, for example, through the Humanitarian OSM Team (HOT). Using source tags in OSM change-sets, it was found that only a small portion of external mappers actually travels to the affected regions, whereas the majority of external mappers relies on desktop mapping instead. Furthermore, the study analyzes the spatio-temporal sequence of posted tweets together with keyword filters to identify a subset of users who most likely traveled to the affected regions for support and rescue operations. It also explores where, geographically, earthquake information spreads within social networks.

9.
Widespread use of social media during crises has become commonplace, as shown by the volume of messages during the Haiti earthquake of 2010 and Japan tsunami of 2011. Location mentions are particularly important in disaster messages as they can show emergency responders where problems have occurred. This article explores the sorts of locations that occur in disaster‐related social messages, how well off‐the‐shelf software identifies those locations, and what is needed to improve automated location identification, called geo‐parsing. To do this, we have sampled Twitter messages from the February 2011 earthquake in Christchurch, Canterbury, New Zealand. We annotated locations in messages manually to make a gold standard by which to measure locations identified by Named Entity Recognition software. The Stanford NER software found some locations that were proper nouns, but did not identify locations that were not capitalized, local streets and buildings, or non‐standard place abbreviations and mis‐spellings that are plentiful in microtext. We review how these problems might be solved in software research, and model a readable crisis map that shows crisis location clusters via enlarged place labels.

10.
Understanding and detecting the intended meaning in social media is challenging because social media messages contain varieties of noise and chaos that are irrelevant to the themes of interest. For example, conventional supervised classification approaches would produce inconsistent solutions to detecting and clarifying whether any given Twitter message is really about a wildfire event. Consequently, a renovated workflow was designed and implemented. The workflow consists of four sequential procedures: (1) Apply the latent semantic analysis and cosine similarity calculation to examine the similarity between Twitter messages; (2) Apply Affinity Propagation to identify exemplars of Twitter messages; (3) Apply the cosine similarity calculation again to automatically match the exemplars to known training results, and (4) Apply accumulative exemplars to classify Twitter messages using a support vector machine approach. The overall correction ratio was over 90% when a series of ongoing and historical wildfire events were examined.
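The cosine similarity calculation used in steps (1) and (3) can be sketched on its own. The version below is deliberately simplified: it compares raw bag-of-words counts after whitespace tokenization and omits the latent semantic analysis projection, Affinity Propagation, and the support vector machine. The function name and the sample messages are assumptions for illustration only.

```python
import math
from collections import Counter


def cosine_similarity(text_a, text_b):
    """Cosine of the angle between bag-of-words count vectors of two messages.

    Simplification: plain whitespace tokens, no LSA dimensionality reduction.
    """
    va = Counter(text_a.lower().split())
    vb = Counter(text_b.lower().split())
    # Only words present in both messages contribute to the dot product.
    dot = sum(va[w] * vb[w] for w in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty message is treated as similar to nothing
    return dot / (norm_a * norm_b)


# Made-up messages: one shared word out of two in each message.
score = cosine_similarity("wildfire smoke", "wildfire evacuation")
```

A pairwise matrix of such scores is the kind of input a clustering method like Affinity Propagation takes when it selects exemplar messages.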

11.
SensePlace3 (SP3) is a geovisual analytics framework and web application that supports overview + detail analysis of social media, focusing on extracting meaningful information from the Twitterverse. SP3 leverages social media related to crisis events. It differs from most existing systems by enabling an analyst to obtain place-relevant information from tweets that have implicit as well as explicit geography. Specifically, SP3 includes not just the ability to utilize the explicit geography of geolocated tweets but also the ability to analyze implicit geography by recognizing and geolocating references both in tweet text, which indicates locations tweeted about, and in Twitter profiles, which indicate locations affiliated with users. Key features of SP3 reported here include flexible search and filtering capabilities to support information foraging; an ingest, processing, and indexing pipeline that produces near real-time access for big streaming data; and a novel strategy for implementing a web-based multi-view visual interface with dynamic linking of entities across views. The SP3 system architecture was designed to support crisis management applications, but its design flexibility makes it easily adaptable to other domains. We also report on a user study that provided input to SP3 interface design and suggests next steps for effective spatiotemporal analytics using social media sources.

12.
The pervasive presence of location-sharing services has made it possible for researchers to gain unprecedented access to the direct records of human activity in space and time. This article analyses geo-located Twitter messages in order to uncover global patterns of human mobility. Based on a dataset of almost a billion tweets recorded in 2012, we estimate the volume of international travelers by country of residence. Mobility profiles of different nations were examined based on such characteristics as mobility rate, radius of gyration, diversity of destinations, and inflow–outflow balance. Temporal patterns disclose the universally valid seasons of increased international mobility and the particular character of international travels of different nations. Our analysis of the community structure of the Twitter mobility network reveals spatially cohesive regions that follow the regional division of the world. We validate our results using global tourism statistics and mobility models provided by other authors and argue that Twitter is exceptionally useful for understanding and quantifying global mobility patterns.
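Of the mobility characteristics listed above, the radius of gyration has a compact standard definition: the root-mean-square distance of a user's recorded positions from their centroid. The sketch below assumes positions already projected to planar coordinates in kilometres, which sidesteps great-circle geometry; the article's own computation on global data may differ, and the function name is hypothetical.

```python
import math


def radius_of_gyration(points):
    """Root-mean-square distance of planar (x_km, y_km) points from their centroid."""
    n = len(points)
    if n == 0:
        raise ValueError("at least one point is required")
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    return math.sqrt(sum((x - cx) ** 2 + (y - cy) ** 2 for x, y in points) / n)


# Made-up positions 2 km apart: the centroid lies midway, 1 km from each.
rg = radius_of_gyration([(0.0, 0.0), (2.0, 0.0)])
```

A small value indicates a user who tweets from a compact area, while a large value indicates long-distance movement such as international travel.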

13.
The objective of this article is to conduct a systematic literature review that provides an overview of the current state of research concerning methods and applications for spatiotemporal analyses of the social network Twitter. Reviewed papers and their application domains have shown that the study of geographical processes by using spatiotemporal information from location‐based social networks represents a promising and still underexplored field for GIScience researchers.

14.
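Wang Jingquan (王敬泉), Wang Kai (王凯). Bulletin of Surveying and Mapping (《测绘通报》), 2019, (12): 142-146
China is in a period of social transition, and there is an urgent need to analyze the online public opinion triggered by emergencies. Because of its big-data character, Weibo provides a large volume of data for knowledge discovery about public opinion and the natural environment, and linking it with location information can yield new findings for geographic analysis. Analyzing and mining the Weibo messages posted by users supports the monitoring of public opinion and informs government decision-making. This paper carries out data mining on Weibo data, analyzes patterns of opinion propagation in the spatial dimension, and explores the regularities of public opinion propagation by combining visualization with statistical methods.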

15.
China's social media platform, Sina Weibo, like Twitter, hosts a considerable amount of big data: messages, comments, pictures. Collecting and analyzing information from this treasury of human behavior data is a challenge, although the message exchange on the network is readable by everyone through the web or app interface. The official Application Programming Interface (API) is the gateway to access and download public content from Sina Weibo and is used to collect messages for all of mainland China. The nearby_timeline() request is used to harvest only messages with associated location information. This technical note serves as a reference for researchers who do not speak Mandarin but want to collect data from this rich source of information. Ways of data visualization are presented as a point cloud, density per areal unit, or clustered using Density‐Based Spatial Clustering of Applications with Noise (DBSCAN). The relation of messages to census information is also given.

16.
Rapid flood mapping is critical for local authorities and emergency responders to identify areas in need of immediate attention. However, traditional data collection practices such as remote sensing and field surveying often fail to offer timely information during or right after a flooding event. Social media such as Twitter have emerged as a new data source for disaster management and flood mapping. Using the 2015 South Carolina floods as the study case, this paper introduces a novel approach to mapping the flood in near real time by leveraging Twitter data in geospatial processes. Specifically, in this study, we first analyzed the spatiotemporal patterns of flood-related tweets using quantitative methods to better understand how Twitter activity is related to flood phenomena. Then, a kernel-based flood mapping model was developed to map the flooding possibility for the study area based on the water height points derived from tweets and stream gauges. The identified patterns of Twitter activity were used to assign the weights of flood model parameters. The feasibility and accuracy of the model were evaluated by comparing the model output with official inundation maps. Results show that the proposed approach could provide a consistent and comparable estimation of the flood situation in near real time, which is essential for improving the situational awareness during a flooding event to support decision-making.

17.
Massive social media data produced from microblog platforms provide a new data source for studying human dynamics at an unprecedented scale. Meanwhile, population bias in geotagged Twitter users is widely recognized. Understanding the demographic and socioeconomic biases of Twitter users is critical for making reliable inferences on the attitudes and behaviors of the population. However, the existing global models cannot capture the regional variations of the demographic and socioeconomic biases. To bridge the gap, we modeled the relationships between different demographic/socioeconomic factors and geotagged Twitter users for the whole contiguous United States, aiming to understand how the demographic and socioeconomic factors relate to the number of Twitter users at county level. To effectively identify the local Twitter users for each county of the United States, we integrate three commonly used methods and develop a query approach in a high-performance computing environment. The results demonstrate that we can not only identify how the demographic and socioeconomic factors relate to the number of Twitter users, but can also measure and map how the influence of these factors varies across counties.

18.
Residential locations play an important role in understanding the form and function of urban systems. However, it is impossible to release this detailed information publicly, due to the issue of privacy. The rapid development of location‐based services and the prevalence of global positioning system (GPS)‐equipped devices provide an unprecedented opportunity to infer residential locations from user‐generated geographic information. This article compares different approaches for predicting Twitter users' home locations at a precise point level based on temporal and spatial features extracted from geo‐tagged tweets. Among the three deterministic approaches, the one that estimates the home location for each user by finding the weighted most frequently visited (WMFV) cluster of that user always provides the best performance when compared with the other two methods. The results of a fourth approach, based on the support vector machine (SVM), are severely affected by the threshold value for a cluster to be identified as the home.
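The idea behind "most frequently visited" home estimation can be shown with a deliberately simplified, unweighted sketch. It is not the WMFV method from the article: the abstract gives neither the clustering procedure nor the weights, so this version bins points into a fixed grid instead of clustering and counts every tweet equally. The cell size `cell_deg`, the function name, and the sample coordinates are assumptions.

```python
import math
from collections import defaultdict


def most_visited_cell_center(points, cell_deg=0.01):
    """Estimate a home point as the mean position inside the busiest grid cell.

    points: iterable of (lon, lat); cell_deg: hypothetical grid size in degrees.
    Unweighted stand-in for a weighted most-frequently-visited cluster.
    """
    cells = defaultdict(list)
    for lon, lat in points:
        key = (math.floor(lon / cell_deg), math.floor(lat / cell_deg))
        cells[key].append((lon, lat))
    if not cells:
        raise ValueError("at least one point is required")
    members = max(cells.values(), key=len)  # ties resolve to the first cell seen
    n = len(members)
    return (sum(lon for lon, _ in members) / n, sum(lat for _, lat in members) / n)


# Made-up points: three close together and one far away.
home = most_visited_cell_center(
    [(-73.985, 40.755), (-73.986, 40.756), (-73.984, 40.754), (-77.03, 38.90)]
)
```

A weighted variant would replace the plain count with a per-tweet weight, for example one that favours night-time posts, before picking the top cell or cluster; that weighting scheme is an illustration here, not something the abstract specifies.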

19.
The analysis of social media content for the extraction of geospatial information and event‐related knowledge has recently received substantial attention. In this article we present an approach that leverages the complementary nature of social multimedia content by utilizing heterogeneous sources of social media feeds to assess the impact area of a natural disaster. More specifically, we introduce a novel social multimedia triangulation process that uses both Twitter and Flickr content in an integrated two‐step process: Twitter content is used to identify toponym references associated with a disaster; this information is then used to provide approximate orientation for the associated Flickr imagery, allowing us to delineate the impact area as the overlap of multiple view footprints. In this approach, we practically crowdsource approximate orientations from Twitter content and use this information to orient Flickr imagery accordingly and identify the impact area through viewshed analysis and viewpoint integration. This approach enables us to avoid computationally intensive image analysis tasks associated with traditional image orientation, while allowing us to triangulate numerous images by having them pointed towards the crowdsourced toponym location. The article presents our approach and demonstrates its performance using a real‐world wildfire event as a representative application case study.

20.
The vision of Digital Earth (DE) recently put forward under the auspices of the International Society for DE extends the paradigm of spatial data infrastructures by advocating an interactive and dynamic framework based on near-to-real time information from sensors and citizens. This paper contributes to developing that vision and reports the results of a two-year research project exploring the extent to which it is possible to extract information useful for policy and science from the large volumes of messages and photos being posted daily through social networks. Given the noted concerns about the quality of such data in relation to that provided by authoritative sources, the research has developed a semi-automatic workflow to assess the fitness for purpose of data extracted from Twitter and Flickr, and compared them to that coming from official sources, using forest fires as a case study. The findings indicate that we were able to detect accurately six of eight major fires in France in the summer of 2011, with another four detected by the social networks but not reported by our official source, the European Forest Fire Information Service. These findings and the lessons learned in handling the very large volumes of unstructured data in multiple languages discussed in this study provide useful insights into the value of social network data for policy and science, and contribute to advancing the vision of DE.
