
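Application of Hadoop in Data-Intensive Processing of Meteorological Data
Cite this article: Xiao Weiqing, Yang Runzhi, Hu Kaixi, Lin Runsheng, Liu Liming, Gu Junxia. Application of Hadoop in Data-Intensive Processing of Meteorological Data[J]. Meteorological Science and Technology, 2015, 43(5): 823-828.
Authors: Xiao Weiqing, Yang Runzhi, Hu Kaixi, Lin Runsheng, Liu Liming, Gu Junxia
Affiliation: National Meteorological Information Center, Beijing 100081 (all six authors)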
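Abstract: Statistical analysis of meteorological data is a data-intensive computation. At present it is mostly carried out on a single machine, which is slow for large volumes of data, copes poorly with ever-growing data, and to some extent constrains research on meteorological data. For data-intensive processing of meteorological data, Hadoop's MapReduce model is applied to improve computing efficiency. To address Hadoop's low efficiency on meteorological data made up of large numbers of small files, the original files are preprocessed: multiple small files are merged into a large file that can be used directly in computation. Experiments show that this method resolves Hadoop's low efficiency when processing large numbers of small files, and a comparison with loading the data into Oracle and querying it shows that applying Hadoop to data-intensive meteorological data is of practical value.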

Keywords: Hadoop; HDFS; MapReduce; meteorological data; data-intensive computing
Received: 2014/5/30
Revised: 2015/6/29

Application of Hadoop in Data-Intensive Processing of Meteorological Data
Xiao Weiqing, Yang Runzhi, Hu Kaixi, Lin Runsheng, Liu Liming and Gu Junxia. Application of Hadoop in Data-Intensive Processing of Meteorological Data[J]. Meteorological Science and Technology, 2015, 43(5): 823-828.
Authors: Xiao Weiqing, Yang Runzhi, Hu Kaixi, Lin Runsheng, Liu Liming and Gu Junxia
Institution: National Meteorological Information Center, Beijing 100081 (all six authors)
Abstract: Statistical analysis of meteorological data is data-intensive and is usually carried out on a single machine. Processing is too slow when the data set is large, which constrains research on meteorological data. Hadoop and MapReduce are used to speed up this data-intensive processing. To address Hadoop's low efficiency when processing large numbers of small files, a preprocessing step merges the small files into a large file. Experiments show that this method resolves the low-efficiency problem when Hadoop processes large numbers of small files. A comparison with Oracle indicates that using Hadoop for data-intensive meteorological computing is of practical value.
Keywords: Hadoop; HDFS; MapReduce; meteorological data; data-intensive
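
The preprocessing idea in the abstract, merging many small files into one large file before the MapReduce computation, can be illustrated with a minimal single-machine sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the record layout (`station_id,temperature`), the station numbers, the file names, and the helper names `merge_small_files`, `map_record`, `reduce_mean`, and `run_demo`. The abstract does not state the merged file format or the statistics computed, and the sketch uses plain Python rather than a Hadoop cluster.

```python
import os
import tempfile
from collections import defaultdict


def merge_small_files(src_dir, merged_path):
    """Concatenate every file in src_dir into one line-oriented file.

    Each output line is "<source file name>\t<original line>", so a record
    stays traceable to its small file after merging. Returns the number of
    records written.
    """
    count = 0
    with open(merged_path, "w", encoding="utf-8") as out:
        for name in sorted(os.listdir(src_dir)):
            path = os.path.join(src_dir, name)
            if not os.path.isfile(path):
                continue
            with open(path, encoding="utf-8") as src:
                for line in src:
                    line = line.rstrip("\n")
                    if line:  # skip empty lines
                        out.write(f"{name}\t{line}\n")
                        count += 1
    return count


def map_record(line):
    """Map step: emit (station_id, temperature) from one merged line."""
    _source, record = line.rstrip("\n").split("\t", 1)
    station_id, temperature = record.split(",")
    return station_id, float(temperature)


def reduce_mean(pairs):
    """Reduce step: average the temperatures grouped by station."""
    grouped = defaultdict(list)
    for station_id, temperature in pairs:
        grouped[station_id].append(temperature)
    return {key: sum(vals) / len(vals) for key, vals in grouped.items()}


def run_demo():
    """Build three hypothetical small files, merge them, and average."""
    with tempfile.TemporaryDirectory() as tmp:
        src_dir = os.path.join(tmp, "small")
        os.mkdir(src_dir)
        samples = {  # hypothetical observations, not real data
            "obs_01.txt": "54511,10.0\n54527,12.0\n",
            "obs_02.txt": "54511,14.0\n",
            "obs_03.txt": "54527,8.0\n",
        }
        for name, text in samples.items():
            with open(os.path.join(src_dir, name), "w", encoding="utf-8") as f:
                f.write(text)
        merged = os.path.join(tmp, "merged.tsv")
        n = merge_small_files(src_dir, merged)
        with open(merged, encoding="utf-8") as f:
            means = reduce_mean(map_record(line) for line in f)
        return n, means


if __name__ == "__main__":
    print(run_demo())
```

The point of the merge step is that the map phase reads one large input rather than opening many tiny ones; on a real cluster the merged file would live in HDFS and the map and reduce functions would run as distributed tasks.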
This article is indexed in Wanfang Data and other databases.