首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于张量分解的AoT序列数据有损压缩方法
引用本文:杨晨,高鸿,张丽莹,胡旭,俞肇元,李冬双.基于张量分解的AoT序列数据有损压缩方法[J].地球信息科学,2021,23(1):134-142.
作者姓名:杨晨  高鸿  张丽莹  胡旭  俞肇元  李冬双
作者单位:1.南京师范大学 虚拟地理环境教育部重点实验室,南京 2100232.江苏省地理环境演化国家重点实验室培育建设点,南京 2100233.江苏省地理信息资源开发与利用协同创新中心,南京 2100234.江苏省作物遗传生理国家重点实验室/江苏省作物栽培生理重点实验室,扬州大学农学院,扬州 2250095.江苏省粮食作物现代产业技术协同创新中心,扬州大学,扬州 225009
基金项目:国家重点研发计划项目;国家自然科学基金项目
摘    要:Array of Things (AoT)通过单一位置上的多传感器对城市系统进行连续动态观测。AoT观测数据量大且持续增长,使得如何利用有限的计算资源进行AoT序列数据的压缩传输成为其应用的关键瓶颈之一。本文提出了一种基于张量分解的AoT序列数据的有损压缩方法。面向其海量、高维且需在传感器端处理的需求,该方法首先将AoT序列数据组织成高维张量,利用算法复杂度较低的张量CANDECOMP/PARAFAC (CP)分解提取各维度上的特征主分量,而后利用张量重构实现特征保持的数据有损压缩。利用基于张量分解的有损压缩方法,针对美国芝加哥市区的24 h内感测的声光电磁数据进行了实验,讨论了不同压缩参数对压缩比、压缩误差、压缩精度、压缩时间、压缩过程运行内存占用和压缩结果内存占用之间的影响。实验结果表明该方法可实现AoT序列数据的有损压缩,其较小的内存占用能够支持传感器端数据压缩。并且与原始光场强度对比表明,压缩后的数据保持了原有时空分布特征。与传统矢量量化编码压缩方法相比,在相同压缩精度下,本文方法的压缩比约高27%~76%,压缩时间约节省46%~73%,压缩结果所占内存约节省17%~57%,因此本文方法具有更高的压缩比,更低的压缩时间和内存占用,可为AoT这一类数据的大规模有损压缩提供借鉴意义。

关 键 词:传感器  时空序列  AoT  有损压缩  多维张量  张量分解  CP分解  张量重构  
收稿时间:2020-07-31

A Lossy Compression Method for AoT Sequence Data based on Tensor Decomposition
YANG Chen,GAO Hong,ZHANG Liying,HU Xu,YU Zhaoyuan,LI Dongshuang.A Lossy Compression Method for AoT Sequence Data based on Tensor Decomposition[J].Geo-information Science,2021,23(1):134-142.
Authors:YANG Chen  GAO Hong  ZHANG Liying  HU Xu  YU Zhaoyuan  LI Dongshuang
Institution:(Key Laboratory of Virtual Geographic Environment of The Ministry of Education(Nanjing Normal University),Nanjing 210023,China;Cultivation Base of State Key Laboratory of Geographical Environment Evolution,Jiangsu Province,Nanjing 210023,China;Jiangsu Provincial Center for Collaborative Innovation in Geographical Information Resource Development and Application,Nanjing 210023,China;Jiangsu Key Laboratory of Crop Genetics and Physiology/Jiangsu Key Laboratory of Crop Cultivation and Physiology,Agricultural College of Yangzhou University,Yangzhou 225009,China;Jiangsu Co-Innovation Center for Modem Production Technology of Grain Crops,Yangzhou University,Yangzhou 225009,China)
Abstract:Array of Things(AoT) provides continuous and dynamic observations of urban systems through multiple sensors at a single location. How to utilize the limited computing resources to compress and transmit AoT sequence data becomes one of the key bottlenecks of the AoT application. Considering that most AoT sequence data are massive, high-dimensional and needed to be processed at the sensor side, a tensor decomposition method is introduced to the lossy compression for AoT sequence data in this work. This method first organizes the AoT sequence data as a high-dimensional tensor to preserve the multidimensional coupling relationship among the different dimensions. The CANDECOMP/PARAFAC(CP) decomposition, which has simple parameter, relatively simple principle and low algorithm complexity, is then utilized to decompose and extract the principal feature components in each dimension of AoT sequence data. Since these principal feature components are obtained by absorbing the multidimensional coupling relationship, they can be further combined with tensor reconstruction to approximate the original data accurately. Considering that the data approximation is obtained by removing the redundant information, it can achieve the data lossy compression with the feature preservation. The simulation experiment is conducted based on the acousto-optic electromagnetic data sensed within 24 hours in the downtown area of Chicago in the United States. The influences of different compression parameters on compression ratio, compression error, compression accuracy, compression time, memory usage under the conditions of different compression parameter are discussed. The experimental results show that, with the increase of compression parameter, the compression error obviously decreases and the memory occupation weakly increases, which demonstrates that tensor-based method can achieve lossy compression of AoT sequence data and both the memory occupation during the running process and the memory occupation of the final results can support the data compression of sensor segments. Compared with the original intensity of the light field, the compressed data maintain the spatio-temporal distribution characteristics of original data that would not affect the further data analysis. In addition, compared with the traditional vector quantization coding compression method, the compression ratio of this method is higher about 27%~76%, the compression time is less about 46%~73%, and the memory occupation of compression result is smaller about 17%~57%. Therefore, the tensor-based method has a higher compression ratio, less compression time and smaller memory occupation under the same compression accuracy. The tensor-based method can also be applied to the data with multidimensional features,such as spatial dimensions with different locations, time dimensions at different time nodes, and attribute dimensions of different variables(temperature, humidity, etc.), which could provide a feasible idea for largescale lossy compression of massive multidimensional geographic sensor sequence data represented by AoT sequence data.
Keywords:sensor  spatio-temporal sequence  Array of Things(AoT)  lossy compression  multidimensional tensor  tensor decomposition  CANDECOMP/PARAFAC decomposition  tensor reconstruction
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《地球信息科学》浏览原始摘要信息
点击此处可从《地球信息科学》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号