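Annotation for Geographical Spatial Relations in Chinese Text (original title: 中文文本的地理空间关系标注)
Cite this article: 张雪英,张春菊,朱少楠.中文文本的地理空间关系标注[J].测绘学报,2012,41(3):468-474.
Authors: ZHANG Xueying, ZHANG Chunju, ZHU Shaonan
Affiliation: Key Laboratory of Virtual Geographic Environment (Ministry of Education), Nanjing Normal University, Nanjing, Jiangsu 210046, China
Funding: National Natural Science Foundation of China; Jiangsu Province Graduate Student Innovation Project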
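Abstract: To address the current shortage of relevant standards and standard data, this paper analyzes the linguistic characteristics of geographical spatial relation descriptions in Chinese text and proposes an annotation scheme for geographical spatial relations in Chinese text. Using GATE (General Architecture for Text Engineering) as the annotation tool and 《中国大百科全书中国地理》 (Encyclopedia of China, China Geography) as the text source, an annotated corpus of geographical spatial relations was built by means of cross-validation. The work achieves a structured representation of geographical spatial relation descriptions in Chinese text and provides standardized test data for geographical spatial relation information extraction.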

Keywords: natural language; Chinese text; geographical spatial relations; annotation scheme; annotated corpus

Annotation for Geographical Spatial Relations in Chinese Text
ZHANG Xueying, ZHANG Chunju, ZHU Shaonan. Annotation for Geographical Spatial Relations in Chinese Text[J]. Acta Geodaetica et Cartographica Sinica, 2012, 41(3): 468-474.
Authors: ZHANG Xueying, ZHANG Chunju, ZHU Shaonan
Institution: Institute of Geographical Science, Nanjing Normal University, Nanjing 210046, China
Abstract: Corpus annotation provides both reference and training material for method development, as well as benchmark data sets annotated with a given annotation scheme. After an analysis of their linguistic characteristics, an annotation scheme is proposed for marking up linguistic expressions of spatial relations in Chinese text. The natural language processing software GATE (General Architecture for Text Engineering) is then introduced as the annotation tool. Based on the proposed annotation scheme, a corpus with "Encyclopedia of China Geography" as the source data is annotated by means of cross-validation to address the problem of annotation inconsistency. The resulting corpus provides a structured representation of geographical spatial relations described in natural language, and standard training and test data for their extraction.
Keywords: natural languages; Chinese texts; spatial relations; annotation schemes; annotated corpus
This article is indexed in CNKI, Wanfang Data (万方数据), and other databases.