首页 | 本学科首页   官方微博 | 高级检索  
     

中文文本的地名解析方法研究
引用本文:唐旭日, 陈小荷, 张雪英. 中文文本的地名解析方法研究[J]. 武汉大学学报 ( 信息科学版), 2010, 35(8): 930-935.
作者姓名:唐旭日  陈小荷  张雪英
作者单位:1南京师范大学文学院,南京市宁海路122号210097;2南京师范大学虚拟地理环境教育部重点实验室,南京市文苑路1号210046
基金项目:国家863计划资助项目(2007AA12Z221);国家自然科学基金资助项目(40971231,60773173);国家社科基金资助项目(07BYY050)
摘    要:讨论了中文文本的地名解析流程,提出基于条件随机场和篇章地名关系的地名识别方法、基于局部模糊匹配的地名标准化方法以及基于认知显著度的地理编码方法,并构建了地名解析原型系统。实验显示,该系统可以获得较为满意的精确率、召回率和F-1值,同时讨论了地名词典的完备性、地名识别精度以及地名语义歧义消除等影响地名解析性能的主要因素。

关 键 词:地名解析  地名识别  地理编码  地名匹配
收稿时间:2010-06-15
修稿时间:2013-07-09

Research on Toponym Resolution in Chinese Text
TANG Xuri, CHEN Xiaohe, ZHANG Xueying. Research on Toponym Resolution in Chinese Text[J]. Geomatics and Information Science of Wuhan University, 2010, 35(8): 930-935.
Authors:TANG Xuri  CHEN Xiaohe  ZHANG Xueying
Affiliation:1School of Chinese Language and Literature,Nanjing Normal University,122 Ninghai Road,Nanjing 210097,China;(2 Key Laboratory of Virtual Geographical Environment,Ministry of Education,Nanjing Normal University,1 Wenyuan Road,Nanjing 210046,China
Abstract:This paper explores approaches for Toponym resolution in Chinese text,and proposes a geo-parsing approach based on conditional random fields and discourse toponym relations,and a geo-coding approach based on partial fuzzy matching and cognitive salience calculation.The proposed geo-parsing approach deals with the recognition of toponym in three major steps.The experiment shows that the key factors that may influence the performance of toponym resolution in Chinese text are the coverage of gazetteer,the performance of geo-parsing and the performance of semantic disambiguation of toponyms.In our experiment,there are about 17% toponyms can not locate their semantics in the gazetteer.Ambiguity in geo-parsing and geo-coding are the next prominent factors that affect the overall performance of toponym resolution.
Keywords:toponym resolution  geo-parsing  toponym matching  semantic disambiguation
本文献已被 CNKI 等数据库收录!
点击此处可从《武汉大学学报(信息科学版)》浏览原始摘要信息
点击此处可从《武汉大学学报(信息科学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号