首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种基于复合特征的中文地名识别方法
引用本文:魏勇,李鸿飞,胡丹露,李响,马雷雷.一种基于复合特征的中文地名识别方法[J].武汉大学学报(信息科学版),2018,43(1):17-23.
作者姓名:魏勇  李鸿飞  胡丹露  李响  马雷雷
作者单位:1.31008部队, 北京, 100091
基金项目:国家自然科学基金青-基金41401467四川省应急测绘与防灾减灾工程技术研究中心开放基金K2015B014
摘    要:中文地名识别是命名实体识别的重要研究课题之一,也是提高地理信息系统应用水平的关键。传统的地名识别主要基于词性或地名要素特征,特征类型有限。提出了一种基于复合特征的中文地名识别方法,挖掘中文地名在自然语言中的特点,设计了类型、路径、距离和数量四种句法特征,基于地名要素特征、词性特征、句法特征三种复合特征利用条件随机场模型实现了中文地名的训练和识别。通过实验对比复合特征在中文地名识别方法的效果,结果表明复合特征能够有效提高中文地名识别的准确率和召回率,尤其是对于复杂地名的识别,具有良好的效果。

关 键 词:地名识别    复合特征    句法分析    条件随机场
收稿时间:2016-01-10

A Method of Chinese Place Name Recognition Based on Composite Features
Institution:1.Troops 31008, Beijing 100091, China2.Institute of Geospatial Information, Information and Engineering University, Zhengzhou 450052, China3.Troops 95291, Hengyang 421010, China
Abstract:Chinese place name recognition is a research topic in named entity recognition, and a key to improve the application level of the geographic information systems in China. The traditional place name recognition method is based on the element characteristics of a place name and the part of speech of words, and employs limited features. This paper proposes a method of Chinese place name recognition method using syntactic features, and mines the syntactic characteristics of place names in natural language. The design employs four syntactic features, class, path, distance, and number, in conditional random fields (CRF) to train and recognize Chinese place names based on place name element s, position of speech (POS) and syntactic features. Comparative experiments with composite features and traditional features for Chinese place name show that with the help of the three composite feature, s Chinese place name recognition accuracy and recall rate can be improved effectively and with good results for complex place names.
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《武汉大学学报(信息科学版)》浏览原始摘要信息
点击此处可从《武汉大学学报(信息科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号