文章摘要
赵蕴华.基于GATE的中文专利摘要的抽取[J].数字图书馆论坛,2008,(11):
基于GATE的中文专利摘要的抽取
GATE-based Chinese Patent Abstracts’ Extraction
投稿时间:2008-10-07  修订日期:2008-10-07
DOI:
中文关键词: 中文专利摘要 GATE 信息抽取
英文关键词: Chinese Patent Abstract, GATE, Information Extraction
基金项目:其它
作者单位E-mail
赵蕴华* 中国科学技术信息研究所 万方数据技术研究院 zhaoyh@wanfangdata.com.cn 
摘要点击次数: 1342
全文下载次数: 674
中文摘要:
      本文通过对“新能源汽车”中文专利摘要的阅读和分析,提出了一种专利摘要内容判别原则。并通过对国外开源抽取工具GATE和中科院分词工具ICTCLAS的学习和改进,实现了对中文专利摘要的批量抽取,为专利知识库的自动构建准备了充分的语料基础。
英文摘要:
      With reading and analyzing the Chinese Patent Abstracts of New Resource Cars, this paper brings forward a judging principle of the abstracts. Then, this paper learns a foreign open-source extraction tool which named GATE and the word split software which named ICTCLAS. With improving the GATE and ICTCLAS, this paper achieves at extracting a batch of Chinese Patent Abstracts, which prepares enough language resources for constituting the Patent Knowledge Base automatically.
查看全文   查看/发表评论  下载PDF阅读器
关闭

分享按钮