林泽斐 (Lin Zefei). Research on Web Gathering Mechanisms for Knowledge Management System [J]. 数字图书馆论坛 (Digital Library Forum), 2007, (11): |
Research on Web Gathering Mechanisms for Knowledge Management System (original title: 面向KMS的Web信息采集机制研究) |
Submitted: 2007-08-13 Revised: 2007-08-13 |
DOI: |
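Keywords (Chinese): 知识管理系统 (knowledge management system); 信息采集 (information gathering); 信息提取 (information extraction) |
Keywords (English): Knowledge Management System; Web Gathering; Data Extraction |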
Funding: |
|
Abstract views: 1268 |
Full-text downloads: 582 |
Abstract: |
The Web is one of the important information sources for a KMS (knowledge management system), but the sheer volume, disorder, and semi-structured nature of Web data make information gathering difficult. This paper discusses Web information gathering mechanisms, in particular the HTML structural feature analysis method. Building on that discussion, it uses the construction of an enterprise directory database as an example to show how a gathering system can maximize a KMS's capacity for collecting basic information. |