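Wang Dong, Wang Piao, Jiang Junpeng, Li Qing, Xu Chenyang. Research on the Duplicate Checking Method for Scientific and Technical Project Applications [J]. China Science & Technology Resources Review (中国科技资源导刊), 2022, (5): 30-40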
Research on the Duplicate Checking Method for Scientific and Technical Project Applications
Submitted: 2022-07-08
Keywords: scientific and technical project applications; DSSM architecture; text similarity; duplicate-checking algorithm; duplicate-checking system
Authors: Wang Dong, Wang Piao, Jiang Junpeng, Li Qing, Xu Chenyang
Affiliation: Institute of Scientific and Technical Information of China, Beijing 100038
Abstract:
Research on duplicate-checking methods for scientific and technical project applications is important for strengthening academic integrity and fostering a clean research environment. Work on duplicate checking of such applications is still at an early stage, and existing approaches suffer from unclear system architecture and low algorithm accuracy. To address these problems, this paper builds a system model that covers application data processing, distributed task handling, a duplicate-checking algorithm module, and duplicate-checking report generation, and it proposes a similarity detection model based on the DSSM architecture for the algorithm module. Experimental results show that the system achieves high duplicate-checking accuracy and efficiency, which suggests it can play a useful role in checking scientific and technical project applications for duplication.