王瑞云,贾君枝.基于用户适用度的开放数据质量提升研究[J].数字图书馆论坛,2018,(12):18~26 |
基于用户适用度的开放数据质量提升研究 |
The Research of Improving Open Data Quality Based on Fitness for Use |
投稿时间:2018-11-15 |
DOI:10.3772/j.issn.1673-2286.2018.12.003 |
中文关键词: 开放数据质量;用户适用度;需求匹配;下载浏览比 |
英文关键词: Open Data Quality; Fitness for Use; Demand Matching; Ratio between Downloads and Reviews |
基金项目:本研究得到国家社会科学基金重点项目"基于关联数据的中文名称规范档语义描述及数据聚合研究"(编号:15ATQ004)资助. |
|
摘要点击次数: 2137 |
全文下载次数: 1412 |
中文摘要: |
本文研究如何提高开放数据质量以更好地满足用户的应用需求.先分析用户需求匹配的行为过程,以北京开放数据门户网站的个体数据集为基本研究对象,选取浏览次数、下载次数和下载浏览比作为外部行为结果指标;然后分析外部指标与数据集的主题、元数据说明、及时性,以及数据表列数、行数等内在质量指标的可能的正相关关系;从相关分析中发现极端不符合正相关的异常数据集,联系这些数据集的用户选择情景深入讨论,提出针对这些异常数据集的质量提升建议. |
英文摘要: |
This paper aims at improving open data quality on fitness for use. Exploring behaviors of users’ demand matching, using bjdata.gov.cn as study case, selecting downloads, reviews of every dataset and their express computing as output indices, we research the possibility of positive relationship of between above indices and other dataset’s inner quality indices which containing content theme, metadata, timeliness, columns and rows of data resource table. More importantly we find out many exceptional datasets that don’t extremely confirm to the positive relationship, and discuss further those datasets on their user selection context. Finally we suggest on quality improving for those exception datasets. |
查看全文
查看/发表评论 下载PDF阅读器 |
关闭 |
|
|
|