Theoretical Research on Data Diversity
Abstract: Diversity is an essential attribute of data, especially scientific data. With the rapid development of information technology and the rise of open research data, data diversity has become increasingly evident. The paper first elaborates on the internal and external manifestations of data diversity. The internal manifestations are the different objects involved in the scientific data production process, the trinity of data publishing, and the different data formats that disciplines use when collecting and storing data. The external manifestations are that the data curation lifecycle accelerates data diversity, that the research lifecycle increases it, and that further diversity arises as data are shaped by specific practical uses. The paper then briefly describes the common features and influencing factors of data diversity and presents its application characteristics from three aspects. For libraries and librarians, recognizing data diversity can, to some extent, help researchers cope with data deposit requirements and the pressure of data disclosure, making data reuse simpler and more consistent with an ideal data ecosystem. A data librarian therefore needs data management skills and knowledge of the laws, regulations, policies, and agreements relevant to data ethics in order to provide value-added data services to researchers.
[V1] | 2021-11-24 13:50:08 | ChinaXiv:202111.00029V1