Your conditions: 曲云鹏
  • Research on the Theoretical Framework of Web Archiving Data Quality Assurance Strategies

    Subjects: Library Science,Information Science >> Library Science submitted time 2023-10-08 Cooperative journals: 《知识管理论坛》

    Abstract: [Purpose/significance] Quality assurance is one of the most important procedures in web archiving, it runs throughout the whole web archiving work and affects the success odds of web archiving work. [Method/process] In this article, we made an analysis and comparative study for the quality assurance strategies of domestic and foreign web archiving organizations, and proposed a strategic theoretical framework for data quality assurance. [Result/conclusion] The framework in this article is a data-centered design, it includes a series of criteria and operating specifications, carries out data quality inspection throughout the collecting procedure by using semi-automatic auxiliary tools. Meanwhile, to ensure access to high quality archive data, the framework also takes team building, running environment maintenance and authorized backup to the websites as supplementary means.

  • An Overview on the Computing Method of the Lexical Chain Text Representation Model

    Subjects: Library Science,Information Science >> Library Science submitted time 2023-10-08 Cooperative journals: 《知识管理论坛》

    Abstract: [Purpose/significance] Text representation is an important step in intelligence processing. An excellent text representation model can reflect the document content precisely and sufficiently. Besides, it can improve the processing effect. It can be broadly applied in the fields of automatic abstracting and text segmentation. [Method/process] In this article, we collected the related documents and analyzed them. The construction methods and disambiguation in the lexical chain computing were classified and concluded. The computing method of the lexical chain relation included the computing method based on semantic association, the computing method based on statistical information and the computing method based on charts. The semantic disambiguation was important in the construction of the lexical chain, which directly affected the results and efficiency of the lexical chain construction. [Result/conclusion] The lexical chain text representation can be easily constructed and broadly applied. There are still some problems in the text representation model of the lexical chain. For example, there are many limitations to construct it by dictionaries, which does not take the context into consideration. The lexical chain model will possibly develop towards the fusion semantic relation method, the statistical algorithm and the context analysis of distributed semantics in the future.

  • Integration of Public Knowledge Space and Public Cultural Space——Exploration of National Science Library's Blending into Public Cultural Service System

    Subjects: Library Science,Information Science >> Information Science submitted time 2023-04-01 Cooperative journals: 《图书情报工作》

    Abstract: [Purpose/significance] As the important public knowledge space and public cultural space, libraries have the responsibility to innovate their service modes, promote their public cultural service levels, and adapt to the international development tendency.[Method/process] This paper analyzed the integration tactics and innovative cultural service mode, based on the establishment of the Museum of Chinese Academy of Sciences History at National Science Library.[Result/conclusion] The emphases to improve public cultural service are focused on strengthening the participation function of libraries in social construction, stimulating research on linked data, service, and business promotion, developing the educational functions of libraries to form lifelong education bases, integrating into the construction of national public digital cultural services to shape a new mode of cultural services based on big data, improving mobile terminal services to provide intelligent cultural services based on contexts, promoting multiple exchanges and cooperation to promote the sharing of cultural service resources, developing the mechanism of rational use of data and data security.

  • 一种分布式语义增强的词汇链文本表示模型构建方法

    Subjects: Library Science,Information Science >> Information Science submitted time 2017-11-08 Cooperative journals: 《数据分析与知识发现》

    Abstract:【目的】利用分布式语义关联计算词衔接关系, 解决目前词汇链构建时存在的词间关系探测深度不够等问 题, 提高词汇链构建质量。【方法】对词汇链构建的技术方法进行归纳, 利用 WordNet 词典关系来计算文本中语 言单元的语义关联, 利用分布式记忆模型来计算语言单元之间的潜在语义关系, 将这两种语义关系结合起来实 现词汇链文本表示模型的构建。同时在理论研究的基础之上选择医学领域科技论文进行对比实验。【结果】在文 本主题描述方面, 本文方法的词汇链构建结果要优于非贪婪算法, 算法耗时与非贪婪算法相当。【局限】算法耗 时较长; 没有完整考虑词衔接关系; 只在对医学领域科技文献的主题识别中验证了该方法的有效性, 还需要在 更多领域进行证明。【结论】分布式语义关联可以识别潜在语义, 对使用多元短语构建词汇链也有较大的帮助, 能 有效地增强词汇链构建效果。