ChinaXiv.org: Chinese Academy of Sciences Preprint Platform for Scientific Papers

15 results in total.

Your conditions: College of Information Science and Technology, Nanjing Agricultural University, Nanjing 210095

1. ChinaXiv:202308.00608

Research on Collaboration Network of Scientific Data: Taking Clinical Trial Data of ClinicalTrials.gov as an Example

Subjects: Library Science, Information Science >> Library Science; Submitted: 2023-08-27; Cooperative journal: Library and Information Service (《图书情报工作》)

Xu Xiaojie, He Lin, Shao Bo

Abstract: [Purpose/significance] Based on scientific data, we constructed a new collaboration network and compared it with the collaboration network of traditional publications. By exploring the differences between the two networks from the perspective of network analysis, this paper provides a reference for scientific data management. [Method/process] Taking the clinical science database of the ClinicalTrials.gov website as an example, this paper used crawler technology to capture the metadata of traditional publications and of clinical trial research. Based on these two kinds of metadata, it then constructed a separate collaboration network for each. Finally, it used complex network analysis to compare the similarities and differences between the two networks. [Result/conclusion] A scientific collaboration network built from the metadata of both a scientific data repository and publications can provide richer and more accurate information on collaboration than one built from publication metadata alone. This paper reveals the importance of scientific data management and open sharing.

Hits 464 Downloads 160 Comment 0
2. ChinaXiv:202308.00643

Research of Automatic Extraction of Entities of Data Science Recruitment and Analysis Based on Deep Learning

Subjects: Library Science, Information Science >> Library Science; Submitted: 2023-08-27; Cooperative journal: Library and Information Service (《图书情报工作》)

Wang Dongbo, Hu Haotian, Zhou Xin, Zhu Danhao

Abstract: [Purpose/significance] Data science is emerging as a new interdisciplinary field that combines many disciplines. Extracting entity knowledge from data science recruitment announcements can not only help to understand the development of data science from a market perspective, but also help to improve the content of data science teaching. [Method/process] Based on recruitment announcements from a recruitment website, and combining information science methods of data collection, annotation and organization, a data science corpus was constructed and the corresponding entities were extracted from it. [Result/conclusion] On an annotated corpus of 11,000 data science recruitment announcements, this paper compared the entity extraction performance of the Bi-LSTM-CRF, CRF and Bi-LSTM models, selected the final model for automatic extraction of data science recruitment entities, designed an automatic extraction platform for these entities, and built a network of data science recruitment entities.

Hits 412 Downloads 149 Comment 0
3. ChinaXiv:202308.00259

A Comparative Study of Model Performances Facing Abstract Structure Function

Subjects: Library Science, Information Science >> Library Science; Submitted: 2023-08-26; Cooperative journal: Library and Information Service (《图书情报工作》)

Wang Dongbo, Lu Haoxiang, Zhou Xin, Zhu Danhao

Abstract: [Purpose/significance] An abstract concisely states the research purpose, the research methods and the conclusions of a paper, and is therefore of high value as an object of study. [Method/process] In this paper, four models (long short-term memory, support vector machine, LSTM-CRF and CNN-CRF) were selected to identify the structural functions of the abstracts of 3672 journal articles from the CNKI database. [Result/conclusion] The highest F value of the long short-term memory network model is 69.15%, the highest F value of the LSTM-CRF neural network model is 88.76%, and the highest F value of the RNN-CRF model is 89.10%. The highest macro F value of the support vector machine classifier is 72.04%. The experimental results provide a useful reference for selecting models for the functional structure analysis of academic papers in the field of library and information science.

Hits 570 Downloads 197 Comment 0
4. ChinaXiv:202307.00295

Construction, Performance and Application of New Era People's Daily Segmented Corpus (Ⅲ)——Analysis and Comparison of Sentence Length and Word

Subjects: Library Science, Information Science >> Library Science; Submitted: 2023-07-26; Cooperative journal: Library and Information Service (《图书情报工作》)

Huang Shuiqing, Wang Dongbo

Abstract: [Purpose/significance] Statistics and analysis of sentence length in different dimensions and of vocabulary distribution, based on the New Era People's Daily (NEPD) word segmentation corpus, not only support a relatively comprehensive and systematic understanding of the linguistic characteristics of contemporary Chinese text, but also benefit subsequent natural language processing and text mining of that text. [Method/process] Based on the word segmentation data of People's Daily for January 2018 and for January 1998, six sentence categories were defined for the statistics, the sentence length distributions in character units and in word units were counted and analyzed, and the static distribution of words was described using Zipf's law. [Result/conclusion] In terms of both the sentence length distribution in the word dimension and the Zipf distribution of vocabulary, sentence length and vocabulary distribution changed between the 1998 and 2018 corpora, but the change is continuous and the two corpora remain related.

Hits 479 Downloads 123 Comment 0
5. ChinaXiv:202307.00312

Construction, Performance and Application of New Era People's Daily Segmented Corpus (Ⅱ)——Constructing Automatic Word Segmentation Model of Deep Learning

Subjects: Library Science, Information Science >> Library Science; Submitted: 2023-07-26; Cooperative journal: Library and Information Service (《图书情报工作》)

Huang Shuiqing, Wang Dongbo

Abstract: [Purpose/significance] Building a deep learning automatic word segmentation model on the New Era People's Daily (NEPD) word segmentation corpus can not only provide experience for constructing high-performance word segmentation models, but also verify the performance of the corresponding deep learning models on a specific natural language processing task. [Method/process] After introducing Bi-directional Long Short-Term Memory (Bi-LSTM) and Bi-directional Long Short-Term Memory with conditional random field (Bi-LSTM-CRF), this paper described the process, types and conditions of Chinese word segmentation preprocessing, as well as the evaluation indexes, parameters and hardware platform. The Bi-LSTM and Bi-LSTM-CRF Chinese automatic word segmentation models were then constructed, and their overall performance was analyzed. [Result/conclusion] The overall performance of the Bi-LSTM and Bi-LSTM-CRF Chinese automatic word segmentation models is relatively reasonable on the three indexes of precision, recall and F value. In terms of specific performance, the Bi-LSTM word segmentation model is superior to the Bi-LSTM-CRF word segmentation model, but the difference is very small.

Hits 421 Downloads 189 Comment 0
6. ChinaXiv:202307.00327

Construction, Performance and Application of New Era People's Daily Segmented Corpus (I)——Construction and Evaluation of Corpus

Subjects: Library Science, Information Science >> Library Science; Submitted: 2023-07-26; Cooperative journal: Library and Information Service (《图书情报工作》)

Huang Shuiqing, Wang Dongbo

Abstract: [Purpose/significance] The construction of a segmented corpus of People's Daily for the new era provides a new annotated corpus for Chinese information processing, and also offers new language resources for analyzing modern Chinese from a diachronic perspective. [Method/process] The data source, annotation specification and construction process of the corpus were explained on the basis of an analysis of existing Chinese word segmentation corpora. In addition, the performance of the corpus was evaluated by building an automatic word segmentation model and comparing the results with those for an existing corpus. [Result/conclusion] The New Era People's Daily Segmented Corpus (NEPD), which is large in scale and covers a long time span, follows the basic processing standards of modern Chinese corpora. The January 2018 portion of NEPD was selected to build a segmentation model based on the conditional random field model. The performance of the January 2018 People's Daily corpus was evaluated and compared with that of the January 1998 People's Daily corpus. The evaluation indexes obtained show that the overall performance of the new era People's Daily corpus is relatively outstanding. The 1998 corpus cannot be replaced, but constructing the NEPD is nevertheless necessary.

Hits 445 Downloads 147 Comment 0
7. ChinaXiv:202307.00403

Research on the Development of Digital Reading Promotion Model in Libraries

Subjects: Library Science, Information Science >> Library Science; Submitted: 2023-07-26; Cooperative journal: Library and Information Service (《图书情报工作》)

Peng Aidong, Xing Sisi, Mao Yihong

Abstract: [Purpose/significance] This paper aims to provide suggestions for the future development of digital reading promotion in libraries, based on an investigation of how digital reading promotion has developed in libraries in China. [Method/process] Taking the main public libraries and academic libraries in China as the research objects, and their websites, WeChat accounts and Weibo accounts as data sources, this paper investigates the current state of digital reading promotion in libraries by means of a network survey and a literature survey. [Result/conclusion] Digital reading promotion in libraries in China is developing rapidly. The present models of digital reading promotion include activities, publications, interpersonal communication, advertisements, courses, navigation and recommendation. However, development is unbalanced: activities and recommendation are the main models. There is still much room for the development of digital reading promotion in libraries.

Hits 296 Downloads 127 Comment 0
8. ChinaXiv:202307.00544

Research on the Collaboration Relationship Between Government and Social Organizations in Public Libraries Service Supply

Subjects: Library Science, Information Science >> Library Science; Submitted: 2023-07-26; Cooperative journal: Library and Information Service (《图书情报工作》)

Li Yang

Abstract: [Purpose/significance] Against the background of the transformation of government functions, cooperation between government and social organizations in providing public library services is an effective measure for reforming the public library management system, and an important development trend for public libraries. Studying the relationship between government and social organizations can not only enrich the relevant theory, but also provide a reference for developing this collaboration in public library service supply. [Method/process] Through literature research, a network survey and field research, this paper discusses the concept and characteristics of the collaboration relationship between government and social organizations in public library service supply, and analyzes in depth the roles of the different parties and the causes, types and formation process of the relationship. On this basis, it puts forward countermeasures for the sustainable development of the collaboration relationship. [Result/conclusion] Establishing a good collaboration relationship requires correcting collaborative motivations, improving collaborative systems and standardizing collaborative behaviors.

Hits 326 Downloads 131 Comment 0
9. ChinaXiv:202304.00148

Observation and Thinking About Online Reading Service During the Period of National Prevention of COVID-19

Subjects: Library Science, Information Science >> Information Science; Submitted: 2023-04-01; Cooperative journal: Library and Information Service (《图书情报工作》)

Mao Yihong

Abstract: [Purpose/significance] This paper summarizes and analyzes the online reading service measures taken in China during the period of national prevention of COVID-19, and provides a reference for improving online reading services in China. [Method/process] The online reading services of publishing houses, libraries and other reading service institutions were tracked online on WeChat, Weibo, websites and other platforms, and their performance was summarized and analyzed. [Result/conclusion] At present, all kinds of reading service institutions have urgently shifted their service focus to online services. In response to the diverse reading needs of different readers, they have increased system support, published epidemic prevention publications on an emergency basis, and expanded the online dissemination of high-quality books. They have generally performed well, but some problems have also been exposed. This pressure should be turned into an impetus to further improve the online reading service system and strengthen comprehensive reading service capability.

Hits 195 Downloads 86 Comment 0
10. ChinaXiv:202304.00299

A Research on the Visualization and Metric Analysis of War in Zuo Zhuan Based on Social Network Analysis

Subjects: Library Science, Information Science >> Information Science; Submitted: 2023-04-01; Cooperative journal: Library and Information Service (《图书情报工作》)

Fan Wenjie, Li Zhongkai, Huang Shuiqing

Abstract: [Purpose/significance] The development of digital humanities has aroused widespread interest in the social sciences and humanities. It uses convenient and efficient computing technology to extract latent information from massive data resources and unstructured text, and presents that information to users in a more intuitive and clear way. [Method/process] This paper took the wars described in Zuo Zhuan as the research object, and extracted the strategic attackers and strategic defenders of each war from the sentences describing wars. From the perspective of digital humanities, it explored the feasibility of using social network analysis methods to describe the changes in the pattern of warfare in the Spring and Autumn Period. On this basis, the vassal states of the Spring and Autumn Period were divided into groups according to their relationships of wartime cooperation and wartime confrontation. The main groups and the core vassal states were analyzed and discussed one by one. In addition, the wars in Zuo Zhuan were displayed dynamically using three technologies: HTML, CSS and ECharts. [Result/conclusion] We provided a method for extracting war information about the Spring and Autumn Period from the unstructured text of Zuo Zhuan and organizing it into quantifiable data. The study showed that it is feasible to present the relationships between the vassal states of the Spring and Autumn Period from the perspective of war, and also demonstrated the feasibility and great potential of digital humanities technology in humanities and historical research.

Hits 226 Downloads 96 Comment 0
11. ChinaXiv:202304.00337

Creating the Knowledge Service Value of Library with Users——Analysis and Enlightenment of User Wisdom Integrated in Yunzhou Knowledge Space

Subjects: Library Science, Information Science >> Information Science; Submitted: 2023-04-01; Cooperative journal: Library and Information Service (《图书情报工作》)

Li Yang, Zheng Dejun

Abstract: [Purpose/significance] The value of user participation in knowledge services has attracted increasing attention in academic circles. Analyzing a case in which user wisdom is integrated into a digital knowledge space can provide new ideas and methods for innovation in library knowledge services. [Method/process] From the perspective of value co-creation, this paper analyzed the role orientation of users, the stage-by-stage strategy for integrating user wisdom, and the guarantee mechanism for user wisdom integration in the Yunzhou knowledge space. It then summarized the mode and its advantages. [Result/conclusion] The integration of user wisdom in the Yunzhou knowledge space offers three lessons for library knowledge service innovation: attach importance to the value of user wisdom integration, with joint efforts from the supply side and the demand side; identify the value orientation of users and implement differentiated incentive measures; and attend to the value experience of users and establish diverse interaction mechanisms.

Hits 221 Downloads 128 Comment 0
12. ChinaXiv:202304.00375

The Definition of Smart Reading Service and Its Related Research

Subjects: Library Science, Information Science >> Information Science; Submitted: 2023-04-01; Cooperative journal: Library and Information Service (《图书情报工作》)

Mao Yihong, Zhu Lingling, Han Yan

Abstract: [Purpose/significance] This paper defines the concept of smart reading service, reviews the relevant research in China over the past three years, and provides a reference for research and development in this field. [Method/process] Relevant research in China since 2017 was retrieved through CNKI, the literature was reviewed, research hotspots were analyzed, an analysis framework for smart reading service was summarized, and directions for future research were outlined. [Result/conclusion] Since 2017, research in China on smart reading service has mainly covered smart reading service systems and platforms, users, service content and strategy, service evaluation, and service management. Service content and strategy, and users, are the two hot research directions, while big data, artificial intelligence and virtual reality technologies are the hot research topics. Smart reading service will be a research hotspot in the future, and technology and users will be the focus of future research.

Hits 205 Downloads 125 Comment 0
13. ChinaXiv:202304.00710

Construction and Application of Entity Recognition Model Based on Deep Learning of Classics in Digital Humanities

Subjects: Library Science, Information Science >> Information Science; Submitted: 2023-04-01; Cooperative journal: Library and Information Service (《图书情报工作》)

Du Yue, Wang Dongbo, Jiang Chuan, Xu Runhua, Li Bin, Xu Chao, Xu Chenfei

Abstract: [Purpose/significance] The classics are the carrier of traditional Chinese culture, thought and wisdom. Automatic entity recognition for the classics, combined with the data acquisition, annotation and analysis methods of digital humanities, is of great significance for subsequent applied research. [Method/process] A corpus was constructed from 25 pre-Qin texts that had been automatically segmented and manually annotated. Using corpora of different sizes and seven deep learning models (Bi-LSTM, Bi-LSTM-Attention, Bi-LSTM-CRF, Bi-LSTM-CRF-Attention, Bi-RNN, Bi-RNN-CRF and BERT), we extracted the entities that constitute historical events and compared the models' performance. [Result/conclusion] The accuracy of the Bi-LSTM-Attention and Bi-RNN-CRF models trained on the full corpus reached 89.79% and 89.33%, respectively, confirming the feasibility of applying deep learning to large-scale text datasets.

Hits 282 Downloads 154 Comment 0
14. ChinaXiv:201711.01951

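Research on Entity Extraction of Food Safety Events Based on Multi-Feature Knowledge

Subjects: Library Science, Information Science >> Information Science; Submitted: 2017-11-08; Cooperative journal: Data Analysis and Knowledge Discovery (《数据分析与知识发现》)

Wang Dongbo, Wu Yi, Ye Wenhao, Liu Ruilun

Abstract: [Objective] To extract food safety event entities from large-scale food safety events. [Methods] Based on food safety events that have already occurred, and combining information science methods of data acquisition, annotation and organization, a food safety event corpus was constructed. Knowledge of the multiple distributional features of food safety event entities was integrated, and the corresponding entities were extracted from the corpus with a conditional random field model. [Limitations] The feature templates defined during entity extraction have certain limitations when transferred to other domains. [Results] On an annotated food safety event corpus of 15 million characters, the internal and external features of food safety event entities were counted, and an extraction model for food safety entities was built on the conditional random field machine learning model. The highest F value of the model reached 91.94%. [Conclusions] Analysis of the extraction results shows that entity extraction based on conditional random fields is feasible on a domain-specific corpus in the food field.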

Hits 2521 Downloads 1423 Comment 0
15. ChinaXiv:201711.01252

Automatic Extraction of Domain Terms Using the Continuous Bag-of-Words (CBOW) Model

Subjects: Library Science, Information Science >> Information Science; Submitted: 2017-10-11; Cooperative journal: Data Analysis and Knowledge Discovery (《数据分析与知识发现》)

Jiang Lin, Wang Dongbo

Abstract: [Objective] This study tries to extract domain terms more accurately and conveniently. [Methods] First, we proposed a method that uses the CBOW model to build word vectors for each component of the terms. Then, we applied cosine similarity to calculate the degree of internal correlation among each term's individual components. To obtain more representative terms, we used the PageRank algorithm to rank the candidates. [Results] We obtained high recall and precision rates using paper abstracts in the field of natural language processing as the training pool. [Limitations] The training pool was relatively small, which might influence the

Hits 2741 Downloads 1832 Comment 0
