ChinaXiv.org 中国科学院科技论文预发布平台

Reg Login

EN | 中文

Submitted Date

2023
1

Subjects

Information Science
1

Authors

Institution

result total 1.

Hide Summary

Hits

Date

Your conditions: 陈仕

1. ChinaXiv:202304.00684
Download

Discussion on Using Transfer Learning to Accurately Identify Domain Information

Subjects: Library Science，Information Science >> Information Science submitted time 2023-04-01 Cooperative journals: 《图书情报工作》

Lu Quan Hao Zhitong Chen Jing Chen Shi Zhu Anqi

Abstract： [Purpose/significance] To solve the problem that the identification effect of the target domain information is difficult to improve because of not enough samples, we will transfer the results of unsupervised learning from big data to the feature space of the target domain. [Method/process] Used the RoBERTa model, which was pre-trained with Chinese Wikipedia and other data, for transfer learning. After mapping the learning results to the target domain, DPCNN was used to aggregate and condense it, and then fine-tuned the model with part of the labeled data to complete the accurate recognition of domain information. [Result/conclusion] Compared with the model without transfer learning and the classic model TextCNN in 10 fields, the model in this paper is much better than the comparison models. After average, the precision is increased by 4.15% and 3.43%, the recall is increased by 4.55% and 3.44%, and the F1 score is increased by 4.52% and 3.44%. It shows that knowledge transfer using big data can effectively improve the information recognition effect in the target field.

Hits 215 Downloads 106 Comment 0

友情链接: ChinaXiv PubScholar 哲学社会科学预印本

Operating Unit: National Science Library，Chinese Academy of Sciences
Production Maintenance: National Science Library，Chinese Academy of Sciences
Mail: eprint@mail.las.ac.cn
Address: 33 Beisihuan Xilu,Zhongguancun,Beijing P.R.China

Recruiting preprint review experts License Information Term & Conditions