Multi-label Patent Classification using Attention-Aware Deep Learning Model

  • Roudsari, Arousha Haghighian
  • Afshar, Jafar
  • Lee, Charles Cheolgi
  • Lee, Wookey
Citations

WEB OF SCIENCE

23
Citations

SCOPUS

32

초록

Patent classification is challenging and essential for any further patent analysis task. We tackle the classification task on lower level patent classification (subgroup level) by using AttentionXML. Recently, pretraining methods for Natural Language Processing (NLP), such as DistiIBERT pre-trained model, have achieved state-of-the-art results on sonic NLP tasks such as text classification. In this work, we focus on investigating the effect of applying DistiIBERT pre-trained model and fine-tuning it for the important task of multi-label patent classification. Moreover, the large USPTO-3M dataset (3,050,625 patents) based on CPC subclass and subgroup level is used for the purpose of comparing previous deep learning related studies.

키워드

patent classificationmulti-level text classificationDistilBERTAttentionXML
제목
Multi-label Patent Classification using Attention-Aware Deep Learning Model
저자
Roudsari, Arousha HaghighianAfshar, JafarLee, Charles CheolgiLee, Wookey
DOI
10.1109/BigComp48618.2020.000-2
발행일
2020
유형
Proceedings Paper
저널명
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020)
페이지
558 ~ 559