TY - JOUR
T1 - Integrating information by Kullback–Leibler constraint for text classification
AU - Yin, Shu
AU - Zhu, Peican
AU - Wu, Xinyu
AU - Huang, Jiajin
AU - Li, Xianghua
AU - Wang, Zhen
AU - Gao, Chao
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.
PY - 2023/8
Y1 - 2023/8
N2 - Text classification underpins many downstream tasks, such as fake news detection, sentiment analysis, and question answering. In recent years, graph-based methods have achieved excellent results on text classification tasks. Instead of treating a text as a sequence, these methods regard it as a set of co-occurring words and perform classification by aggregating information from neighboring nodes with a graph neural network. However, existing corpus-level graph models struggle to incorporate local semantic information and to classify newly arriving texts. To address these issues, we propose a Global–Local Text Classification (GLTC) model based on Kullback–Leibler (KL) constraints that realizes inductive learning for text classification. First, a global structural feature extractor and a local semantic feature extractor are designed to comprehensively capture the structural and semantic information of a text. Then, the KL divergence is introduced as a regularization term in the loss function, ensuring that the global structural feature extractor constrains the learning of the local semantic feature extractor to achieve inductive learning. Comprehensive experiments on benchmark datasets show that GLTC outperforms baseline methods in terms of accuracy.
KW - Constraint
KW - Graph neural network
KW - Kullback–Leibler divergence
KW - Text classification
UR - http://www.scopus.com/inward/record.url?scp=85158086909&partnerID=8YFLogxK
DO - 10.1007/s00521-023-08602-0
M3 - Article
AN - SCOPUS:85158086909
SN - 0941-0643
VL - 35
SP - 17521
EP - 17535
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 24
ER -