Feature Analysis Network: An Interpretable Idea in Deep Learning

Xinyu Li; Xiaoguang Gao; Qianglong Wang; Chenfeng Wang; Bo Li; Kaifang Wan

doi:10.1007/s12559-023-10238-0

Feature Analysis Network: An Interpretable Idea in Deep Learning

Xinyu Li, Xiaoguang Gao, Qianglong Wang, Chenfeng Wang, Bo Li, Kaifang Wan

School of Electronics and Information

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

5 Scopus citations

Abstract

Deep Learning (DL) stands out as a leading model for processing high-dimensional data, where the nonlinear transformation of hidden layers effectively extracts features. However, these unexplainable features make DL a low interpretability model. Conversely, Bayesian network (BN) is transparent and highly interpretable, and it can be helpful for interpreting DL. To improve the interpretability of DL from the perspective of feature cognition, we propose the feature analysis network (FAN), a DL structure fused with BN. FAN retains the DL feature extraction capability and applies BN as the output layer to learn the relationships between the features and the outputs. These relationships can be probabilistically represented by the structure and parameters of the BN, intuitively. In a further study, a correlation clustering-based feature analysis network (cc-FAN) is proposed to detect the correlations among inputs and to preserve this information to explain the features’ physical meaning to a certain extent. To quantitatively evaluate the interpretability of the model, we design the network simplification and interpretability indicators separately. Experiments on eight datasets show that FAN has better interpretability than that of the other models with basically unchanged model accuracy and similar model complexities. On the radar effect mechanism dataset, from the feature structure-based relevance interpretability indicator, FAN is up to 4.8 times better than that of the other models, and cc-FAN is up to 21.5 times better than that of the other models. FAN and cc-FAN enhance the interpretability of the DL model structure from the aspects of features; moreover, based on the input correlations, cc-FAN can help us to better understand the physical meaning of features.

Original language	English
Pages (from-to)	803-826
Number of pages	24
Journal	Cognitive Computation
Volume	16
Issue number	3
DOIs	https://doi.org/10.1007/s12559-023-10238-0
State	Published - May 2024

Keywords

Bayesian networks
Correlation clustering
Deep learning
Feature analysis

Access to Document

10.1007/s12559-023-10238-0

Cite this

@article{77a18714e10a495d9d5730964224f952,

title = "Feature Analysis Network: An Interpretable Idea in Deep Learning",

abstract = "Deep Learning (DL) stands out as a leading model for processing high-dimensional data, where the nonlinear transformation of hidden layers effectively extracts features. However, these unexplainable features make DL a low interpretability model. Conversely, Bayesian network (BN) is transparent and highly interpretable, and it can be helpful for interpreting DL. To improve the interpretability of DL from the perspective of feature cognition, we propose the feature analysis network (FAN), a DL structure fused with BN. FAN retains the DL feature extraction capability and applies BN as the output layer to learn the relationships between the features and the outputs. These relationships can be probabilistically represented by the structure and parameters of the BN, intuitively. In a further study, a correlation clustering-based feature analysis network (cc-FAN) is proposed to detect the correlations among inputs and to preserve this information to explain the features{\textquoteright} physical meaning to a certain extent. To quantitatively evaluate the interpretability of the model, we design the network simplification and interpretability indicators separately. Experiments on eight datasets show that FAN has better interpretability than that of the other models with basically unchanged model accuracy and similar model complexities. On the radar effect mechanism dataset, from the feature structure-based relevance interpretability indicator, FAN is up to 4.8 times better than that of the other models, and cc-FAN is up to 21.5 times better than that of the other models. FAN and cc-FAN enhance the interpretability of the DL model structure from the aspects of features; moreover, based on the input correlations, cc-FAN can help us to better understand the physical meaning of features.",

keywords = "Bayesian networks, Correlation clustering, Deep learning, Feature analysis",

author = "Xinyu Li and Xiaoguang Gao and Qianglong Wang and Chenfeng Wang and Bo Li and Kaifang Wan",

note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.",

year = "2024",

month = may,

doi = "10.1007/s12559-023-10238-0",

language = "英语",

volume = "16",

pages = "803--826",

journal = "Cognitive Computation",

issn = "1866-9956",

publisher = "Springer New York",

number = "3",

}

TY - JOUR

T1 - Feature Analysis Network

T2 - An Interpretable Idea in Deep Learning

AU - Li, Xinyu

AU - Gao, Xiaoguang

AU - Wang, Qianglong

AU - Wang, Chenfeng

AU - Li, Bo

AU - Wan, Kaifang

N1 - Publisher Copyright: © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

PY - 2024/5

Y1 - 2024/5

N2 - Deep Learning (DL) stands out as a leading model for processing high-dimensional data, where the nonlinear transformation of hidden layers effectively extracts features. However, these unexplainable features make DL a low interpretability model. Conversely, Bayesian network (BN) is transparent and highly interpretable, and it can be helpful for interpreting DL. To improve the interpretability of DL from the perspective of feature cognition, we propose the feature analysis network (FAN), a DL structure fused with BN. FAN retains the DL feature extraction capability and applies BN as the output layer to learn the relationships between the features and the outputs. These relationships can be probabilistically represented by the structure and parameters of the BN, intuitively. In a further study, a correlation clustering-based feature analysis network (cc-FAN) is proposed to detect the correlations among inputs and to preserve this information to explain the features’ physical meaning to a certain extent. To quantitatively evaluate the interpretability of the model, we design the network simplification and interpretability indicators separately. Experiments on eight datasets show that FAN has better interpretability than that of the other models with basically unchanged model accuracy and similar model complexities. On the radar effect mechanism dataset, from the feature structure-based relevance interpretability indicator, FAN is up to 4.8 times better than that of the other models, and cc-FAN is up to 21.5 times better than that of the other models. FAN and cc-FAN enhance the interpretability of the DL model structure from the aspects of features; moreover, based on the input correlations, cc-FAN can help us to better understand the physical meaning of features.

AB - Deep Learning (DL) stands out as a leading model for processing high-dimensional data, where the nonlinear transformation of hidden layers effectively extracts features. However, these unexplainable features make DL a low interpretability model. Conversely, Bayesian network (BN) is transparent and highly interpretable, and it can be helpful for interpreting DL. To improve the interpretability of DL from the perspective of feature cognition, we propose the feature analysis network (FAN), a DL structure fused with BN. FAN retains the DL feature extraction capability and applies BN as the output layer to learn the relationships between the features and the outputs. These relationships can be probabilistically represented by the structure and parameters of the BN, intuitively. In a further study, a correlation clustering-based feature analysis network (cc-FAN) is proposed to detect the correlations among inputs and to preserve this information to explain the features’ physical meaning to a certain extent. To quantitatively evaluate the interpretability of the model, we design the network simplification and interpretability indicators separately. Experiments on eight datasets show that FAN has better interpretability than that of the other models with basically unchanged model accuracy and similar model complexities. On the radar effect mechanism dataset, from the feature structure-based relevance interpretability indicator, FAN is up to 4.8 times better than that of the other models, and cc-FAN is up to 21.5 times better than that of the other models. FAN and cc-FAN enhance the interpretability of the DL model structure from the aspects of features; moreover, based on the input correlations, cc-FAN can help us to better understand the physical meaning of features.

KW - Bayesian networks

KW - Correlation clustering

KW - Deep learning

KW - Feature analysis

UR - http://www.scopus.com/inward/record.url?scp=85182651873&partnerID=8YFLogxK

U2 - 10.1007/s12559-023-10238-0

DO - 10.1007/s12559-023-10238-0

M3 - 文章

AN - SCOPUS:85182651873

SN - 1866-9956

VL - 16

SP - 803

EP - 826

JO - Cognitive Computation

JF - Cognitive Computation

IS - 3

ER -

Feature Analysis Network: An Interpretable Idea in Deep Learning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this