Learning causal representations based on a GAE embedded autoencoder

Kuang Zhou; Ming Jiang; Bogdan Gabrys; Yong Xu

doi:10.1109/TKDE.2025.3546607

Learning causal representations based on a GAE embedded autoencoder

Kuang Zhou, Ming Jiang, Bogdan Gabrys, Yong Xu

School of Mathematics and Statistics

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

Traditional machine-learning approaches face limitations when confronted with insufficient data. Transfer learning addresses this by leveraging knowledge from closely related domains. The key in transfer learning is to find a transferable feature representation to enhance cross-domain classification models. However, in some scenarios, some features correlated with samples in the source domain may not be relevant to those in the target. Causal inference enables us to uncover the underlying patterns and mechanisms within the data, mitigating the impact of confounding factors. Nevertheless, most existing causal inference algorithms have limitations when applied to high-dimensional datasets with nonlinear causal relationships. In this work, a new causal representation method based on a Graph autoencoder embedded AutoEncoder, named GeAE, is introduced to learn invariant representations across domains. The proposed approach employs a causal structure learning module, similar to a graph autoencoder, to account for nonlinear causal relationships present in the data. Moreover, the cross-entropy loss as well as the causal structure learning loss and the reconstruction loss are incorporated in the objective function designed in a united autoencoder. This method allows for the handling of high-dimensional data and can provide effective representations for cross-domain classification tasks. Experimental results on generated and real-world datasets demonstrate the effectiveness of GeAE compared with the state-of-the-art methods.

Original language	English
Journal	IEEE Transactions on Knowledge and Data Engineering
DOIs	https://doi.org/10.1109/TKDE.2025.3546607
State	Accepted/In press - 2025

Keywords

Causal representation
autoencoder
causal discovery
transfer learning

Access to Document

10.1109/TKDE.2025.3546607

Cite this

@article{fb24b37db89e4b65b8b1a54c84129593,

title = "Learning causal representations based on a GAE embedded autoencoder",

abstract = "Traditional machine-learning approaches face limitations when confronted with insufficient data. Transfer learning addresses this by leveraging knowledge from closely related domains. The key in transfer learning is to find a transferable feature representation to enhance cross-domain classification models. However, in some scenarios, some features correlated with samples in the source domain may not be relevant to those in the target. Causal inference enables us to uncover the underlying patterns and mechanisms within the data, mitigating the impact of confounding factors. Nevertheless, most existing causal inference algorithms have limitations when applied to high-dimensional datasets with nonlinear causal relationships. In this work, a new causal representation method based on a Graph autoencoder embedded AutoEncoder, named GeAE, is introduced to learn invariant representations across domains. The proposed approach employs a causal structure learning module, similar to a graph autoencoder, to account for nonlinear causal relationships present in the data. Moreover, the cross-entropy loss as well as the causal structure learning loss and the reconstruction loss are incorporated in the objective function designed in a united autoencoder. This method allows for the handling of high-dimensional data and can provide effective representations for cross-domain classification tasks. Experimental results on generated and real-world datasets demonstrate the effectiveness of GeAE compared with the state-of-the-art methods.",

keywords = "Causal representation, autoencoder, causal discovery, transfer learning",

author = "Kuang Zhou and Ming Jiang and Bogdan Gabrys and Yong Xu",

note = "Publisher Copyright: {\textcopyright} 1989-2012 IEEE.",

year = "2025",

doi = "10.1109/TKDE.2025.3546607",

language = "英语",

journal = "IEEE Transactions on Knowledge and Data Engineering",

issn = "1041-4347",

publisher = "IEEE Computer Society",

}

TY - JOUR

T1 - Learning causal representations based on a GAE embedded autoencoder

AU - Zhou, Kuang

AU - Jiang, Ming

AU - Gabrys, Bogdan

AU - Xu, Yong

PY - 2025

Y1 - 2025

N2 - Traditional machine-learning approaches face limitations when confronted with insufficient data. Transfer learning addresses this by leveraging knowledge from closely related domains. The key in transfer learning is to find a transferable feature representation to enhance cross-domain classification models. However, in some scenarios, some features correlated with samples in the source domain may not be relevant to those in the target. Causal inference enables us to uncover the underlying patterns and mechanisms within the data, mitigating the impact of confounding factors. Nevertheless, most existing causal inference algorithms have limitations when applied to high-dimensional datasets with nonlinear causal relationships. In this work, a new causal representation method based on a Graph autoencoder embedded AutoEncoder, named GeAE, is introduced to learn invariant representations across domains. The proposed approach employs a causal structure learning module, similar to a graph autoencoder, to account for nonlinear causal relationships present in the data. Moreover, the cross-entropy loss as well as the causal structure learning loss and the reconstruction loss are incorporated in the objective function designed in a united autoencoder. This method allows for the handling of high-dimensional data and can provide effective representations for cross-domain classification tasks. Experimental results on generated and real-world datasets demonstrate the effectiveness of GeAE compared with the state-of-the-art methods.

AB - Traditional machine-learning approaches face limitations when confronted with insufficient data. Transfer learning addresses this by leveraging knowledge from closely related domains. The key in transfer learning is to find a transferable feature representation to enhance cross-domain classification models. However, in some scenarios, some features correlated with samples in the source domain may not be relevant to those in the target. Causal inference enables us to uncover the underlying patterns and mechanisms within the data, mitigating the impact of confounding factors. Nevertheless, most existing causal inference algorithms have limitations when applied to high-dimensional datasets with nonlinear causal relationships. In this work, a new causal representation method based on a Graph autoencoder embedded AutoEncoder, named GeAE, is introduced to learn invariant representations across domains. The proposed approach employs a causal structure learning module, similar to a graph autoencoder, to account for nonlinear causal relationships present in the data. Moreover, the cross-entropy loss as well as the causal structure learning loss and the reconstruction loss are incorporated in the objective function designed in a united autoencoder. This method allows for the handling of high-dimensional data and can provide effective representations for cross-domain classification tasks. Experimental results on generated and real-world datasets demonstrate the effectiveness of GeAE compared with the state-of-the-art methods.

KW - Causal representation

KW - autoencoder

KW - causal discovery

KW - transfer learning

UR - http://www.scopus.com/inward/record.url?scp=85219345163&partnerID=8YFLogxK

U2 - 10.1109/TKDE.2025.3546607

DO - 10.1109/TKDE.2025.3546607

M3 - 文章

AN - SCOPUS:85219345163

SN - 1041-4347

JO - IEEE Transactions on Knowledge and Data Engineering

JF - IEEE Transactions on Knowledge and Data Engineering

ER -

Learning causal representations based on a GAE embedded autoencoder

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this