Bayesian network parameter learning using constraint-based data extension method

Xinxin Ru; Xiaoguang Gao; Yangyang Wang; Xiaohan Liu

doi:10.1007/s10489-022-03941-2

Bayesian network parameter learning using constraint-based data extension method

Xinxin Ru, Xiaoguang Gao, Yangyang Wang, Xiaohan Liu

School of Electronics and Information

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

3 Scopus citations

Abstract

Bayesian networks (BNs) are one of the most compelling theoretical models in uncertain knowledge representation and inference. However, many domains are encountering the dilemma of insufficient data. Learning BN parameters using raw data may lead to low learning accuracy. Therefore, this paper seeks to solve the problem via two novel data extension methods. First, a constraint-based nonparametric bootstrap (CNB) method is proposed, which extends the raw data and guides the parameter distribution of the extended data through a constraint-based sample scoring function. The experimental results on 12 BNs show that the extended data can improve the parameter learning accuracy and enhance the existing parameter learning approaches. The CNB is still valid for medium and large networks with relatively large data. When the original data are of inferior quality, the CNB is unattainable to extend it. Then, a constraint-based parametric bootstrap (CPB) method is proposed, creating a new parameter distribution by constraints and the original samples. The experimental results for the missing data demonstrate that the extended data perform better. The CPB is insensitive to the proportion of missing data and remains superior in relatively large data.

Original language	English
Pages (from-to)	9958-9977
Number of pages	20
Journal	Applied Intelligence
Volume	53
Issue number	9
DOIs	https://doi.org/10.1007/s10489-022-03941-2
State	Published - May 2023

Keywords

Bayesian network
Constraints
Data extension
Parameter learning

Access to Document

10.1007/s10489-022-03941-2

Cite this

@article{d65bf817584d4e708b43b5efb259ee02,

title = "Bayesian network parameter learning using constraint-based data extension method",

abstract = "Bayesian networks (BNs) are one of the most compelling theoretical models in uncertain knowledge representation and inference. However, many domains are encountering the dilemma of insufficient data. Learning BN parameters using raw data may lead to low learning accuracy. Therefore, this paper seeks to solve the problem via two novel data extension methods. First, a constraint-based nonparametric bootstrap (CNB) method is proposed, which extends the raw data and guides the parameter distribution of the extended data through a constraint-based sample scoring function. The experimental results on 12 BNs show that the extended data can improve the parameter learning accuracy and enhance the existing parameter learning approaches. The CNB is still valid for medium and large networks with relatively large data. When the original data are of inferior quality, the CNB is unattainable to extend it. Then, a constraint-based parametric bootstrap (CPB) method is proposed, creating a new parameter distribution by constraints and the original samples. The experimental results for the missing data demonstrate that the extended data perform better. The CPB is insensitive to the proportion of missing data and remains superior in relatively large data.",

keywords = "Bayesian network, Constraints, Data extension, Parameter learning",

author = "Xinxin Ru and Xiaoguang Gao and Yangyang Wang and Xiaohan Liu",

note = "Publisher Copyright: {\textcopyright} 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.",

year = "2023",

month = may,

doi = "10.1007/s10489-022-03941-2",

language = "英语",

volume = "53",

pages = "9958--9977",

journal = "Applied Intelligence",

issn = "0924-669X",

publisher = "Springer Netherlands",

number = "9",

}

TY - JOUR

T1 - Bayesian network parameter learning using constraint-based data extension method

AU - Ru, Xinxin

AU - Gao, Xiaoguang

AU - Wang, Yangyang

AU - Liu, Xiaohan

PY - 2023/5

Y1 - 2023/5

N2 - Bayesian networks (BNs) are one of the most compelling theoretical models in uncertain knowledge representation and inference. However, many domains are encountering the dilemma of insufficient data. Learning BN parameters using raw data may lead to low learning accuracy. Therefore, this paper seeks to solve the problem via two novel data extension methods. First, a constraint-based nonparametric bootstrap (CNB) method is proposed, which extends the raw data and guides the parameter distribution of the extended data through a constraint-based sample scoring function. The experimental results on 12 BNs show that the extended data can improve the parameter learning accuracy and enhance the existing parameter learning approaches. The CNB is still valid for medium and large networks with relatively large data. When the original data are of inferior quality, the CNB is unattainable to extend it. Then, a constraint-based parametric bootstrap (CPB) method is proposed, creating a new parameter distribution by constraints and the original samples. The experimental results for the missing data demonstrate that the extended data perform better. The CPB is insensitive to the proportion of missing data and remains superior in relatively large data.

AB - Bayesian networks (BNs) are one of the most compelling theoretical models in uncertain knowledge representation and inference. However, many domains are encountering the dilemma of insufficient data. Learning BN parameters using raw data may lead to low learning accuracy. Therefore, this paper seeks to solve the problem via two novel data extension methods. First, a constraint-based nonparametric bootstrap (CNB) method is proposed, which extends the raw data and guides the parameter distribution of the extended data through a constraint-based sample scoring function. The experimental results on 12 BNs show that the extended data can improve the parameter learning accuracy and enhance the existing parameter learning approaches. The CNB is still valid for medium and large networks with relatively large data. When the original data are of inferior quality, the CNB is unattainable to extend it. Then, a constraint-based parametric bootstrap (CPB) method is proposed, creating a new parameter distribution by constraints and the original samples. The experimental results for the missing data demonstrate that the extended data perform better. The CPB is insensitive to the proportion of missing data and remains superior in relatively large data.

KW - Bayesian network

KW - Constraints

KW - Data extension

KW - Parameter learning

UR - http://www.scopus.com/inward/record.url?scp=85136997993&partnerID=8YFLogxK

U2 - 10.1007/s10489-022-03941-2

DO - 10.1007/s10489-022-03941-2

M3 - 文章

AN - SCOPUS:85136997993

SN - 0924-669X

VL - 53

SP - 9958

EP - 9977

JO - Applied Intelligence

JF - Applied Intelligence

IS - 9

ER -

Bayesian network parameter learning using constraint-based data extension method

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this