Code Multiview Hypergraph Representation Learning for Software Defect Prediction

Shaojian Qiu, Mengyang Huang, Yun Liang, Chaoda Peng, Yuan Yuan

Research output: Contribution to journal › Article › peer-review

3 Scopus citations

Abstract

Software defect prediction helps reliability assurance teams identify defect-prone code and allocate limited testing resources sensibly. Recent work has assumed that the topological associations among code fragments can be harnessed to build defect prediction models. However, existing graph-based methods concentrate on the features of a single-view association and therefore fail to capture the rich information hidden in the code. In addition, a software defect may involve multiple code fragments simultaneously, yet traditional binary graph structures, in which each edge links only two nodes, cannot adequately represent such multivariate associations. To address these two challenges, this article proposes a multiview hypergraph representation learning approach (MVHR-DP) to strengthen code features for defect prediction. MVHR-DP first builds a hypergraph structure for each code view and then merges these structures into a single fusion hypergraph. A hypergraph neural network is then built to extract code features across the multiple views and their complex associations, yielding a more comprehensive representation of the modeling data. An empirical study shows that prediction models using features generated by MVHR-DP achieve better area under the curve (AUC), F-measure, and Matthews correlation coefficient (MCC) results than baseline approaches across within-project, cross-version, and cross-project prediction tasks.
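The pipeline outlined in the abstract (per-view hypergraphs, fusion, then a hypergraph neural network) can be illustrated with a minimal sketch. The abstract does not specify the fusion rule or the network layer, so everything here is an assumption made for illustration: fusion is taken to be column-wise concatenation of the per-view incidence matrices, the layer is a commonly used normalized hypergraph convolution with unit hyperedge weights, and the names `incidence_from_hyperedges`, `fuse_views`, `hypergraph_conv`, `view_a`, and `view_b` are hypothetical. It is not the authors' implementation.

```python
from math import sqrt


def incidence_from_hyperedges(n_nodes, hyperedges):
    """Build an n_nodes x n_edges 0/1 incidence matrix from lists of node indices."""
    H = [[0.0] * len(hyperedges) for _ in range(n_nodes)]
    for e, members in enumerate(hyperedges):
        for v in members:
            H[v][e] = 1.0
    return H


def fuse_views(view_matrices):
    """Assumed fusion rule: concatenate the per-view incidence matrices column-wise."""
    n_nodes = len(view_matrices[0])
    return [sum((H[v] for H in view_matrices), []) for v in range(n_nodes)]


def hypergraph_conv(H, X, theta):
    """One step of ReLU(Dv^-1/2 H De^-1 H^T Dv^-1/2 X Theta), unit hyperedge weights."""
    n, m, d = len(H), len(H[0]), len(X[0])
    node_deg = [sum(row) for row in H]
    edge_deg = [sum(H[v][e] for v in range(n)) for e in range(m)]
    inv_sqrt = [1.0 / sqrt(x) if x > 0 else 0.0 for x in node_deg]

    # Scale node features by Dv^-1/2.
    Xs = [[inv_sqrt[v] * X[v][k] for k in range(d)] for v in range(n)]

    # Gather: average the scaled features of the nodes inside each hyperedge.
    E = []
    for e in range(m):
        if edge_deg[e] > 0:
            E.append([sum(H[v][e] * Xs[v][k] for v in range(n)) / edge_deg[e]
                      for k in range(d)])
        else:
            E.append([0.0] * d)

    # Scatter: send hyperedge features back to member nodes, scaled by Dv^-1/2.
    Z = [[inv_sqrt[v] * sum(H[v][e] * E[e][k] for e in range(m)) for k in range(d)]
         for v in range(n)]

    # Linear projection followed by ReLU.
    out_dim = len(theta[0])
    return [[max(0.0, sum(Z[v][k] * theta[k][j] for k in range(d)))
             for j in range(out_dim)] for v in range(n)]


# Four code fragments seen through two hypothetical views.
view_a = incidence_from_hyperedges(4, [[0, 1, 2], [2, 3]])
view_b = incidence_from_hyperedges(4, [[0, 3], [1, 2, 3]])
fused = fuse_views([view_a, view_b])

X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]]  # toy node features
theta = [[1.0, 0.0], [0.0, 1.0]]                      # identity projection
embeddings = hypergraph_conv(fused, X, theta)
```

Each hyperedge here can contain more than two fragments, which is the property the abstract contrasts with ordinary pairwise graphs. In practice the projection `theta` would be learned, and the resulting embeddings would feed a downstream defect classifier.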

Original language: English
Pages (from-to): 1863-1876
Number of pages: 14
Journal: IEEE Transactions on Reliability
Volume: 73
Issue number: 4
DOIs
State: Published - 2024
Externally published: Yes

Keywords

  • Code multiview fusion
  • code representation learning
  • hypergraph construction
  • software defect prediction
  • software reliability
