Skip to main navigation Skip to search Skip to main content

Hierarchical similarity network fusion for discovering cancer subtypes

  • Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

Recent breakthroughs in biologic sequencing technologies have cost-effectively yielded diverse types of observations. Integrative analysis of multiple platform cancer data, which is capable of revealing intrinsic characteristics of a biological process, has become an attractive research route on cancer subtypes discovery. Most machine learning based methods need represent each input data in unified space, losing certain important features or resulting in various noises in some data types. Furthermore, many network based data integration methods treat each type data independently, leading to a lot of inconsistent conclusions. Subsequently, similarity network fusion (SNF) was developed to deal with such questions. However, Euclidean distance metrics employed in SNF suffers curse of dimensionality and thus gives rise to poor results. To this end, we propose a new integrated method, dubbed hierarchical similarity network (HSNF), to learn a fused discriminating patient similarity network. HSNF randomly samples sub-features from different input data to construct multiple input similarity matrixes used as a basic of fusion so that diverse similarity matrixes are generated by multiple random sampling. Then we design a hierarchical fusion framework to make full use of the complementariness of diverse similarity networks from different feature modalities. Finally, based on the final fused similarity matrix, spectral clustering was used to discover cancer subtypes. Experimental results on five public cancer datasets manifest that HSNF can discover significantly different subtypes and can consistently outperform the-state-of-the-art in terms of silhouette, and p-value of survival analysis.

Original languageEnglish
Title of host publicationBioinformatics Research and Applications - 14th International Symposium, ISBRA 2018, Proceedings
EditorsFa Zhang, Shihua Zhang, Zhipeng Cai, Pavel Skums
PublisherSpringer Verlag
Pages125-136
Number of pages12
ISBN (Print)9783319949673
DOIs
StatePublished - 2018
Event14th International Symposium on Bioinformatics Research and Applications, ISBRA 2018 - Beijing, China
Duration: 8 Jun 201811 Jun 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10847 LNBI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th International Symposium on Bioinformatics Research and Applications, ISBRA 2018
Country/TerritoryChina
CityBeijing
Period8/06/1811/06/18

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Cancer subtypes discovery
  • Data integration
  • Hierarchical similarity network fusion
  • Multi-platform cancer data

Fingerprint

Dive into the research topics of 'Hierarchical similarity network fusion for discovering cancer subtypes'. Together they form a unique fingerprint.

Cite this