Abstract
Recent breakthroughs in biologic sequencing technologies have cost-effectively yielded diverse types of observations. Integrative analysis of multiple platform cancer data, which is capable of revealing intrinsic characteristics of a biological process, has become an attractive research route on cancer subtypes discovery. Most machine learning based methods need represent each input data in unified space, losing certain important features or resulting in various noises in some data types. Furthermore, many network based data integration methods treat each type data independently, leading to a lot of inconsistent conclusions. Subsequently, similarity network fusion (SNF) was developed to deal with such questions. However, Euclidean distance metrics employed in SNF suffers curse of dimensionality and thus gives rise to poor results. To this end, we propose a new integrated method, dubbed hierarchical similarity network (HSNF), to learn a fused discriminating patient similarity network. HSNF randomly samples sub-features from different input data to construct multiple input similarity matrixes used as a basic of fusion so that diverse similarity matrixes are generated by multiple random sampling. Then we design a hierarchical fusion framework to make full use of the complementariness of diverse similarity networks from different feature modalities. Finally, based on the final fused similarity matrix, spectral clustering was used to discover cancer subtypes. Experimental results on five public cancer datasets manifest that HSNF can discover significantly different subtypes and can consistently outperform the-state-of-the-art in terms of silhouette, and p-value of survival analysis.
| Original language | English |
|---|---|
| Title of host publication | Bioinformatics Research and Applications - 14th International Symposium, ISBRA 2018, Proceedings |
| Editors | Fa Zhang, Shihua Zhang, Zhipeng Cai, Pavel Skums |
| Publisher | Springer Verlag |
| Pages | 125-136 |
| Number of pages | 12 |
| ISBN (Print) | 9783319949673 |
| DOIs | |
| State | Published - 2018 |
| Event | 14th International Symposium on Bioinformatics Research and Applications, ISBRA 2018 - Beijing, China Duration: 8 Jun 2018 → 11 Jun 2018 |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 10847 LNBI |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 14th International Symposium on Bioinformatics Research and Applications, ISBRA 2018 |
|---|---|
| Country/Territory | China |
| City | Beijing |
| Period | 8/06/18 → 11/06/18 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Cancer subtypes discovery
- Data integration
- Hierarchical similarity network fusion
- Multi-platform cancer data
Fingerprint
Dive into the research topics of 'Hierarchical similarity network fusion for discovering cancer subtypes'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver