Constrained Query of Order-Preserving Submatrix Based on Signature and Trie

Tao Jiang, Zhan Huai Li, Xue Qun Shang, Bo Lin Chen, Wei Bang Li, Zhi Lei Yin

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

The advances of microarray technology have made large amount of gene expression data available from a variety of different experimental conditions. Analyzing the microarray data plays a key role in understanding gene functions, gene regulation and cellular process. Order-Preserving Submatrix (OPSM) is an important model in microarray data analysis, which captures the identical tendency of gene expressions across a subset of conditions. In the process of analyzing mechanism of gene expression, OPSM search undoubtedly saves the time and effort of biologists. However, OPSM retrieval mainly depends on keyword search, resulting a weak control on the obtained clusters. Typically, the analyst can determine the ad-hoc parameters which are far from the declarative specification of desired properties on operation and concept. Motivated by obtaining much more accurate query relevancy, this paper proposes two types of OPSM indexing and constrained query methods based on signature and Trie. Extensive experiments conducted on real datasets demonstrate the proposed methods have better behaviors than the state-of-the-art methods on efficiency and effectiveness.

Original languageEnglish
Pages (from-to)2175-2195
Number of pages21
JournalRuan Jian Xue Bao/Journal of Software
Volume28
Issue number8
DOIs
StatePublished - 1 Aug 2017

Keywords

  • Constrained query
  • Enumerated sequence
  • Gene expression data
  • Order-preserving submatrix (OPSM)
  • Signature
  • Trie

Fingerprint

Dive into the research topics of 'Constrained Query of Order-Preserving Submatrix Based on Signature and Trie'. Together they form a unique fingerprint.

Cite this