摘要
Three-dimensional shapes contain different kinds of information that jointly characterize the shape. Traditional methods, however, perform recognition or retrieval using only one type. This article presents a 3D feature learning framework that combines different modality data effectively to promote the discriminability of unimodal features. Two independent deep belief networks (DBNs) are employed to learn high-level features from low-level features, and a restricted Boltzmann machine (RBM) is trained for mining the deep correlations between the different modalities. Experiments demonstrate that the proposed method can achieve better performance.
源语言 | 英语 |
---|---|
文章编号 | 52 |
页(从-至) | 38-46 |
页数 | 9 |
期刊 | IEEE Multimedia |
卷 | 21 |
期 | 4 |
DOI | |
出版状态 | 已出版 - 1 10月 2014 |