Multimodal feature fusion for 3D shape recognition and retrieval

Shuhui Bu; Shaoguang Cheng; Zhenbao Liu; Junwei Han

doi:10.1109/MMUL.2014.52

Multimodal feature fusion for 3D shape recognition and retrieval

Shuhui Bu, Shaoguang Cheng, Zhenbao Liu, Junwei Han

Northwestern Polytechnical University Xian

科研成果: 期刊稿件 › 文章 › 同行评审

16 引用（Scopus）

摘要

Three-dimensional shapes contain different kinds of information that jointly characterize the shape. Traditional methods, however, perform recognition or retrieval using only one type. This article presents a 3D feature learning framework that combines different modality data effectively to promote the discriminability of unimodal features. Two independent deep belief networks (DBNs) are employed to learn high-level features from low-level features, and a restricted Boltzmann machine (RBM) is trained for mining the deep correlations between the different modalities. Experiments demonstrate that the proposed method can achieve better performance.

源语言	英语
文章编号	52
页（从-至）	38-46
页数	9
期刊	IEEE Multimedia
卷	21
期	4
DOI	https://doi.org/10.1109/MMUL.2014.52
出版状态	已出版 - 1 10月 2014

访问文件

10.1109/MMUL.2014.52

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{735940c0b358438b9db89ae2b813b919,

title = "Multimodal feature fusion for 3D shape recognition and retrieval",

abstract = "Three-dimensional shapes contain different kinds of information that jointly characterize the shape. Traditional methods, however, perform recognition or retrieval using only one type. This article presents a 3D feature learning framework that combines different modality data effectively to promote the discriminability of unimodal features. Two independent deep belief networks (DBNs) are employed to learn high-level features from low-level features, and a restricted Boltzmann machine (RBM) is trained for mining the deep correlations between the different modalities. Experiments demonstrate that the proposed method can achieve better performance.",

keywords = "Accuracy, Deep learning, Feature extraction, Fusion, Learning systems, Multimedia, Multimodal feature fusion, Research and development, Shape analysis, Shape recognition, Shape retrieval, Solid modeling, Three-dimensional displays",

author = "Shuhui Bu and Shaoguang Cheng and Zhenbao Liu and Junwei Han",

note = "Publisher Copyright: {\textcopyright} 2014 IEEE.",

year = "2014",

month = oct,

day = "1",

doi = "10.1109/MMUL.2014.52",

language = "英语",

volume = "21",

pages = "38--46",

journal = "IEEE Multimedia",

issn = "1070-986X",

publisher = "IEEE Computer Society",

number = "4",

}

TY - JOUR

T1 - Multimodal feature fusion for 3D shape recognition and retrieval

AU - Bu, Shuhui

AU - Cheng, Shaoguang

AU - Liu, Zhenbao

AU - Han, Junwei

PY - 2014/10/1

Y1 - 2014/10/1

N2 - Three-dimensional shapes contain different kinds of information that jointly characterize the shape. Traditional methods, however, perform recognition or retrieval using only one type. This article presents a 3D feature learning framework that combines different modality data effectively to promote the discriminability of unimodal features. Two independent deep belief networks (DBNs) are employed to learn high-level features from low-level features, and a restricted Boltzmann machine (RBM) is trained for mining the deep correlations between the different modalities. Experiments demonstrate that the proposed method can achieve better performance.

AB - Three-dimensional shapes contain different kinds of information that jointly characterize the shape. Traditional methods, however, perform recognition or retrieval using only one type. This article presents a 3D feature learning framework that combines different modality data effectively to promote the discriminability of unimodal features. Two independent deep belief networks (DBNs) are employed to learn high-level features from low-level features, and a restricted Boltzmann machine (RBM) is trained for mining the deep correlations between the different modalities. Experiments demonstrate that the proposed method can achieve better performance.

KW - Accuracy

KW - Deep learning

KW - Feature extraction

KW - Fusion

KW - Learning systems

KW - Multimedia

KW - Multimodal feature fusion

KW - Research and development

KW - Shape analysis

KW - Shape recognition

KW - Shape retrieval

KW - Solid modeling

KW - Three-dimensional displays

UR - http://www.scopus.com/inward/record.url?scp=84910093108&partnerID=8YFLogxK

U2 - 10.1109/MMUL.2014.52

DO - 10.1109/MMUL.2014.52

M3 - 文章

AN - SCOPUS:84910093108

SN - 1070-986X

VL - 21

SP - 38

EP - 46

JO - IEEE Multimedia

JF - IEEE Multimedia

IS - 4

M1 - 52

ER -

Multimodal feature fusion for 3D shape recognition and retrieval

摘要

访问文件

其它文件与链接

指纹

引用此