Abstract
Three-dimensional shapes contain different kinds of information that jointly characterize the shape. Traditional methods, however, perform recognition or retrieval using only one type. This article presents a 3D feature learning framework that combines data from different modalities to improve the discriminability of unimodal features. Two independent deep belief networks (DBNs) are employed to learn high-level features from low-level features, and a restricted Boltzmann machine (RBM) is trained to mine the deep correlations between the different modalities. Experiments demonstrate that the proposed method achieves better recognition and retrieval performance.
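The fusion step described in the abstract can be sketched as follows. This is a minimal, hypothetical illustration, not the paper's implementation: the two modality-specific DBN outputs are stood in for by random binary feature codes, and a single joint RBM trained with one step of contrastive divergence (CD-1) models correlations across the concatenated modalities. All names, dimensions, and hyperparameters are illustrative assumptions.

```python
# Hypothetical sketch: a joint RBM over concatenated modality features,
# trained with CD-1. Dimensions and learning rate are illustrative.
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class RBM:
    """Binary restricted Boltzmann machine trained with CD-1."""
    def __init__(self, n_visible, n_hidden, lr=0.1):
        self.W = rng.normal(0.0, 0.01, (n_visible, n_hidden))
        self.b_v = np.zeros(n_visible)
        self.b_h = np.zeros(n_hidden)
        self.lr = lr

    def train_step(self, v0):
        # Positive phase: hidden probabilities given the data.
        p_h0 = sigmoid(v0 @ self.W + self.b_h)
        h0 = (rng.random(p_h0.shape) < p_h0).astype(float)
        # Negative phase: one Gibbs step back to visible and hidden.
        p_v1 = sigmoid(h0 @ self.W.T + self.b_v)
        p_h1 = sigmoid(p_v1 @ self.W + self.b_h)
        # CD-1 parameter updates (averaged over the batch).
        self.W += self.lr * (v0.T @ p_h0 - p_v1.T @ p_h1) / len(v0)
        self.b_v += self.lr * (v0 - p_v1).mean(axis=0)
        self.b_h += self.lr * (p_h0 - p_h1).mean(axis=0)

    def hidden_features(self, v):
        return sigmoid(v @ self.W + self.b_h)

# Stand-ins for the high-level features the two modality-specific DBNs
# would produce (e.g. view-based and geometry-based codes).
feat_a = (rng.random((32, 64)) > 0.5).astype(float)
feat_b = (rng.random((32, 48)) > 0.5).astype(float)

# The joint RBM sees both modalities at once, so its hidden units can
# capture cross-modal correlations.
joint = RBM(n_visible=64 + 48, n_hidden=128)
for _ in range(20):
    joint.train_step(np.hstack([feat_a, feat_b]))

fused = joint.hidden_features(np.hstack([feat_a, feat_b]))
print(fused.shape)  # (32, 128)
```

The fused hidden activations would then serve as the multimodal descriptor for recognition or retrieval; in the paper's setting the visible layer is the concatenation of the two DBN outputs rather than random codes.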
| Original language | English |
| --- | --- |
| Article number | 52 |
| Pages (from-to) | 38-46 |
| Number of pages | 9 |
| Journal | IEEE Multimedia |
| Volume | 21 |
| Issue number | 4 |
| DOIs | |
| State | Published - 1 Oct 2014 |
Keywords
- Accuracy
- Deep learning
- Feature extraction
- Fusion
- Learning systems
- Multimedia
- Multimodal feature fusion
- Research and development
- Shape analysis
- Shape recognition
- Shape retrieval
- Solid modeling
- Three-dimensional displays