Deep Non-Rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling

Hui Deng; Jiawei Shi; Zhen Qin; Yiran Zhong; Yuchao Dai

doi:10.1609/aaai.v39i3.32272

Deep Non-Rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling

Hui Deng, Jiawei Shi, Zhen Qin, Yiran Zhong, Yuchao Dai

电子信息学院

科研成果: 期刊稿件 › 会议文章 › 同行评审

摘要

Non-Rigid Structure-from-Motion (NRSfM) is a classic 3D vision problem, where a 2D sequence is taken as input to estimate the corresponding 3D sequence. Recently, deep neural networks have greatly advanced the task of NRSfM. However, existing deep NRSfM methods still have limitations in handling the inherent sequence property and motion ambiguity associated with the NRSfM problem. In this paper, we revisit deep NRSfM from two perspectives to address the limitations of current deep NRSfM methods: (1) canonicalization and (2) sequence modeling. We propose an easy-to-implement per-sequence canonicalization method as opposed to the previous per-dataset canonicalization approaches. With this in mind, we propose a sequence modeling method that combines temporal information and subspace constraints. As a result, we have achieved a more optimal NRSfM reconstruction pipeline compared to previous efforts. The effectiveness of our method is verified by testing the sequence-to-sequence deep NRSfM pipeline with corresponding regularization modules on several commonly used datasets.

源语言	英语
页（从-至）	2681-2689
页数	9
期刊	Proceedings of the AAAI Conference on Artificial Intelligence
卷	39
期	3
DOI	https://doi.org/10.1609/aaai.v39i3.32272
出版状态	已出版 - 11 4月 2025
活动	39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025 - Philadelphia, 美国期限: 25 2月 2025 → 4 3月 2025

访问文件

10.1609/aaai.v39i3.32272

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{19e661e2f509491fbc56ddcce7b8f3f6,

title = "Deep Non-Rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling",

abstract = "Non-Rigid Structure-from-Motion (NRSfM) is a classic 3D vision problem, where a 2D sequence is taken as input to estimate the corresponding 3D sequence. Recently, deep neural networks have greatly advanced the task of NRSfM. However, existing deep NRSfM methods still have limitations in handling the inherent sequence property and motion ambiguity associated with the NRSfM problem. In this paper, we revisit deep NRSfM from two perspectives to address the limitations of current deep NRSfM methods: (1) canonicalization and (2) sequence modeling. We propose an easy-to-implement per-sequence canonicalization method as opposed to the previous per-dataset canonicalization approaches. With this in mind, we propose a sequence modeling method that combines temporal information and subspace constraints. As a result, we have achieved a more optimal NRSfM reconstruction pipeline compared to previous efforts. The effectiveness of our method is verified by testing the sequence-to-sequence deep NRSfM pipeline with corresponding regularization modules on several commonly used datasets.",

author = "Hui Deng and Jiawei Shi and Zhen Qin and Yiran Zhong and Yuchao Dai",

note = "Publisher Copyright: {\textcopyright} 2025, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.; 39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025 ; Conference date: 25-02-2025 Through 04-03-2025",

year = "2025",

month = apr,

day = "11",

doi = "10.1609/aaai.v39i3.32272",

language = "英语",

volume = "39",

pages = "2681--2689",

journal = "Proceedings of the AAAI Conference on Artificial Intelligence",

issn = "2159-5399",

publisher = "Association for the Advancement of Artificial Intelligence",

number = "3",

}

TY - JOUR

T1 - Deep Non-Rigid Structure-from-Motion Revisited

T2 - 39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025

AU - Deng, Hui

AU - Shi, Jiawei

AU - Qin, Zhen

AU - Zhong, Yiran

AU - Dai, Yuchao

PY - 2025/4/11

Y1 - 2025/4/11

N2 - Non-Rigid Structure-from-Motion (NRSfM) is a classic 3D vision problem, where a 2D sequence is taken as input to estimate the corresponding 3D sequence. Recently, deep neural networks have greatly advanced the task of NRSfM. However, existing deep NRSfM methods still have limitations in handling the inherent sequence property and motion ambiguity associated with the NRSfM problem. In this paper, we revisit deep NRSfM from two perspectives to address the limitations of current deep NRSfM methods: (1) canonicalization and (2) sequence modeling. We propose an easy-to-implement per-sequence canonicalization method as opposed to the previous per-dataset canonicalization approaches. With this in mind, we propose a sequence modeling method that combines temporal information and subspace constraints. As a result, we have achieved a more optimal NRSfM reconstruction pipeline compared to previous efforts. The effectiveness of our method is verified by testing the sequence-to-sequence deep NRSfM pipeline with corresponding regularization modules on several commonly used datasets.

AB - Non-Rigid Structure-from-Motion (NRSfM) is a classic 3D vision problem, where a 2D sequence is taken as input to estimate the corresponding 3D sequence. Recently, deep neural networks have greatly advanced the task of NRSfM. However, existing deep NRSfM methods still have limitations in handling the inherent sequence property and motion ambiguity associated with the NRSfM problem. In this paper, we revisit deep NRSfM from two perspectives to address the limitations of current deep NRSfM methods: (1) canonicalization and (2) sequence modeling. We propose an easy-to-implement per-sequence canonicalization method as opposed to the previous per-dataset canonicalization approaches. With this in mind, we propose a sequence modeling method that combines temporal information and subspace constraints. As a result, we have achieved a more optimal NRSfM reconstruction pipeline compared to previous efforts. The effectiveness of our method is verified by testing the sequence-to-sequence deep NRSfM pipeline with corresponding regularization modules on several commonly used datasets.

UR - http://www.scopus.com/inward/record.url?scp=105003998762&partnerID=8YFLogxK

U2 - 10.1609/aaai.v39i3.32272

DO - 10.1609/aaai.v39i3.32272

M3 - 会议文章

AN - SCOPUS:105003998762

SN - 2159-5399

VL - 39

SP - 2681

EP - 2689

JO - Proceedings of the AAAI Conference on Artificial Intelligence

JF - Proceedings of the AAAI Conference on Artificial Intelligence

IS - 3

Y2 - 25 February 2025 through 4 March 2025

ER -

Deep Non-Rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling

摘要

访问文件

其它文件与链接

指纹

引用此