Capsule Networks with Residual Pose Routing

Yi Liu; De Cheng; Dingwen Zhang; Shoukun Xu; Jungong Han

doi:10.1109/TNNLS.2023.3347722

Capsule Networks with Residual Pose Routing

Yi Liu, De Cheng, Dingwen Zhang, Shoukun Xu, Jungong Han

School of Automation

Research output: Contribution to journal › Article › peer-review

24 Scopus citations

Abstract

Capsule networks (CapsNets) have been known difficult to develop a deeper architecture, which is desirable for high performance in the deep learning era, due to the complex capsule routing algorithms. In this article, we present a simple yet effective capsule routing algorithm, which is presented by a residual pose routing. Specifically, the higher-layer capsule pose is achieved by an identity mapping on the adjacently lower-layer capsule pose. Such simple residual pose routing has two advantages: 1) reducing the routing computation complexity and 2) avoiding gradient vanishing due to its residual learning framework. On top of that, we explicitly reformulate the capsule layers by building a residual pose block. Stacking multiple such blocks results in a deep residual CapsNets (ResCaps) with a ResNet-like architecture. Results on MNIST, AffNIST, SmallNORB, and CIFAR-10/100 show the effectiveness of ResCaps for image classification. Furthermore, we successfully extend our residual pose routing to large-scale real-world applications, including 3-D object reconstruction and classification, and 2-D saliency dense prediction. The source code has been released on https://github.com/liuyi1989/ResCaps.

Original language	English
Pages (from-to)	2648-2661
Number of pages	14
Journal	IEEE Transactions on Neural Networks and Learning Systems
Volume	36
Issue number	2
DOIs	https://doi.org/10.1109/TNNLS.2023.3347722
State	Published - 2025

Keywords

3-D point cloud
capsule network (CapsNet)
part-whole
residual routing
salient object detection

Access to Document

10.1109/TNNLS.2023.3347722

Cite this

@article{dd162da715084476bd0c8146fafc450c,

title = "Capsule Networks with Residual Pose Routing",

abstract = "Capsule networks (CapsNets) have been known difficult to develop a deeper architecture, which is desirable for high performance in the deep learning era, due to the complex capsule routing algorithms. In this article, we present a simple yet effective capsule routing algorithm, which is presented by a residual pose routing. Specifically, the higher-layer capsule pose is achieved by an identity mapping on the adjacently lower-layer capsule pose. Such simple residual pose routing has two advantages: 1) reducing the routing computation complexity and 2) avoiding gradient vanishing due to its residual learning framework. On top of that, we explicitly reformulate the capsule layers by building a residual pose block. Stacking multiple such blocks results in a deep residual CapsNets (ResCaps) with a ResNet-like architecture. Results on MNIST, AffNIST, SmallNORB, and CIFAR-10/100 show the effectiveness of ResCaps for image classification. Furthermore, we successfully extend our residual pose routing to large-scale real-world applications, including 3-D object reconstruction and classification, and 2-D saliency dense prediction. The source code has been released on https://github.com/liuyi1989/ResCaps.",

keywords = "3-D point cloud, capsule network (CapsNet), part-whole, residual routing, salient object detection",

author = "Yi Liu and De Cheng and Dingwen Zhang and Shoukun Xu and Jungong Han",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.",

year = "2025",

doi = "10.1109/TNNLS.2023.3347722",

language = "英语",

volume = "36",

pages = "2648--2661",

journal = "IEEE Transactions on Neural Networks and Learning Systems",

issn = "2162-237X",

publisher = "IEEE Computational Intelligence Society",

number = "2",

}

TY - JOUR

T1 - Capsule Networks with Residual Pose Routing

AU - Liu, Yi

AU - Cheng, De

AU - Zhang, Dingwen

AU - Xu, Shoukun

AU - Han, Jungong

PY - 2025

Y1 - 2025

N2 - Capsule networks (CapsNets) have been known difficult to develop a deeper architecture, which is desirable for high performance in the deep learning era, due to the complex capsule routing algorithms. In this article, we present a simple yet effective capsule routing algorithm, which is presented by a residual pose routing. Specifically, the higher-layer capsule pose is achieved by an identity mapping on the adjacently lower-layer capsule pose. Such simple residual pose routing has two advantages: 1) reducing the routing computation complexity and 2) avoiding gradient vanishing due to its residual learning framework. On top of that, we explicitly reformulate the capsule layers by building a residual pose block. Stacking multiple such blocks results in a deep residual CapsNets (ResCaps) with a ResNet-like architecture. Results on MNIST, AffNIST, SmallNORB, and CIFAR-10/100 show the effectiveness of ResCaps for image classification. Furthermore, we successfully extend our residual pose routing to large-scale real-world applications, including 3-D object reconstruction and classification, and 2-D saliency dense prediction. The source code has been released on https://github.com/liuyi1989/ResCaps.

AB - Capsule networks (CapsNets) have been known difficult to develop a deeper architecture, which is desirable for high performance in the deep learning era, due to the complex capsule routing algorithms. In this article, we present a simple yet effective capsule routing algorithm, which is presented by a residual pose routing. Specifically, the higher-layer capsule pose is achieved by an identity mapping on the adjacently lower-layer capsule pose. Such simple residual pose routing has two advantages: 1) reducing the routing computation complexity and 2) avoiding gradient vanishing due to its residual learning framework. On top of that, we explicitly reformulate the capsule layers by building a residual pose block. Stacking multiple such blocks results in a deep residual CapsNets (ResCaps) with a ResNet-like architecture. Results on MNIST, AffNIST, SmallNORB, and CIFAR-10/100 show the effectiveness of ResCaps for image classification. Furthermore, we successfully extend our residual pose routing to large-scale real-world applications, including 3-D object reconstruction and classification, and 2-D saliency dense prediction. The source code has been released on https://github.com/liuyi1989/ResCaps.

KW - 3-D point cloud

KW - capsule network (CapsNet)

KW - part-whole

KW - residual routing

KW - salient object detection

UR - http://www.scopus.com/inward/record.url?scp=85182354096&partnerID=8YFLogxK

U2 - 10.1109/TNNLS.2023.3347722

DO - 10.1109/TNNLS.2023.3347722

M3 - 文章

AN - SCOPUS:85182354096

SN - 2162-237X

VL - 36

SP - 2648

EP - 2661

JO - IEEE Transactions on Neural Networks and Learning Systems

JF - IEEE Transactions on Neural Networks and Learning Systems

IS - 2

ER -

Capsule Networks with Residual Pose Routing

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this