TY - JOUR
T1 - Advances in intelligent animal pose tracking for neuro-behavioral integration
AU - Zhong, Yuzhe
AU - Wang, Lanjing
AU - Wang, Xiao Yue
AU - Sun, Chong
AU - Chen, Jun
AU - Yang, Haitao
N1 - Publisher Copyright:
© 2026 Elsevier B.V.
PY - 2026/6
Y1 - 2026/6
N2 - Animal behavior serves as an external manifestation of neural function and a key window into brain mechanisms and disease states. Traditional behavioral studies relying on manual observation and empirical classification suffer from subjective bias and limited spatiotemporal resolution, hindering their ability to quantitatively link behavior with neural activity. With rapid advances in computer vision and deep learning, pose tracking has evolved from Two-Dimensional (2D) estimation to Three-Dimensional (3D) reconstruction and further toward multimodal alignment. This review summarizes the evolutionary trajectory of machine learning-based animal pose tracking methods, ranging from early unsupervised and supervised paradigms to deep neural frameworks achieving high-precision 2D and 3D tracking, and further to the integration of frontier technologies—including Foundation Models, Generative Models, and Novel Neural Networks—alongside multimodal alignment with neural activity. Deep learning enables high-fidelity behavioral phenotyping and pose-neural mapping, driving advancements in multi-animal tracking and neuro-medical research. Despite remarkable progress, challenges persist in cross-species generalization, inconsistent annotation standards, and incomplete modeling of behavior–neural causality. Future research will emphasize self-supervised and generative learning to reduce annotation dependency, and multimodal temporal–spatial integration to align neural states with behavioral dynamics. The convergence of computational ethology and neuro-medicine is transforming neuroscience from passive observation to mechanistic understanding, advancing research and intervention in neurological and psychiatric disorders.
AB - Animal behavior serves as an external manifestation of neural function and a key window into brain mechanisms and disease states. Traditional behavioral studies relying on manual observation and empirical classification suffer from subjective bias and limited spatiotemporal resolution, hindering their ability to quantitatively link behavior with neural activity. With rapid advances in computer vision and deep learning, pose tracking has evolved from Two-Dimensional (2D) estimation to Three-Dimensional (3D) reconstruction and further toward multimodal alignment. This review summarizes the evolutionary trajectory of machine learning-based animal pose tracking methods, ranging from early unsupervised and supervised paradigms to deep neural frameworks achieving high-precision 2D and 3D tracking, and further to the integration of frontier technologies—including Foundation Models, Generative Models, and Novel Neural Networks—alongside multimodal alignment with neural activity. Deep learning enables high-fidelity behavioral phenotyping and pose-neural mapping, driving advancements in multi-animal tracking and neuro-medical research. Despite remarkable progress, challenges persist in cross-species generalization, inconsistent annotation standards, and incomplete modeling of behavior–neural causality. Future research will emphasize self-supervised and generative learning to reduce annotation dependency, and multimodal temporal–spatial integration to align neural states with behavioral dynamics. The convergence of computational ethology and neuro-medicine is transforming neuroscience from passive observation to mechanistic understanding, advancing research and intervention in neurological and psychiatric disorders.
KW - Animal pose tracking
KW - Computational ethology
KW - Deep learning
KW - Multimodal alignment
KW - Neurobehavioral analysis
UR - https://www.scopus.com/pages/publications/105034717062
U2 - 10.1016/j.neucom.2026.133305
DO - 10.1016/j.neucom.2026.133305
M3 - 文献综述
AN - SCOPUS:105034717062
SN - 0925-2312
VL - 680
JO - Neurocomputing
JF - Neurocomputing
M1 - 133305
ER -