Identifying Children With Autism Spectrum Disorder via Transformer-Based Representation Learning From Dynamic Facial Cues

Chen Xia, Hexu Chen, Junwei Han, Dingwen Zhang, Kuan Li

科研成果: 期刊稿件文章同行评审

1 引用 (Scopus)

摘要

Recognizing autism spectrum disorder (ASD) has faced great challenges due to insufficient professional clinicians and complex procedures. Automated data-driven ASD recognition models can reduce the subjectivity and physician dependency of traditional evaluation methods. Facial data, which can encode important perceptual and social behaviors, have emerged in ASD research to explore novel biomarkers for screening, diagnosing, and treating ASD. However, existing research mainly focuses on extracting low-level hand-crafted facial features for analysis and classification. Determining how to learn discriminative deep representations from dynamic facial data for computational model construction remains an unresolved challenge. In this study, we propose an ASD recognition model based on facial videos to fill the lack of temporal correlation learning of facial features. First, we utilize a vision transformer to extract frame-based global facial features. Then, we use a Longformer to establish the correlation of facial features over time. In the experiment, we recruited 146 subjects between 2 and 8 years of age to record their facial videos under a computer-based eye-tracking experiment and 76 subjects to conduct a smartphone-based experiment. Quantitative comparisons have shown the effectiveness and reliability of the proposed model. Furthermore, we have confirmed the correlation between facial and eye-tracking modalities in visual attention.

源语言英语
页(从-至)83-97
页数15
期刊IEEE Transactions on Affective Computing
16
1
DOI
出版状态已出版 - 2025

指纹

探究 'Identifying Children With Autism Spectrum Disorder via Transformer-Based Representation Learning From Dynamic Facial Cues' 的科研主题。它们共同构成独一无二的指纹。

引用此