Abstract
Exploring the association between high-level semantic brain responses and multimedia features is crucial for understanding the human semantic processing mechanism. However, a significant 'semantic gap' persists between abstract brain representations captured by functional Magnetic Resonance Imaging (fMRI) and concrete multimedia features, remaining both unclear and challenging to quantify. To address this, we introduce DyBrainFormer, a novel Transformer-based Brain Dynamics Decoder for Brain-Multimedia Association. Inspired by the topological structure and dynamic properties of the human brain, DyBrainFormer uniquely integrates Graph Convolutional Networks (GCNs) and Hierarchical Temporal Transformer (HTT). It first encodes each sequenced dynamic brain graph using GCNs to capture spatial dependencies and derive brain temporal node attention. Subsequently, these temporal graph representations are fed into the HTT, which excels at modeling complex dynamic changes and long-range temporal dependencies within brain networks. The learned temporal weights from HTT serve as interpretable semantic descriptors, forming a quantifiable bridge that links high-level brain semantics to dynamic multimedia features. Evaluated on the Healthy Brain Network naturalistic fMRI dataset, DyBrainFormer effectively learns distinguishable brain dynamics, achieving ∼ 83% classification accuracy in differentiating between children and adolescents. Our analysis further identifies distinct age-related patterns in semantic processing, demonstrating that children emphasize perceptual features while adolescents focus on higher-level conceptual elements. This work provides important references for bridging the semantic gap by establishing a robust and interpretable link between high-level semantic features and multimedia features, offering a novel perspective to uncover the human semantic understanding mechanism.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2025 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2025 |
| Editors | Juan Liu, Jingshan Huang, Xiaowo Wang, Fa Zhang, Xiufen Zou, Tian Tian, Xiaohua Hu, Bin Hu, Yi Xiong |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 6449-6455 |
| Number of pages | 7 |
| ISBN (Electronic) | 9798331515577 |
| DOIs | |
| State | Published - 2025 |
| Event | 2025 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2025 - Wuhan, China Duration: 15 Dec 2025 → 18 Dec 2025 |
Publication series
| Name | Proceedings - 2025 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2025 |
|---|
Conference
| Conference | 2025 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2025 |
|---|---|
| Country/Territory | China |
| City | Wuhan |
| Period | 15/12/25 → 18/12/25 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- brain dynamics
- hierarchical transformer
- naturalistic fMRI
- semantic gap
Fingerprint
Dive into the research topics of 'Dybrainformer: Decoding Dynamic Brain Semantics with Hierarchical Transformer for Brainmultimedia Association'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver