Exploring auditory network composition during free listening to audio excerpts via group-wise sparse representation

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

With the growing number of audio excerpts through various media and distribution channels, advanced audio analysis approaches have received significant interest in the multimedia field. However, current audio analysis approaches are still far from satisfactory due to the semantic gaps between the low-level acoustic features and high-level semantics perceived by human brain. In order to alleviate the problem, this paper propose a novel computational framework to bridge acoustic features with high-level semantic features derived from functional magnetic resonance imaging (fMRI) signals which record the brain's response during free listening to music/speech excerpts, and to explore the brain auditory network composition of acoustic features for different types of music/speech excerpts. Specifically, we identify meaningful brain networks and corresponding brain activities representing high-level semantic features via a novel group-wise sparse representation of whole brain fMRI signals. Then we associate the brain activities with specific low-level acoustic features and analyze the auditory network composition of acoustic features for different types of music/speech excerpts. Experimental results demonstrate that multiple acoustic features are involved in the brain auditory networks during free listening to music/speech excerpts. Meanwhile, there is considerable variability of auditory network composition of acoustic features for different types of music/speech. Our results provide new insights of how to narrow the semantic gaps in audio content analysis.

Original languageEnglish
Title of host publication2016 IEEE International Conference on Multimedia and Expo, ICME 2016
PublisherIEEE Computer Society
ISBN (Electronic)9781467372589
DOIs
StatePublished - 25 Aug 2016
Event2016 IEEE International Conference on Multimedia and Expo, ICME 2016 - Seattle, United States
Duration: 11 Jul 201615 Jul 2016

Publication series

NameProceedings - IEEE International Conference on Multimedia and Expo
Volume2016-August
ISSN (Print)1945-7871
ISSN (Electronic)1945-788X

Conference

Conference2016 IEEE International Conference on Multimedia and Expo, ICME 2016
Country/TerritoryUnited States
CitySeattle
Period11/07/1615/07/16

Keywords

  • Functional magnetic resonance imaging
  • Semantic gap
  • highlevel features
  • low-level feature

Fingerprint

Dive into the research topics of 'Exploring auditory network composition during free listening to audio excerpts via group-wise sparse representation'. Together they form a unique fingerprint.

Cite this