A block-based blind source separation approach with equilateral triangular microphone array

Jian Zhang, Zhonghua Fu, Lei Xie

Research output: Contribution to conference › Paper › peer-review

Abstract

In this paper we describe a method for separating multiple speech sources using an equilateral triangular microphone array. First, the azimuths of the horizontal plane are divided into many units, and the spatial features of some directions observed by the microphone array are modeled precisely. Second, the input mixture signals are segmented into blocks, and the number of active speakers and their directions are estimated in each block. Third, the pre-trained model with the nearest azimuth to each speaker is adapted to obtain a precise model, which is then used for time-frequency binary mask estimation. Finally, we separate every source that appears in each block and concatenate the sounds from the same unit to reproduce the whole stream. The experiments were conducted in a real meeting room. The results show that our method can separate multiple speech sources correctly with low distortion and is competitive with fully non-blind separation.
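
The block-wise flow summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the 30-degree unit width, the per-bin match scores, and the function names `nearest_unit`, `binary_masks`, and `concatenate_by_unit` are all assumptions made for illustration, and the paper's spatial-feature models, model adaptation, and speaker-count estimation are not reproduced here.

```python
from collections import defaultdict


def nearest_unit(azimuth_deg, unit_width_deg=30):
    """Map an azimuth (degrees) to the index of the nearest unit centre.

    Unit centres are assumed to sit at multiples of unit_width_deg;
    the 30-degree default is a hypothetical value, not from the paper.
    """
    n_units = 360 // unit_width_deg
    return round((azimuth_deg % 360) / unit_width_deg) % n_units


def binary_masks(scores):
    """Build one 0/1 time-frequency mask per source.

    scores[s][k] is an assumed measure of how well time-frequency bin k
    matches the model of source s (higher is better). Each bin is
    assigned to exactly one source: the one with the highest score.
    """
    n_sources, n_bins = len(scores), len(scores[0])
    masks = [[0] * n_bins for _ in range(n_sources)]
    for k in range(n_bins):
        best = max(range(n_sources), key=lambda s: scores[s][k])
        masks[best][k] = 1
    return masks


def concatenate_by_unit(blocks):
    """Join separated block outputs that fall in the same azimuth unit.

    blocks is a list (in time order) of per-block lists of
    (estimated_azimuth_deg, separated_samples) pairs. Simplification:
    a source that is silent in a block is not padded with zeros.
    """
    streams = defaultdict(list)
    for block in blocks:
        for azimuth_deg, samples in block:
            streams[nearest_unit(azimuth_deg)].extend(samples)
    return dict(streams)
```

In this sketch, two sources estimated at 2 and 358 degrees in consecutive blocks land in the same unit and are joined into one stream, which mirrors the final concatenation step described above.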

Original language: English
Pages: 1126-1130
Number of pages: 5
State: Published - 2011
Event: Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, APSIPA ASC 2011 - Xi'an, China
Duration: 18 Oct 2011 → 21 Oct 2011

Conference

Conference: Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, APSIPA ASC 2011
Country/Territory: China
City: Xi'an
Period: 18/10/11 → 21/10/11

Keywords

  • Blind source separation
  • Directions of arrival estimation
  • Equilateral triangular microphone array
  • Time-frequency mask
