Quality assessment for networked speech based on active packet detection

Wei Li, Fuzheng Yang, Shuai Wan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper the assessment of perceived speech quality under packet loss is investigated. Since packet loss would impair the perceived speech quality differently when lost packets are located in different segments in the speech signal, e.g. in speech or silence segments, a new detection method is first proposed to accurately determine the active packets which are related to speech signals only. Based on the detection of active packets, the packet loss rate within speech segments can be determined, which is further incorporated in the E-model for speech quality assessment. The efficiency and accuracy of the proposed detection method and the method for speech quality assessment have been verified by experimental results. Performance evaluation shows that the proposed method based on the active packet detection outperforms the original the E-model by over 8% in accuracy of prediction of the subjective speech quality.

Original languageEnglish
Title of host publicationCCTAE 2010 - 2010 International Conference on Computer and Communication Technologies in Agriculture Engineering
Pages561-564
Number of pages4
DOIs
StatePublished - 2010
Event2010 International Conference on Computer and Communication Technologies in Agriculture Engineering, CCTAE 2010 - Chengdu, China
Duration: 12 Jun 201013 Jun 2010

Publication series

NameCCTAE 2010 - 2010 International Conference on Computer and Communication Technologies in Agriculture Engineering
Volume3

Conference

Conference2010 International Conference on Computer and Communication Technologies in Agriculture Engineering, CCTAE 2010
Country/TerritoryChina
CityChengdu
Period12/06/1013/06/10

Keywords

  • Active packet detection
  • E-model
  • Packet loss
  • Speech quality assessment

Fingerprint

Dive into the research topics of 'Quality assessment for networked speech based on active packet detection'. Together they form a unique fingerprint.

Cite this