Study of the noise-reduction problem in the Karhunen-Loàve expansion domain

Jingdong Chen; Jacob Benesty; Yiteng Arden Huang

doi:10.1109/TASL.2009.2014793

Study of the noise-reduction problem in the Karhunen-Loàve expansion domain

Jingdong Chen, Jacob Benesty, Yiteng Arden Huang

Research output: Contribution to journal › Article › peer-review

27 Scopus citations

Abstract

Noise reduction, which aims at estimating a clean speech from a noisy observation, has long been an active research area. The standard approach to this problem is to obtain the clean speech estimate by linearly filtering the noisy signal. The core issue, then, becomes how to design an optimal linear filter that can significantly suppress noise without introducing perceptually noticeable speech distortion. Traditionally, the optimal noise-reduction filters are formulated in either the time or the frequency domains. This paper studies the problem in the Karhunen-Loàve expansion domain. We develop two classes of optimal filters. The first class achieves a frame of speech estimate by filtering the corresponding frame of the noisy speech. We will show that many existing methods such as the widely used Wiener filter and subspace technique are closely related to this category. The second class obtains noise reduction by filtering not only the current frame, but also a number of previous consecutive frames of the noisy speech. We will discuss how to design the optimal noise-reduction filters in each class and demonstrate, through both theoretical analysis and experiments, the properties of the deduced optimal filters.

Original language	English
Article number	4806284
Pages (from-to)	787-802
Number of pages	16
Journal	IEEE Transactions on Audio, Speech and Language Processing
Volume	17
Issue number	4
DOIs	https://doi.org/10.1109/TASL.2009.2014793
State	Published - May 2009
Externally published	Yes

Keywords

Karhunen-Loàve expansion (KLE)
Maximum signal-to-noise ratio (SNR) filter
Noise reduction
Pearson correlation coefficient
Speech enhancement
Subspace approach
Wiener filter

Access to Document

10.1109/TASL.2009.2014793

Cite this

@article{e60b83fce5ff43aaa152e361149ddb43,

title = "Study of the noise-reduction problem in the Karhunen-Lo{\`a}ve expansion domain",

abstract = "Noise reduction, which aims at estimating a clean speech from a noisy observation, has long been an active research area. The standard approach to this problem is to obtain the clean speech estimate by linearly filtering the noisy signal. The core issue, then, becomes how to design an optimal linear filter that can significantly suppress noise without introducing perceptually noticeable speech distortion. Traditionally, the optimal noise-reduction filters are formulated in either the time or the frequency domains. This paper studies the problem in the Karhunen-Lo{\`a}ve expansion domain. We develop two classes of optimal filters. The first class achieves a frame of speech estimate by filtering the corresponding frame of the noisy speech. We will show that many existing methods such as the widely used Wiener filter and subspace technique are closely related to this category. The second class obtains noise reduction by filtering not only the current frame, but also a number of previous consecutive frames of the noisy speech. We will discuss how to design the optimal noise-reduction filters in each class and demonstrate, through both theoretical analysis and experiments, the properties of the deduced optimal filters.",

keywords = "Karhunen-Lo{\`a}ve expansion (KLE), Maximum signal-to-noise ratio (SNR) filter, Noise reduction, Pearson correlation coefficient, Speech enhancement, Subspace approach, Wiener filter",

author = "Jingdong Chen and Jacob Benesty and Huang, {Yiteng Arden}",

year = "2009",

month = may,

doi = "10.1109/TASL.2009.2014793",

language = "英语",

volume = "17",

pages = "787--802",

journal = "IEEE Transactions on Audio, Speech and Language Processing",

issn = "1558-7916",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "4",

}

TY - JOUR

T1 - Study of the noise-reduction problem in the Karhunen-Loàve expansion domain

AU - Chen, Jingdong

AU - Benesty, Jacob

AU - Huang, Yiteng Arden

PY - 2009/5

Y1 - 2009/5

N2 - Noise reduction, which aims at estimating a clean speech from a noisy observation, has long been an active research area. The standard approach to this problem is to obtain the clean speech estimate by linearly filtering the noisy signal. The core issue, then, becomes how to design an optimal linear filter that can significantly suppress noise without introducing perceptually noticeable speech distortion. Traditionally, the optimal noise-reduction filters are formulated in either the time or the frequency domains. This paper studies the problem in the Karhunen-Loàve expansion domain. We develop two classes of optimal filters. The first class achieves a frame of speech estimate by filtering the corresponding frame of the noisy speech. We will show that many existing methods such as the widely used Wiener filter and subspace technique are closely related to this category. The second class obtains noise reduction by filtering not only the current frame, but also a number of previous consecutive frames of the noisy speech. We will discuss how to design the optimal noise-reduction filters in each class and demonstrate, through both theoretical analysis and experiments, the properties of the deduced optimal filters.

AB - Noise reduction, which aims at estimating a clean speech from a noisy observation, has long been an active research area. The standard approach to this problem is to obtain the clean speech estimate by linearly filtering the noisy signal. The core issue, then, becomes how to design an optimal linear filter that can significantly suppress noise without introducing perceptually noticeable speech distortion. Traditionally, the optimal noise-reduction filters are formulated in either the time or the frequency domains. This paper studies the problem in the Karhunen-Loàve expansion domain. We develop two classes of optimal filters. The first class achieves a frame of speech estimate by filtering the corresponding frame of the noisy speech. We will show that many existing methods such as the widely used Wiener filter and subspace technique are closely related to this category. The second class obtains noise reduction by filtering not only the current frame, but also a number of previous consecutive frames of the noisy speech. We will discuss how to design the optimal noise-reduction filters in each class and demonstrate, through both theoretical analysis and experiments, the properties of the deduced optimal filters.

KW - Karhunen-Loàve expansion (KLE)

KW - Maximum signal-to-noise ratio (SNR) filter

KW - Noise reduction

KW - Pearson correlation coefficient

KW - Speech enhancement

KW - Subspace approach

KW - Wiener filter

UR - http://www.scopus.com/inward/record.url?scp=65249162371&partnerID=8YFLogxK

U2 - 10.1109/TASL.2009.2014793

DO - 10.1109/TASL.2009.2014793

M3 - 文章

AN - SCOPUS:65249162371

SN - 1558-7916

VL - 17

SP - 787

EP - 802

JO - IEEE Transactions on Audio, Speech and Language Processing

JF - IEEE Transactions on Audio, Speech and Language Processing

IS - 4

M1 - 4806284

ER -

Study of the noise-reduction problem in the Karhunen-Loàve expansion domain

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this