This paper presents a novel approach to single-channel speech enhancement in noisy environments. Widely adopted noise reduction techniques based on spectral subtraction are generally expressed as a spectral gain that depends on the signal-to-noise ratio (SNR) [1]-[4]. As an SNR estimation method, the well-known decision-directed (DD) estimator of Ephraim and Malah is known to efficiently reduce musical noise in noise frames, but the a priori SNR, which is a crucial parameter of the spectral gain, follows the a posteriori SNR with a delay of one frame in speech frames [5]. Therefore, the noise suppression gain computed from the delayed a priori SNR estimated by the DD algorithm matches the previous frame rather than the current one, which degrades noise reduction performance during abrupt transient parts. To overcome this artifact, we propose a computationally simple but effective speech enhancement technique that uses a sigmoid-type function to adaptively determine the weighting factor of the DD algorithm. The proposed approach avoids the delay problem of the a priori SNR while maintaining the advantage of the DD algorithm. The performance of the proposed enhancement algorithm is evaluated by objective and subjective tests under various environments, and it yields better results than the conventional DD-based approach.
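The idea in the abstract can be illustrated with a minimal Python sketch for a single frequency bin. The decision-directed recursion below is the standard Ephraim-Malah form. Everything else is an assumption made for illustration only: the abstract does not give the paper's sigmoid, so `sigmoid_alpha`, its parameters (`alpha_min`, `alpha_max`, `slope`, `offset`), and the use of a Wiener gain are hypothetical choices, not the authors' method.

```python
import math


def sigmoid_alpha(gamma_now, gamma_prev, alpha_min=0.1, alpha_max=0.98,
                  slope=1.0, offset=3.0):
    """Hypothetical sigmoid mapping (NOT the paper's formula).

    Returns a weighting factor near alpha_max when the a posteriori SNR is
    stable and near alpha_min when it changes abruptly between frames.
    """
    change = abs(gamma_now - gamma_prev)
    z = min(slope * (change - offset), 50.0)  # clamp to avoid exp overflow
    return alpha_min + (alpha_max - alpha_min) / (1.0 + math.exp(z))


def dd_a_priori_snr(gammas, alpha_fn=None, fixed_alpha=0.98):
    """Decision-directed a priori SNR estimate for one frequency bin.

    gammas: a posteriori SNR per frame, |Y(n)|^2 / noise variance.
    alpha_fn: optional callable (gamma_now, gamma_prev) -> weighting factor;
              if None, the conventional fixed weighting factor is used.
    """
    prev_amp_term = 0.0      # |A_hat(n-1)|^2 / noise variance
    gamma_prev = gammas[0]
    estimates = []
    for gamma in gammas:
        alpha = fixed_alpha if alpha_fn is None else alpha_fn(gamma, gamma_prev)
        # DD rule: weighted sum of the previous-frame estimate and the
        # half-wave rectified instantaneous SNR of the current frame.
        xi = alpha * prev_amp_term + (1.0 - alpha) * max(gamma - 1.0, 0.0)
        gain = xi / (1.0 + xi)               # Wiener gain (assumed gain rule)
        prev_amp_term = (gain ** 2) * gamma  # enhanced amplitude, normalized
        gamma_prev = gamma
        estimates.append(xi)
    return estimates
```

Running both variants on a sequence that jumps from noise-only frames to a high a posteriori SNR, for example `[1.0] * 5 + [21.0] * 5`, shows the behavior the abstract describes: with the fixed weighting factor the estimate at the onset frame stays small, while the adaptive factor lets it follow the current frame.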
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Yun-Sik PARK, Joon-Hyuk CHANG, "A Novel Approach to a Robust a Priori SNR Estimator in Speech Enhancement" in IEICE TRANSACTIONS on Communications,
vol. E90-B, no. 8, pp. 2182-2185, August 2007, doi: 10.1093/ietcom/e90-b.8.2182.
Abstract: This paper presents a novel approach to single-channel speech enhancement in noisy environments. Widely adopted noise reduction techniques based on spectral subtraction are generally expressed as a spectral gain that depends on the signal-to-noise ratio (SNR) [1]-[4]. As an SNR estimation method, the well-known decision-directed (DD) estimator of Ephraim and Malah is known to efficiently reduce musical noise in noise frames, but the a priori SNR, which is a crucial parameter of the spectral gain, follows the a posteriori SNR with a delay of one frame in speech frames [5]. Therefore, the noise suppression gain computed from the delayed a priori SNR estimated by the DD algorithm matches the previous frame rather than the current one, which degrades noise reduction performance during abrupt transient parts. To overcome this artifact, we propose a computationally simple but effective speech enhancement technique that uses a sigmoid-type function to adaptively determine the weighting factor of the DD algorithm. The proposed approach avoids the delay problem of the a priori SNR while maintaining the advantage of the DD algorithm. The performance of the proposed enhancement algorithm is evaluated by objective and subjective tests under various environments, and it yields better results than the conventional DD-based approach.
URL: https://globals.ieice.org/en_transactions/communications/10.1093/ietcom/e90-b.8.2182/_p
@ARTICLE{e90-b_8_2182,
author={Yun-Sik PARK and Joon-Hyuk CHANG},
journal={IEICE TRANSACTIONS on Communications},
title={A Novel Approach to a Robust a Priori SNR Estimator in Speech Enhancement},
year={2007},
volume={E90-B},
number={8},
pages={2182-2185},
abstract={This paper presents a novel approach to single-channel speech enhancement in noisy environments. Widely adopted noise reduction techniques based on spectral subtraction are generally expressed as a spectral gain that depends on the signal-to-noise ratio (SNR) [1]-[4]. As an SNR estimation method, the well-known decision-directed (DD) estimator of Ephraim and Malah is known to efficiently reduce musical noise in noise frames, but the a priori SNR, which is a crucial parameter of the spectral gain, follows the a posteriori SNR with a delay of one frame in speech frames [5]. Therefore, the noise suppression gain computed from the delayed a priori SNR estimated by the DD algorithm matches the previous frame rather than the current one, which degrades noise reduction performance during abrupt transient parts. To overcome this artifact, we propose a computationally simple but effective speech enhancement technique that uses a sigmoid-type function to adaptively determine the weighting factor of the DD algorithm. The proposed approach avoids the delay problem of the a priori SNR while maintaining the advantage of the DD algorithm. The performance of the proposed enhancement algorithm is evaluated by objective and subjective tests under various environments, and it yields better results than the conventional DD-based approach.},
keywords={},
doi={10.1093/ietcom/e90-b.8.2182},
ISSN={1745-1345},
month={August},}
TY - JOUR
TI - A Novel Approach to a Robust a Priori SNR Estimator in Speech Enhancement
T2 - IEICE TRANSACTIONS on Communications
SP - 2182
EP - 2185
AU - Yun-Sik PARK
AU - Joon-Hyuk CHANG
PY - 2007
DO - 10.1093/ietcom/e90-b.8.2182
JO - IEICE TRANSACTIONS on Communications
SN - 1745-1345
VL - E90-B
IS - 8
JA - IEICE TRANSACTIONS on Communications
Y1 - August 2007
AB - This paper presents a novel approach to single-channel speech enhancement in noisy environments. Widely adopted noise reduction techniques based on spectral subtraction are generally expressed as a spectral gain that depends on the signal-to-noise ratio (SNR) [1]-[4]. As an SNR estimation method, the well-known decision-directed (DD) estimator of Ephraim and Malah is known to efficiently reduce musical noise in noise frames, but the a priori SNR, which is a crucial parameter of the spectral gain, follows the a posteriori SNR with a delay of one frame in speech frames [5]. Therefore, the noise suppression gain computed from the delayed a priori SNR estimated by the DD algorithm matches the previous frame rather than the current one, which degrades noise reduction performance during abrupt transient parts. To overcome this artifact, we propose a computationally simple but effective speech enhancement technique that uses a sigmoid-type function to adaptively determine the weighting factor of the DD algorithm. The proposed approach avoids the delay problem of the a priori SNR while maintaining the advantage of the DD algorithm. The performance of the proposed enhancement algorithm is evaluated by objective and subjective tests under various environments, and it yields better results than the conventional DD-based approach.
ER -