This letter proposes a novel approach for mel-cepstral analysis based on the psychoacoustic model of MPEG. A perceptual weighting function is developed by applying cubic spline interpolation on the signal-to-mask ratios (SMRs) which are obtained from the psychoacoustic model. Experiments on speaker identification and speech re-synthesis showed that the proposed method not only improved the speaker recognition performance, but also improved the speech quality of the re-synthesized speech.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Copy
Hongwu YANG, Dezhi HUANG, Lianhong CAI, "Perceptually Weighted Mel-Cepstrum Analysis of Speech Based on Psychoacoustic Model" in IEICE TRANSACTIONS on Information,
vol. E89-D, no. 12, pp. 2998-3001, December 2006, doi: 10.1093/ietisy/e89-d.12.2998.
Abstract: This letter proposes a novel approach for mel-cepstral analysis based on the psychoacoustic model of MPEG. A perceptual weighting function is developed by applying cubic spline interpolation on the signal-to-mask ratios (SMRs) which are obtained from the psychoacoustic model. Experiments on speaker identification and speech re-synthesis showed that the proposed method not only improved the speaker recognition performance, but also improved the speech quality of the re-synthesized speech.
URL: https://globals.ieice.org/en_transactions/information/10.1093/ietisy/e89-d.12.2998/_p
Copy
@ARTICLE{e89-d_12_2998,
author={Hongwu YANG, Dezhi HUANG, Lianhong CAI, },
journal={IEICE TRANSACTIONS on Information},
title={Perceptually Weighted Mel-Cepstrum Analysis of Speech Based on Psychoacoustic Model},
year={2006},
volume={E89-D},
number={12},
pages={2998-3001},
abstract={This letter proposes a novel approach for mel-cepstral analysis based on the psychoacoustic model of MPEG. A perceptual weighting function is developed by applying cubic spline interpolation on the signal-to-mask ratios (SMRs) which are obtained from the psychoacoustic model. Experiments on speaker identification and speech re-synthesis showed that the proposed method not only improved the speaker recognition performance, but also improved the speech quality of the re-synthesized speech.},
keywords={},
doi={10.1093/ietisy/e89-d.12.2998},
ISSN={1745-1361},
month={December},}
Copy
TY - JOUR
TI - Perceptually Weighted Mel-Cepstrum Analysis of Speech Based on Psychoacoustic Model
T2 - IEICE TRANSACTIONS on Information
SP - 2998
EP - 3001
AU - Hongwu YANG
AU - Dezhi HUANG
AU - Lianhong CAI
PY - 2006
DO - 10.1093/ietisy/e89-d.12.2998
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E89-D
IS - 12
JA - IEICE TRANSACTIONS on Information
Y1 - December 2006
AB - This letter proposes a novel approach for mel-cepstral analysis based on the psychoacoustic model of MPEG. A perceptual weighting function is developed by applying cubic spline interpolation on the signal-to-mask ratios (SMRs) which are obtained from the psychoacoustic model. Experiments on speaker identification and speech re-synthesis showed that the proposed method not only improved the speaker recognition performance, but also improved the speech quality of the re-synthesized speech.
ER -