Perceptually Weighted Mel-Cepstrum Analysis of Speech Based on Psychoacoustic Model

Hongwu YANG; Dezhi HUANG; Lianhong CAI

doi:10.1093/ietisy/e89-d.12.2998

Summary:

This letter proposes a novel approach for mel-cepstral analysis based on the psychoacoustic model of MPEG. A perceptual weighting function is developed by applying cubic spline interpolation on the signal-to-mask ratios (SMRs) which are obtained from the psychoacoustic model. Experiments on speaker identification and speech re-synthesis showed that the proposed method not only improved the speaker recognition performance, but also improved the speech quality of the re-synthesized speech.
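The interpolation step described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the SMRs are given in dB at subband center frequencies and fits a natural cubic spline through them to obtain a value at every analysis frequency. The function names, the example SMR values, and the min-max mapping from interpolated SMR to a weight in [0, 1] are all hypothetical, since the summary does not specify how the weighting function is derived from the interpolated curve.

```python
from bisect import bisect_right


def natural_cubic_spline(xs, ys):
    """Return f(x) evaluating the natural cubic spline through the knots (xs, ys)."""
    n = len(xs)
    if n < 3 or len(ys) != n:
        raise ValueError("need at least three knots with matching values")
    h = [xs[i + 1] - xs[i] for i in range(n - 1)]
    if any(step <= 0 for step in h):
        raise ValueError("knots must be strictly increasing")

    # Thomas algorithm for the second derivatives m[i].
    # Natural end conditions: m[0] = m[n-1] = 0.
    cp = [0.0] * n
    dp = [0.0] * n
    for i in range(1, n - 1):
        rhs = 6.0 * ((ys[i + 1] - ys[i]) / h[i] - (ys[i] - ys[i - 1]) / h[i - 1])
        denom = 2.0 * (h[i - 1] + h[i]) - h[i - 1] * cp[i - 1]
        cp[i] = h[i] / denom
        dp[i] = (rhs - h[i - 1] * dp[i - 1]) / denom
    m = [0.0] * n
    for i in range(n - 2, 0, -1):
        m[i] = dp[i] - cp[i] * m[i + 1]

    def evaluate(x):
        x = min(max(x, xs[0]), xs[-1])  # clamp to the knot range, no extrapolation
        i = min(max(bisect_right(xs, x) - 1, 0), n - 2)
        a, b = xs[i + 1] - x, x - xs[i]
        return ((m[i] * a ** 3 + m[i + 1] * b ** 3) / (6.0 * h[i])
                + (ys[i] / h[i] - m[i] * h[i] / 6.0) * a
                + (ys[i + 1] / h[i] - m[i + 1] * h[i] / 6.0) * b)

    return evaluate


def perceptual_weights(centers_hz, smr_db, bin_freqs_hz):
    """Interpolate subband SMRs to each bin, then map to [0, 1].

    ASSUMPTION: the min-max normalization is a placeholder mapping chosen
    for illustration; the source does not state the actual mapping.
    """
    spline = natural_cubic_spline(centers_hz, smr_db)
    smr = [spline(f) for f in bin_freqs_hz]
    lo, hi = min(smr), max(smr)
    if hi == lo:
        return [1.0] * len(smr)
    return [(v - lo) / (hi - lo) for v in smr]


# Illustrative inputs only: eight 250 Hz-wide subbands with made-up SMRs.
centers_hz = [125.0 + 250.0 * k for k in range(8)]
smr_db = [12.0, 18.5, 15.0, 9.0, 6.5, 3.0, -1.0, -4.0]
bin_freqs_hz = [31.25 * i for i in range(64)]
weights = perceptual_weights(centers_hz, smr_db, bin_freqs_hz)
```

A spline is used here rather than a piecewise-constant curve so that the weighting varies smoothly between subbands; frequencies outside the first and last subband centers are clamped rather than extrapolated, which is a design choice of this sketch.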

Publication: IEICE TRANSACTIONS on Information Vol.E89-D No.12 pp.2998-3001

Publication Date: 2006/12/01

Online ISSN: 1745-1361

DOI: 10.1093/ietisy/e89-d.12.2998

Type of Manuscript: LETTER

Category: Speech and Hearing

Hongwu YANG, Dezhi HUANG, Lianhong CAI, "Perceptually Weighted Mel-Cepstrum Analysis of Speech Based on Psychoacoustic Model" in IEICE TRANSACTIONS on Information, vol. E89-D, no. 12, pp. 2998-3001, December 2006, doi: 10.1093/ietisy/e89-d.12.2998.
Abstract: This letter proposes a novel approach for mel-cepstral analysis based on the psychoacoustic model of MPEG. A perceptual weighting function is developed by applying cubic spline interpolation on the signal-to-mask ratios (SMRs) which are obtained from the psychoacoustic model. Experiments on speaker identification and speech re-synthesis showed that the proposed method not only improved the speaker recognition performance, but also improved the speech quality of the re-synthesized speech.
URL: https://globals.ieice.org/en_transactions/information/10.1093/ietisy/e89-d.12.2998/_p

@ARTICLE{e89-d_12_2998,
author={Hongwu YANG and Dezhi HUANG and Lianhong CAI},
journal={IEICE TRANSACTIONS on Information},
title={Perceptually Weighted Mel-Cepstrum Analysis of Speech Based on Psychoacoustic Model},
year={2006},
volume={E89-D},
number={12},
pages={2998--3001},
abstract={This letter proposes a novel approach for mel-cepstral analysis based on the psychoacoustic model of MPEG. A perceptual weighting function is developed by applying cubic spline interpolation on the signal-to-mask ratios (SMRs) which are obtained from the psychoacoustic model. Experiments on speaker identification and speech re-synthesis showed that the proposed method not only improved the speaker recognition performance, but also improved the speech quality of the re-synthesized speech.},
keywords={},
doi={10.1093/ietisy/e89-d.12.2998},
ISSN={1745-1361},
month={December},
}

TY  - JOUR
TI  - Perceptually Weighted Mel-Cepstrum Analysis of Speech Based on Psychoacoustic Model
T2  - IEICE TRANSACTIONS on Information
SP  - 2998
EP  - 3001
AU  - Hongwu YANG
AU  - Dezhi HUANG
AU  - Lianhong CAI
PY  - 2006
DO  - 10.1093/ietisy/e89-d.12.2998
JO  - IEICE TRANSACTIONS on Information
SN  - 1745-1361
VL  - E89-D
IS  - 12
JA  - IEICE TRANSACTIONS on Information
Y1  - December 2006
AB  - This letter proposes a novel approach for mel-cepstral analysis based on the psychoacoustic model of MPEG. A perceptual weighting function is developed by applying cubic spline interpolation on the signal-to-mask ratios (SMRs) which are obtained from the psychoacoustic model. Experiments on speaker identification and speech re-synthesis showed that the proposed method not only improved the speaker recognition performance, but also improved the speech quality of the re-synthesized speech.
ER  -