This paper describes a new talker direction estimation method for front-end processing to capture distant-talking speech by using a microphone array. The proposed method consists of two algorithms: One is a TDOA (Time Delay Of Arrival) estimation algorithm based on a weighted CSP (Cross-power Spectrum Phase) analysis with an average speech spectrum and CSP coefficient subtraction. The other is a talker direction estimation algorithm based on ML (Maximum Likelihood) estimation in a time sequence of the estimated TDOAs. To evaluate the effectiveness of the proposed method, talker direction estimation experiments were carried out in an actual office room. The results confirmed that the talker direction estimation performance of the proposed method is superior to that of the conventional methods in both diffused- and directional-noise environments.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Copy
Yuki DENDA, Takanobu NISHIURA, Yoichi YAMASHITA, "Robust Talker Direction Estimation Based on Weighted CSP Analysis and Maximum Likelihood Estimation" in IEICE TRANSACTIONS on Information,
vol. E89-D, no. 3, pp. 1050-1057, March 2006, doi: 10.1093/ietisy/e89-d.3.1050.
Abstract: This paper describes a new talker direction estimation method for front-end processing to capture distant-talking speech by using a microphone array. The proposed method consists of two algorithms: One is a TDOA (Time Delay Of Arrival) estimation algorithm based on a weighted CSP (Cross-power Spectrum Phase) analysis with an average speech spectrum and CSP coefficient subtraction. The other is a talker direction estimation algorithm based on ML (Maximum Likelihood) estimation in a time sequence of the estimated TDOAs. To evaluate the effectiveness of the proposed method, talker direction estimation experiments were carried out in an actual office room. The results confirmed that the talker direction estimation performance of the proposed method is superior to that of the conventional methods in both diffused- and directional-noise environments.
URL: https://globals.ieice.org/en_transactions/information/10.1093/ietisy/e89-d.3.1050/_p
Copy
@ARTICLE{e89-d_3_1050,
author={Yuki DENDA, Takanobu NISHIURA, Yoichi YAMASHITA, },
journal={IEICE TRANSACTIONS on Information},
title={Robust Talker Direction Estimation Based on Weighted CSP Analysis and Maximum Likelihood Estimation},
year={2006},
volume={E89-D},
number={3},
pages={1050-1057},
abstract={This paper describes a new talker direction estimation method for front-end processing to capture distant-talking speech by using a microphone array. The proposed method consists of two algorithms: One is a TDOA (Time Delay Of Arrival) estimation algorithm based on a weighted CSP (Cross-power Spectrum Phase) analysis with an average speech spectrum and CSP coefficient subtraction. The other is a talker direction estimation algorithm based on ML (Maximum Likelihood) estimation in a time sequence of the estimated TDOAs. To evaluate the effectiveness of the proposed method, talker direction estimation experiments were carried out in an actual office room. The results confirmed that the talker direction estimation performance of the proposed method is superior to that of the conventional methods in both diffused- and directional-noise environments.},
keywords={},
doi={10.1093/ietisy/e89-d.3.1050},
ISSN={1745-1361},
month={March},}
Copy
TY - JOUR
TI - Robust Talker Direction Estimation Based on Weighted CSP Analysis and Maximum Likelihood Estimation
T2 - IEICE TRANSACTIONS on Information
SP - 1050
EP - 1057
AU - Yuki DENDA
AU - Takanobu NISHIURA
AU - Yoichi YAMASHITA
PY - 2006
DO - 10.1093/ietisy/e89-d.3.1050
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E89-D
IS - 3
JA - IEICE TRANSACTIONS on Information
Y1 - March 2006
AB - This paper describes a new talker direction estimation method for front-end processing to capture distant-talking speech by using a microphone array. The proposed method consists of two algorithms: One is a TDOA (Time Delay Of Arrival) estimation algorithm based on a weighted CSP (Cross-power Spectrum Phase) analysis with an average speech spectrum and CSP coefficient subtraction. The other is a talker direction estimation algorithm based on ML (Maximum Likelihood) estimation in a time sequence of the estimated TDOAs. To evaluate the effectiveness of the proposed method, talker direction estimation experiments were carried out in an actual office room. The results confirmed that the talker direction estimation performance of the proposed method is superior to that of the conventional methods in both diffused- and directional-noise environments.
ER -