We propose a speech recognition method that fuses auditory and visual information to improve recognition accuracy. Because the method uses both auditory and visual information, it can recognize speech more accurately than a method that uses either one alone. Each information stream is first processed by a hidden Markov model (HMM), and the results are then fused by a linear combination with a weight coefficient. Experiments confirmed the validity of the proposed method.
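The fusion step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulation, so everything here is an assumption made for illustration: that each HMM stream yields one log-likelihood per candidate word, that the weight lies in [0, 1] with the visual stream receiving the complement, and the names `fuse_scores`, `recognize`, `audio`, and `visual`. The scores are invented, not taken from the paper.

```python
def fuse_scores(audio_loglik, visual_loglik, weight):
    """Linearly combine per-word scores from two streams.

    Assumption: `weight` applies to the audio stream and (1 - weight)
    to the visual stream; the paper may define the weighting differently.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return {
        word: weight * audio_loglik[word] + (1.0 - weight) * visual_loglik[word]
        for word in audio_loglik
    }


def recognize(audio_loglik, visual_loglik, weight):
    """Return the candidate word with the highest fused score."""
    fused = fuse_scores(audio_loglik, visual_loglik, weight)
    return max(fused, key=fused.get)


# Hypothetical per-word log-likelihoods, one dict per HMM stream.
audio = {"one": -120.0, "two": -118.0}
visual = {"one": -60.0, "two": -75.0}

print(recognize(audio, visual, 1.0))  # audio stream only
print(recognize(audio, visual, 0.5))  # equal weighting of both streams
```

In this toy data the audio stream alone slightly prefers one word while the visual stream strongly prefers the other, so the decision changes as the weight shifts toward the visual stream. In practice the weight would have to be tuned, for example on held-out data.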
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Akira SHINTANI, Akiko OGIHARA, Naoshi DOI, Shinobu TAKAMATSU, "An Isolated Word Speech Recognition Using Fusion of Auditory and Visual Information" in IEICE TRANSACTIONS on Fundamentals,
vol. E79-A, no. 6, pp. 777-783, June 1996.
Abstract: We propose a speech recognition method that fuses auditory and visual information to improve recognition accuracy. Because the method uses both auditory and visual information, it can recognize speech more accurately than a method that uses either one alone. Each information stream is first processed by a hidden Markov model (HMM), and the results are then fused by a linear combination with a weight coefficient. Experiments confirmed the validity of the proposed method.
URL: https://globals.ieice.org/en_transactions/fundamentals/10.1587/e79-a_6_777/_p
@ARTICLE{e79-a_6_777,
author={Akira SHINTANI and Akiko OGIHARA and Naoshi DOI and Shinobu TAKAMATSU},
journal={IEICE TRANSACTIONS on Fundamentals},
title={An Isolated Word Speech Recognition Using Fusion of Auditory and Visual Information},
year={1996},
volume={E79-A},
number={6},
pages={777--783},
abstract={We propose a speech recognition method that fuses auditory and visual information to improve recognition accuracy. Because the method uses both auditory and visual information, it can recognize speech more accurately than a method that uses either one alone. Each information stream is first processed by a hidden Markov model (HMM), and the results are then fused by a linear combination with a weight coefficient. Experiments confirmed the validity of the proposed method.},
month={June},}
TY - JOUR
TI - An Isolated Word Speech Recognition Using Fusion of Auditory and Visual Information
T2 - IEICE TRANSACTIONS on Fundamentals
SP - 777
EP - 783
AU - Akira SHINTANI
AU - Akiko OGIHARA
AU - Naoshi DOI
AU - Shinobu TAKAMATSU
PY - 1996
JO - IEICE TRANSACTIONS on Fundamentals
VL - E79-A
IS - 6
JA - IEICE TRANSACTIONS on Fundamentals
Y1 - June 1996
AB - We propose a speech recognition method that fuses auditory and visual information to improve recognition accuracy. Because the method uses both auditory and visual information, it can recognize speech more accurately than a method that uses either one alone. Each information stream is first processed by a hidden Markov model (HMM), and the results are then fused by a linear combination with a weight coefficient. Experiments confirmed the validity of the proposed method.
ER -