Akira SHINTANI Akio OGIHARA Yoshikazu YAMAGUCHI Yasuhisa HAYASHI Kunio FUKUNAGA
We propose two methods to fuse auditory and visual information for accurate speech recognition. The first method fuses the two kinds of information by linear combination after an HMM computes a probability of each kind for every word. The second method fuses the two kinds of information using a histogram that expresses the correlation between them. We have performed experiments comparing the proposed methods with the conventional method and confirmed their validity.
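A minimal sketch of the first method, the linear combination of per-word HMM scores, is given below. The weight alpha, the use of log-likelihoods, and the HMM interface (a log_likelihood method) are illustrative assumptions, not details taken from the abstract.

```python
# Sketch of linear-combination fusion of audio and visual HMM scores.
# alpha, log-likelihood scoring, and the HMM interface are assumptions.

def fuse_and_recognize(audio_obs, visual_obs, audio_hmms, visual_hmms, alpha=0.7):
    """Return the word whose weighted audio/visual HMM score is highest.

    audio_hmms / visual_hmms: dicts mapping each word to a trained HMM
    that exposes a log_likelihood method (assumed interface).
    """
    best_word, best_score = None, float("-inf")
    for word in audio_hmms:
        log_p_audio = audio_hmms[word].log_likelihood(audio_obs)     # log P(O_a | word)
        log_p_visual = visual_hmms[word].log_likelihood(visual_obs)  # log P(O_v | word)
        # Linear combination of the two modality scores.
        score = alpha * log_p_audio + (1.0 - alpha) * log_p_visual
        if score > best_score:
            best_word, best_score = word, score
    return best_word
```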
Naoshi DOI Akira SHINTANI Yasuhisa HAYASHI Akio OGIHARA Shinobu TAKAMATSU
Recently, speech recognition methods that fuse visual and auditory information have been studied. This paper describes a study of mouth-shape images suitable for such fusion. Mouth-shape features extracted from the gray-level image and from the binary image are adopted, and speech recognition using the linear-combination method is performed. From the recognition results, we examine which mouth-shape features are effective in the fusion of visual and auditory information, and we also confirm the effectiveness of using the two kinds of mouth-shape features together.
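The following sketch illustrates one way such gray-level and binary mouth-shape features could be computed. The specific features (mouth width, height, and area from the binary image; mean intensity from the gray-level image) and the fixed threshold are assumptions for illustration only, not the features reported in the paper.

```python
# Illustrative extraction of mouth-shape features from a gray-level
# mouth-region image and its binarized version (assumed feature set).
import numpy as np

def mouth_shape_features(gray_region, threshold=128):
    """gray_region: 2-D uint8 array containing the mouth region."""
    binary = gray_region < threshold            # dark pixels taken as lips/mouth
    ys, xs = np.nonzero(binary)
    if len(xs) == 0:                            # no mouth pixels detected
        width = height = area = 0
    else:
        width = xs.max() - xs.min() + 1         # horizontal extent of the mouth
        height = ys.max() - ys.min() + 1        # vertical extent of the mouth
        area = int(binary.sum())                # number of mouth pixels
    mean_intensity = float(gray_region.mean())  # gray-level feature
    return np.array([width, height, area, mean_intensity], dtype=float)
```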
Akira SHINTANI Akio OGIHARA Naoshi DOI Shinobu TAKAMATSU
We propose a speech recognition method that fuses auditory and visual information for accurate speech recognition. Because both auditory and visual information are used, recognition is more accurate than with either kind of information alone. After each kind of information is processed by an HMM, the two are fused by a linear combination with a weight coefficient. We performed experiments and confirmed the validity of the proposed method.
Satoru IGAWA Akio OGIHARA Akira SHINTANI Shinobu TAKAMATSU
We propose a method to fuse auditory and visual information for accurate speech recognition. The method fuses the two kinds of information by linear combination after an HMM computes a probability of each kind for every word. In addition, we use a full-frame color image as the visual information in order to improve the accuracy of the proposed speech recognition system. We have performed experiments comparing the proposed method with methods using either auditory or visual information alone, and confirmed its validity.
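As a rough illustration of using a full-frame color image as the visual observation, the sketch below downsamples and flattens the RGB pixels of one frame into a feature vector. The target size, the nearest-neighbour downsampling, and the [0, 1] normalization are assumptions, not details from the abstract.

```python
# Sketch: convert one full-frame color image into a visual feature vector
# by downsampling and flattening its RGB pixels (assumed representation).
import numpy as np

def frame_to_feature(rgb_frame, target_shape=(16, 16)):
    """rgb_frame: H x W x 3 uint8 array; returns a flat, normalized vector."""
    h, w, _ = rgb_frame.shape
    th, tw = target_shape
    # Nearest-neighbour downsampling by index striding (keeps the sketch
    # dependency-free; a real system would use proper image resizing).
    rows = np.linspace(0, h - 1, th).astype(int)
    cols = np.linspace(0, w - 1, tw).astype(int)
    small = rgb_frame[np.ix_(rows, cols)]       # shape (th, tw, 3)
    return small.astype(float).flatten() / 255.0  # scale to [0, 1]
```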