Voice Activity Detection with Array Signal Processing in the Wavelet Domain

Yusuke HIOKA, Nozomu HAMADA

Summary:

In speech enhancement with an adaptive microphone array, voice activity detection (VAD) is indispensable for adaptation control. Although many VAD methods have been proposed as pre-processors for speech recognition and compression, they can hardly discriminate the nonstationary interferences that frequently occur in real environments. In this research, we propose a novel VAD method based on array signal processing in the wavelet domain. In that domain, temporal, spectral, and spatial information can be integrated to achieve robust voice activity discrimination, even for a nonstationary interference arriving from a direction close to that of the speech. The signals acquired by the microphone array are first decomposed into appropriate subbands using wavelet packets to extract their temporal and spectral features. A directionality check and direction estimation are then executed on each subband to perform VAD with respect to the spatial information. Computer simulation results on sound data demonstrate that the proposed method retains its discriminability even for interference arriving from a direction close to that of the speech.
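The pipeline outlined in the summary (subband decomposition, then a per-subband directionality check and direction estimate) can be illustrated with a minimal two-microphone sketch. This is not the authors' implementation. It assumes an undecimated Haar wavelet packet with circular boundaries, a cross-correlation delay estimate standing in for direction estimation, a normalized-correlation threshold standing in for the directionality check, and a simple majority vote. The function names and the parameters `coh_min` and `vote_min` are hypothetical.

```python
import numpy as np

def stationary_haar_packet(x, levels):
    """Undecimated Haar wavelet packet (assumed filter bank, circular edges).

    Returns 2**levels subband signals, each as long as the input.
    """
    bands = [np.asarray(x, dtype=float)]
    for j in range(levels):
        step = 2 ** j  # filter taps spread out by 2**j at level j
        nxt = []
        for b in bands:
            delayed = np.roll(b, step)
            nxt.append((b + delayed) / 2.0)  # low-pass branch
            nxt.append((b - delayed) / 2.0)  # high-pass branch
        bands = nxt
    return bands

def best_lag(a, b, max_lag):
    """Lag (in samples) by which b trails a, from circular cross-correlation."""
    lags = range(-max_lag, max_lag + 1)
    scores = [float(np.dot(a, np.roll(b, -lag))) for lag in lags]
    k = int(np.argmax(scores))
    return lags[k], scores[k]

def subband_vad(x1, x2, target_delay, levels=3, max_lag=8,
                coh_min=0.5, vote_min=0.5):
    """Flag a frame as target speech if enough directional subbands
    show the inter-microphone delay expected for the target direction."""
    votes, total = 0, 0
    for a, b in zip(stationary_haar_packet(x1, levels),
                    stationary_haar_packet(x2, levels)):
        norm = np.linalg.norm(a) * np.linalg.norm(b)
        if norm == 0.0:
            continue  # empty subband carries no spatial information
        lag, peak = best_lag(a, b, max_lag)
        if peak / norm < coh_min:
            continue  # hypothetical directionality check: no clear source
        total += 1
        votes += int(lag == target_delay)  # direction matches the target
    return bool(total > 0 and votes / total >= vote_min)

# Synthetic frame: one broadband source reaching mic 2 three samples late.
rng = np.random.default_rng(0)
source = rng.standard_normal(1024)
mic1, mic2 = source, np.roll(source, 3)
from_target = subband_vad(mic1, mic2, target_delay=3)
off_target = subband_vad(mic1, mic2, target_delay=-2)
```

In this toy setting a source is accepted only when its per-subband delays agree with the expected target delay, which is the sense in which spatial information is combined with the temporal and spectral split. Real recordings would need framing, noise handling, and a direction estimator suited to the actual array geometry.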

Publication
IEICE TRANSACTIONS on Fundamentals Vol.E86-A No.11 pp.2802-2811
Publication Date
2003/11/01
Type of Manuscript
PAPER
Category
Engineering Acoustics
