IEICE globals.ieice.org Site

Author Search Result

[Author] Ryo MUKAI (4 hits)

Blind Source Separation of Convolutive Mixtures of Speech in Frequency Domain
Shoji MAKINO Hiroshi SAWADA Ryo MUKAI Shoko ARAKI

INVITED PAPER

Vol: E88-A No: 7 Page(s): 1640-1655
This paper overviews a total solution for frequency-domain blind source separation (BSS) of convolutive mixtures of audio signals, especially speech. Frequency-domain BSS performs independent component analysis (ICA) in each frequency bin, and this is more efficient than time-domain BSS. We describe a sophisticated total solution for frequency-domain BSS, including permutation, scaling, circularity, and complex activation function solutions. Experimental results of 2×2, 3×3, 4×4, 6×8, and 2×2 (moving sources), (#sources × #microphones) in a room are promising.

Polar Coordinate Based Nonlinear Function for Frequency-Domain Blind Source Separation
Hiroshi SAWADA Ryo MUKAI Shoko ARAKI Shoji MAKINO

PAPER-Convolutive Systems

Vol: E86-A No: 3 Page(s): 590-596
This paper discusses a nonlinear function for independent component analysis to process complex-valued signals in frequency-domain blind source separation. Conventionally, nonlinear functions based on the Cartesian coordinates are widely used. However, such functions have a convergence problem. In this paper, we propose a more appropriate nonlinear function that is based on the polar coordinates of a complex number. In addition, we show that the difference between the two types of functions arises from the assumed densities of independent components. Our discussion is supported by several experimental results for separating speech signals, which show that the polar type nonlinear functions behave better than the Cartesian type.
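To make the distinction between the two coordinate systems concrete, the following is a minimal sketch of one Cartesian-type and one polar-type nonlinearity applied to a single complex sample. The use of tanh, the function names, and the `gain` parameter are assumptions chosen for illustration; the abstract does not state the exact functions used in the paper. The sketch only shows a structural property: the Cartesian form generally changes the phase of its input, while the polar form compresses the modulus and leaves the phase untouched.

```python
import cmath
import math


def cartesian_nonlinearity(y: complex) -> complex:
    """Cartesian type (assumed form): tanh applied to real and imaginary parts separately."""
    return complex(math.tanh(y.real), math.tanh(y.imag))


def polar_nonlinearity(y: complex, gain: float = 1.0) -> complex:
    """Polar type (assumed form): tanh applied to the modulus only; the phase is kept."""
    return math.tanh(gain * abs(y)) * cmath.exp(1j * cmath.phase(y))


if __name__ == "__main__":
    # A sample with modulus 2 and phase pi/3 (illustrative value).
    y = 2.0 * cmath.exp(1j * math.pi / 3)
    for name, func in (("cartesian", cartesian_nonlinearity),
                       ("polar", polar_nonlinearity)):
        out = func(y)
        print(name, "modulus:", abs(out), "phase:", cmath.phase(out))
```

Running the script prints the modulus and phase of each output, so the phase shift introduced by the Cartesian form can be compared directly with the input phase.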
Blind Source Separation for Moving Speech Signals Using Blockwise ICA and Residual Crosstalk Subtraction
Ryo MUKAI Hiroshi SAWADA Shoko ARAKI Shoji MAKINO

PAPER-Speech/Acoustic Signal Processing

Vol: E87-A No: 8 Page(s): 1941-1948
This paper describes a real-time blind source separation (BSS) method for moving speech signals in a room. Our method employs frequency-domain independent component analysis (ICA) using a blockwise batch algorithm in the first stage, and the separated signals are refined by postprocessing using crosstalk component estimation and non-stationary spectral subtraction in the second stage. The blockwise batch algorithm achieves better performance than an online algorithm when sources are fixed, and the postprocessing compensates for performance degradation caused by source movement. Experimental results using speech signals recorded in a real room show that the proposed method realizes robust real-time separation for moving sources. Our method is implemented on a standard PC and runs in real time.
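The second-stage idea of subtracting an estimated crosstalk component can be sketched with generic magnitude spectral subtraction on one frame of frequency bins. This is a textbook-style illustration, not the paper's non-stationary variant: the function name, the `floor` parameter, and the assumption that a per-bin crosstalk magnitude estimate is already available are all hypothetical.

```python
import cmath


def spectral_subtract(separated, crosstalk_mag, floor=0.0):
    """Subtract an estimated crosstalk magnitude from each bin, keeping the phase.

    separated     -- complex spectrum of one separated output (one frame)
    crosstalk_mag -- assumed per-bin estimate of the residual crosstalk magnitude
    floor         -- fraction of the original magnitude kept as a lower bound
    """
    refined = []
    for y, c in zip(separated, crosstalk_mag):
        # Clamp so the refined magnitude never becomes negative.
        mag = max(abs(y) - c, floor * abs(y))
        refined.append(mag * cmath.exp(1j * cmath.phase(y)))
    return refined


if __name__ == "__main__":
    frame = [3 + 4j, 1 - 1j, 0.5j]   # illustrative bin values
    estimate = [2.0, 5.0, 0.1]       # illustrative crosstalk magnitudes
    for before, after in zip(frame, spectral_subtract(frame, estimate)):
        print(abs(before), "->", abs(after))
```

A nonzero `floor` is a common way to avoid bins being zeroed out entirely when the crosstalk estimate exceeds the observed magnitude.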
Multistage SIMO-Model-Based Blind Source Separation Combining Frequency-Domain ICA and Time-Domain ICA
Satoshi UKAI Tomoya TAKATANI Hiroshi SARUWATARI Kiyohiro SHIKANO Ryo MUKAI Hiroshi SAWADA

PAPER

Vol: E88-A No: 3 Page(s): 642-650
This paper addresses single-input multiple-output (SIMO)-model-based blind source separation (BSS), in which the mixed source signals observed at the microphones are separated not into monaural source signals but into SIMO-model-based signals, that is, the signals from each independent source as they are observed at the microphones. This technique is highly applicable to high-fidelity signal processing such as binaural signal processing. First, we provide an experimental comparison between two kinds of SIMO-model-based BSS methods, namely, conventional frequency-domain ICA with projection-back processing (FDICA-PB), and SIMO-ICA, which was recently proposed by the authors. Second, we propose a new technique combining FDICA-PB and SIMO-ICA, which can achieve a higher separation performance than either method alone. The experimental results reveal that the accuracy of the separated SIMO signals in the simple SIMO-ICA is inferior to that of the signals obtained by FDICA-PB under low-quality initial value conditions, but the proposed combination technique can outperform both simple FDICA-PB and SIMO-ICA.
