Takuto YOSHIOKA Kana YAMASAKI Takuya SAWADA Kensaku FUJII Mitsuji MUNEYASU Masakazu MORIMOTO
In this paper, we propose a step size control method capable of quickly canceling acoustic echo even when double talk continues from the echo path change. This method controls the step size by substituting the norm of the difference vector between the coefficient vectors of a main adaptive filter (Main-ADF) and a sub-adaptive filter (Sub-ADF) for the estimation error provided by the former. Actually, the number of taps of Sub-ADF is limited to a quarter of that of Main-ADF, and the larger step size than that applied to Main-ADF is given to Sub-ADF; accordingly the norm of the difference vector quickly approximates to the estimation error. The estimation speed can be improved by utilizing the norm of the difference vector for the step size control in Main-ADF. We show using speech signals that in single talk the proposed method can provide almost the same estimation speed as the method whose step size is fixed at the optimum one and verify that even in double talk the estimation error, quickly decreases.
Mitsuji MUNEYASU Shuhei ODANI Yoshihiro KITAURA Hitoshi NAMBA
On the use of a surveillance camera, there is a case where privacy protection should be considered. This paper proposes a new privacy protection method by automatically degrading the face region in surveillance images. The proposed method consists of ROI coding of JPEG2000 and a face detection method based on template matching. The experimental result shows that the face region can be detected and hidden correctly.
Kensaku FUJII Yoshihisa NAKATANI Mitsuji MUNEYASU
This paper proposes a new method to reduce sinusoidal noise components whose frequencies are known. The new method is based on the simultaneous equations technique. The technique does not require the secondary path filter: thereby the automatic recovering of the noise reduction effect deteriorated by secondary path changes becomes possible. This paper also presents computer simulation results to examine the performance of the new method.
Takamasa FUJII Soh YOSHIDA Mitsuji MUNEYASU
In video search reranking, in addition to the well-known semantic gap, the intent gap, which is the gap between the representation of the users' demand and the real search intention, is becoming a major problem restricting the improvement of reranking performance. To address this problem, we propose video search reranking based on a semantic representation by multiple tags. In the proposed method, we use relevance feedback, which the user can interact with by specifying some example videos from the initial search results. We apply the relevance feedback to reduce the gap between the real intent of the users and the video search results. In addition, we focus on the fact that multiple tags are used to represent video contents. By vectorizing multiple tags associated with videos on the basis of the Word2Vec algorithm and calculating the centroid of the tag vector as a collective representation, we can evaluate the semantic similarity between videos by using tag features. We conduct experiments on the YouTube-8M dataset, and the results show that our reranking approach is effective and efficient.
Mitsuji MUNEYASU Osamu HISAYASU Kensaku FUJII Takao HINAMOTO
A simultaneous equations method is one of active noise control algorithms without estimating an error path. This algorithm requires identification of a transfer function from a reference microphone to an error microphone containing the effect of a noise control filter. It is achieved by system identification of an auxiliary filter. However, the introduction of the auxiliary filter requires more number of samples to obtain the noise control filter and brings a requirement of some undesirable assumption in the multiple channel case. In this paper, a new simultaneous equations method without the identification of the auxiliary filter is proposed. By storing a small number of input signals and error signals, we avoid this identification. Therefore, we can reduce the number of samples to obtain the noise control filters and can avoid the undesirable assumption. From simulation examples, it is verified that the merits of the ordinary method is also retained in the proposed method.
Masayoshi NAKAMOTO Kohei SAYAMA Mitsuji MUNEYASU Tomotaka HARANO Shuichi OHNO
For copyright protection, a watermark signal is embedded in host images with a secret key, and a correlation is applied to judge the presence of watermark signal in the watermark detection. This paper treats a discrete wavelet transform (DWT)-based image watermarking method under specified false positive probability. We propose a new watermarking method to improve the detection performance by using not only positive correlation but also negative correlation. Also we present a statistical analysis for the detection performance with taking into account the false positive probability and prove the effectiveness of the proposed method. By using some experimental results, we verify the statistical analysis and show this method serves to improve the robustness against some attacks.
Mitsuji MUNEYASU Ken'ichi KAGAWA Kensaku FUJII Takao HINAMOTO
For multiple-channel active noise control (ANC) systems, distributed systems consisting of more than one controller are useful. In this paper, we propose a performance improvement algorithm for the distributed multiple-channel ANC system based on the simultaneous equations method. In the proposed algorithm, no estimation of error paths is required. This algorithm can provide good performance in canceling primary noises with auto-/cross-correlations and achieve stable noise reduction under a change of the error paths.
Kensaku FUJII Kenji KASHIHARA Mitsuji MUNEYASU Masakazu MORIMOTO
In this paper, we propose a method capable of shortening the distance from a noise detection microphone to a loudspeaker, which is one of important issues in the field of active noise control (ANC). In the ANC system, the secondary noise provided by the loudspeaker is required arriving at an error microphone simultaneously with the primary noise to be cancelled. However, the reverberation involved in the secondary path from the loudspeaker to the error microphone increases the secondary noise components arriving later than the primary noise. The late components are not only invalid for canceling the primary noise but also impede the cancellation. To reduce the late components, the distance between the noise detection microphone and the loud speaker is generally extended. The proposed method differently reduces the late components by forming the noise control filter, which produces the secondary noise, with the cascade connection of a non-recursive and a recursive filters. The distance can be thus shortened. On the other hand, the recursive filter is required to work stably. The proposed method guarantees the stable work by forming the recursive filter with the lattice filter whose coefficients are restricted to less than unity.
Kensaku FUJII Shigeyuki HASHIMOTO Mitsuji MUNEYASU
This paper presents a frequency domain simultaneous equations method capable of automatically recovering noise reduction effect degraded by secondary path changes. The simultaneous equations method has been studied, first in time domain. Accordingly to the study, in the time domain, the simultaneous equations method requires an additional filter and a system identification circuit used for transforming the solution of the simultaneous equations into the coefficients of noise control filter, which increase the processing cost. To reduce the processing cost, this paper studies on the application of a frequency domain processing technique, the cross spectrum method, to the simultaneous equations method. By directly applying the equation defining the cross spectrum method to the solution, the additional filter becomes unnecessary. In addition, the system identification circuit is replaced with the inverse Fourier transform. Thereby, the processing cost drastically decreases. This paper also presents simulation results to confirm that the proposed method can automatically recover the noise reduction effect degraded by a path change and provides much higher convergence speed than that of the filtered-x NLMS algorithm with the perfectly modeled secondary path filter.
Mitsuji MUNEYASU Kouichiro ASOU Yuji WADA Akira TAGUCHI Takao HINAMOTO
This paper presents a new implementation of fuzzy filters for edge-preserving smoothing of an image corrupted by impulsive and white Gaussian noise. This filter structure is expressed as an adaptive weighted mean filter that uses fuzzy control. The parameters of this filter can be adjusted by learning. Finally, simulation results demonstrate the effectiveness of the proposed technique.
Kensaku FUJII Mitsuji MUNEYASU Takao HINAMOTO Yoshinori TANAKA
The sub-recursive least squares (sub-RLS) algorithm estimates the coefficients of adaptive filter under the least squares (LS) criterion, however, does not require the calculation of inverse matrix. The sub-RLS algorithm, based on the different principle from the RLS algorithm, still provides a convergence property similar to that of the RLS algorithm. This paper first rewrites the convergence condition of the sub-RLS algorithm, and then proves that the convergence property of the sub-RLS algorithm successively approximates that of the RLS algorithm on the convergence condition.
Liang LI Akira ASANO Chie MURAKI ASANO Mitsuji MUNEYASU Yoshiko HANADA
A method of estimating dual primitives in a textural image is proposed. This method is based on the Primitive, Grain, and Point Configuration (PGPC) texture model, which regards a texture as an arrangement of grains derived from one or a few primitives. Appropriate primitives can be represented by morphological structuring elements estimated from a texture. Conventional primitive estimation methods estimate only one primitive from each textural image. However, they do not work well on textural images that contain more than one basic structure, since two or more types of grain cannot be generated from only one primitive. The proposed method simultaneously estimates two optimal structuring elements of a texture. The experimental results show that the proposed method provides more representative estimations than the conventional method.
Hiroyuki OKUNO Yoshiko HANADA Mitsuji MUNEYASU Akira ASANO
In this paper we propose an unsupervised method of optimizing structuring elements (SEs) used for impulse noise reduction in texture images through the opening operation which is one of the morphological operations. In this method, a genetic algorithm (GA), which can effectively search wide search spaces, is applied and the size of the shape of the SE is included in the design variables. Through experiments, it is shown that our new approach generally outperforms the conventional method.
Shoya OOHARA Mitsuji MUNEYASU Soh YOSHIDA Makoto NAKASHIZUKA
For image restoration, an image prior that is obtained from the morphological gradient has been proposed. In the field of mathematical morphology, the optimization of the structuring element (SE) used for this morphological gradient using a genetic algorithm (GA) has also been proposed. In this paper, we introduce a new image prior that is the sum of the morphological gradients and total variation for an image restoration problem to improve the restoration accuracy. The proposed image prior makes it possible to almost match the fitness to a quantitative evaluation such as the mean square error. It also solves the problem of the artifact due to the unsuitability of the SE for the image. An experiment shows the effectiveness of the proposed image restoration method.
In information retrieval from printed images considering the use of mobile devices, the correction of geometrical deformation and lens distortion is required, posing a heavy computational burden. In this paper, we propose a method of reducing the computational burden for such corrections. This method consists of improved extraction to find a line segment of a frame, the reconsideration of the interpolation method for image correction, and the optimization of image resolution in the correction process. The proposed method can reduce the number of computations significantly. The experimental result shows the effectiveness of the proposed method.
Kensaku FUJII Ryo AOKI Mitsuji MUNEYASU
This paper proposes an adaptive algorithm for identifying unknown systems containing nonlinear amplitude characteristics. Usually, the nonlinearity is so small as to be negligible. However, in low cost systems, such as acoustic echo canceller using a small loudspeaker, the nonlinearity deteriorates the performance of the identification. Several methods preventing the deterioration, polynomial or Volterra series approximations, have been hence proposed and studied. However, the conventional methods require high processing cost. In this paper, we propose a method approximating the nonlinear characteristics with a piecewise linear curve and show using computer simulations that the performance can be extremely improved. The proposed method can also reduce the processing cost to only about twice that of the linear adaptive filter system.
Soh YOSHIDA Mitsuji MUNEYASU Takahiro OGAWA Miki HASEYAMA
In this paper, we address the problem of analyzing topics, included in a social video group, to improve the retrieval performance of videos. Unlike previous methods that focused on an individual visual aspect of videos, the proposed method aims to leverage the “mutual reinforcement” of heterogeneous modalities such as tags and users associated with video on the Internet. To represent multiple types of relationships between each heterogeneous modality, the proposed method constructs three subgraphs: user-tag, video-video, and video-tag graphs. We combine the three types of graphs to obtain a heterogeneous graph. Then the extraction of latent features, i.e., topics, becomes feasible by applying graph-based soft clustering to the heterogeneous graph. By estimating the membership of each grouped cluster for each video, the proposed method defines a new video similarity measure. Since the understanding of video content is enhanced by exploiting latent features obtained from different types of data that complement each other, the performance of visual reranking is improved by the proposed method. Results of experiments on a video dataset that consists of YouTube-8M videos show the effectiveness of the proposed method, which achieves a 24.3% improvement in terms of the mean normalized discounted cumulative gain in a search ranking task compared with the baseline method.
Mitsuji MUNEYASU Yumi WAKASUGI Osamu HISAYASU Kensaku FUJII Takao HINAMOTO
This paper proposes a new hybrid active noise control (ANC) system without the estimation of the secondary path filter in advance. The algorithm of the feedforward part of the proposed method is based on the simultaneous equations method and the feedback part employs the filtered-X LMS algorithm. The estimation of the secondary path filter is obtained in the operation of the feedforward part and it is used in the feedback part. When the secondary path changes in the operation of the system, the proposed system can follow to this change. In the simulation example which treats the colored measurement noise, the fine noise reduction performance is obtained.
Kensaku FUJII Mitsuji MUNEYASU Takao HINAMOTO Yoshinori TANAKA
The normalized least mean square (NLMS) algorithm has the drawback that the convergence speed of adaptive filter coefficients decreases when the reference signal has high auto-correlation. A technique to improve the convergence speed is to apply the decorrelated reference signal to the calculation of the gradient defined in the NLMS algorithm. So far, only the effect of the improvement is experimentally examined. The convergence property of the adaptive algorithm to which the technique is applied is not analized yet enough. This paper first defines a cost function properly representing the criterion to estimate the coefficients of adaptive filter. The name given in this paper to the adaptive algorithm exploiting the decorrelated reference signal, 'normalized least mean EE' algorithm, exactly expresses the criterion. This adaptive algorithm estimates the coefficients so as to minimize the product of E and E' that are the differences between the responses of the unknown system and the adaptive filter to the original and the decorrelated reference signals, respectively. By using the cost function, this paper second specifies the convergence condition of the normalized least mean EE' algorithm and finally presents computer simulations, which are calculated using real speech signal, to demonstrate the validity of the convergence condition.
Mitsuji MUNEYASU Nayuta JINDA Yuuya MORITANI Soh YOSHIDA
In this paper, we propose a method of embedding and detecting data in printed images with several formats, such as different resolutions and numbers of blocks, using the camera of a tablet device. To specify the resolution of an image and the number of blocks, invisible markers that are embedded in the amplitude domain of the discrete Fourier transform of the target image are used. The proposed method can increase the variety of images suitable for data embedding.