An Efficient Image to Sound Mapping Method Using Speech Spectral Phase and Multi-Column Image

Arata KAWAMURA, Hiro IGARASHI, Youji IIGUNI

  • Full Text Views

    0

  • Cite this

Summary :

Image-to-sound mapping is a technique that transforms an image to a sound signal, which is subsequently treated as a sound spectrogram. In general, the transformed sound differs from a human speech signal. Herein an efficient image-to-sound mapping method, which provides an understandable speech signal without any training, is proposed. To synthesize such a speech signal, the proposed method utilizes a multi-column image and a speech spectral phase that is obtained from a long-time observation of the speech. The original image can be retrieved from the sound spectrogram of the synthesized speech signal. The synthesized speech and the reconstructed image qualities are evaluated using objective tests.

Publication
IEICE TRANSACTIONS on Fundamentals Vol.E100-A No.3 pp.893-895
Publication Date
2017/03/01
Publicized
Online ISSN
1745-1337
DOI
10.1587/transfun.E100.A.893
Type of Manuscript
LETTER
Category
Digital Signal Processing

Authors

Arata KAWAMURA
  Osaka University
Hiro IGARASHI
  Osaka University
Youji IIGUNI
  Osaka University

Keyword

FlyerIEICE has prepared a flyer regarding multilingual services. Please use the one in your native language.