著者
Arata KAWAMURA Hiro IGARASHI Youji IIGUNI
出版者
一般社団法人 電子情報通信学会
雑誌
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences (ISSN:09168508)
巻号頁・発行日
vol.E100.A, no.3, pp.893-895, 2017-03-01 (Released:2017-03-01)
参考文献数
11
被引用文献数
2

Image-to-sound mapping is a technique that transforms an image to a sound signal, which is subsequently treated as a sound spectrogram. In general, the transformed sound differs from a human speech signal. Herein an efficient image-to-sound mapping method, which provides an understandable speech signal without any training, is proposed. To synthesize such a speech signal, the proposed method utilizes a multi-column image and a speech spectral phase that is obtained from a long-time observation of the speech. The original image can be retrieved from the sound spectrogram of the synthesized speech signal. The synthesized speech and the reconstructed image qualities are evaluated using objective tests.

言及状況

外部データベース (DOI)

Twitter (1 users, 1 posts, 0 favorites)

An efficient image to sound mapping method using speech spectral phase and multi-column image https://t.co/ePL5hMHblm

収集済み URL リスト