著者
Kei Sawada Kei Hashimoto Keiichiro Oura Yoshihiko Nankaku Keiichi Tokuda
出版者
ACOUSTICAL SOCIETY OF JAPAN
雑誌
Acoustical Science and Technology (ISSN:13463969)
巻号頁・発行日
vol.39, no.2, pp.119-129, 2018-03-01 (Released:2018-03-01)
参考文献数
35

This paper proposes a method for constructing text-to-speech (TTS) systems for languages with unknown pronunciations. One goal of speech synthesis research is to establish a framework that can be used to construct TTS systems for any written language. Generally, language-specific knowledge is required to construct TTS systems for a new language. However, it is difficult to acquire language-specific knowledge in each new language. Therefore, constructing a TTS system for a new language entails huge costs. To address this problem, we investigate a framework for automatically constructing a TTS system from a target language database consisting of only speech data and corresponding Unicode texts. In the proposed method, pseudo phonetic information of the target language with unknown pronunciation is obtained by a speech recognizer of a rich-resource proxy language. Then, a grapheme-to-phoneme converter and a statistical parametric speech synthesizer are constructed based on the obtained pseudo phonetic information. The proposed method was applied to Japanese and was evaluated in terms of objective and subjective measures. Additionally, we challenged the construction of TTS systems for nine Indian languages using the proposed method, and TTS systems were evaluated in the Blizzard Challenge 2014 and 2015.

言及状況

外部データベース (DOI)

Twitter (14 users, 14 posts, 23 favorites)

https://t.co/eKP5iEYvMm 未知の言語のtext to speechシステムの構築が、その言語の音のデータ学習とユニコードだけでできるらしい
「Constructing text-to-speech systems for languages with unknown pronunciations」と題する論文がAcoustical Science and Technologyに掲載されましたhttps://t.co/uHoUkLgSxf。 音素等の発音が未知の言語に対して音声合成システムを構築する手法について論じています。

収集済み URL リスト