文献一覧: 福岡維新 (著者)

1 0 0 0 OA 会話によるニュース記事伝達のための音声合成

著者: 高津弘明福岡維新藤江真也岩田和彦小林哲則
出版者: 一般社団法人人工知能学会
雑誌: 人工知能学会論文誌 (ISSN:13460714)
巻号頁・発行日: vol.34, no.2, pp.B-I65_1-15, 2019-03-01 (Released:2019-03-01)
参考文献数: 46
被引用文献数: 1

We have been developing a speech-based “news-delivery system”, which can transmit news contents via spoken dialogues. In such a system, a speech synthesis sub system that can flexibly adjust the prosodic features in utterances is highly vital: the system should be able to highlight spoken phrases containing noteworthy information in an article; it should also provide properly controlled pauses between utterances to facilitate user’s interactive reactions including questions. To achieve these goals, we have decided to incorporate the position of the utterance in the paragraph and the role of the utterance in the discourse structure into the bundle of features for speech synthesis. These features were found to be crucially important in fulfilling the above-mentioned requirements for the spoken utterances by the thorough investigation into the news-telling speech data uttered by a voice actress. Specifically, these features dictate the importance of information carried by spoken phrases, and hence should be effectively utilized in synthesizing prosodically adequate utterances. Based on these investigations, we devised a deep neural network-based speech synthesis model that takes as input the role and position features. In addition, we designed a neural network model that can estimate an adequate pause length between utterances. Experimental results showed that by adding these features to the input, it becomes more proper speech for information delivery. Furthermore, we confirmed that by inserting pauses properly, it becomes easier for users to ask questions during system utterances.

2019-03-06 19:25:36
1 + 0 Twitter

1 0 0 0 OA 漸増的な情報補完機能を有する音声対話システム

著者: 福岡維新麥田愛純高津弘明藤江真也林良彦小林哲則
出版者: 人工知能学会
雑誌: 人工知能学会全国大会論文集 (ISSN:13479881)
巻号頁・発行日: vol.29, 2015

ニュース記事の伝達をタスクとして,聞き手が挟む相槌や聞き返しに応じて適宜情報を補完しながら会話を進める音声対話システムを提案する.要点を伝える主発話計画と,それに対し情報を補完する副発話計画を記事から自動生成する機能,これらの発話計画を聞き手の反応に暗に表れる理解状態に応じて切り替える機能を実装した.これらにより,リズムある対話による効率的な情報伝達が実現できた.

2015-05-16 18:46:59
1 + 0 Twitter

https://kaigi.org/jsai/webprogram/2015/paper-540.html