著者
Takafumi KOSHINAKA Kentaro NAGATOMO Koichi SHINODA
出版者
The Institute of Electronics, Information and Communication Engineers
雑誌
IEICE TRANSACTIONS on Information and Systems (ISSN:09168532)
巻号頁・発行日
vol.E95-D, no.10, pp.2469-2478, 2012-10-01

A novel online speaker clustering method based on a generative model is proposed. It employs an incremental variant of variational Bayesian learning and provides probabilistic (non-deterministic) decisions for each input utterance, on the basis of the history of preceding utterances. It can be expected to be robust against errors in cluster estimation and the classification of utterances, and hence to be applicable to many real-time applications. Experimental results show that it produces 50% fewer classification errors than does a conventional online method. They also show that it is possible to reduce the number of speech recognition errors by combining the method with unsupervised speaker adaptation.

言及状況

Twitter (1 users, 1 posts, 0 favorites)

2月に投稿した論文が採録されて、今日やっと掲載された。ここにたどり着くまで思いのほか長い時間を食ってしまったけど、助言くださった先生方、同僚、論文誌編集委員、査読委員、ほか皆さんに感謝。さて祝杯は何を飲もうかな?http://t.co/AO6oYahS

収集済み URL リスト