文献一覧: Ryoichi Takashima (著者)

3 0 0 0 OA Comparison of real-time multi-speaker neural vocoders on CPUs

著者: Keisuke Matsubara Takuma Okamoto Ryoichi Takashima Tetsuya Takiguchi Tomoki Toda Hisashi Kawai
出版者: ACOUSTICAL SOCIETY OF JAPAN
雑誌: Acoustical Science and Technology (ISSN:13463969)
巻号頁・発行日: vol.43, no.2, pp.121-124, 2022-03-01 (Released:2022-03-01)
参考文献数: 16
被引用文献数: 3

2022-03-01 07:25:18
3 + 3 Twitter

3 0 0 0 OA Investigation of training data size for real-time neural vocoders on CPUs

著者: Keisuke Matsubara Takuma Okamoto Ryoichi Takashima Tetsuya Takiguchi Tomoki Toda Yoshinori Shiga Hisashi Kawai
出版者: ACOUSTICAL SOCIETY OF JAPAN
雑誌: Acoustical Science and Technology (ISSN:13463969)
巻号頁・発行日: vol.42, no.1, pp.65-68, 2021-01-01 (Released:2021-01-01)
参考文献数: 19
被引用文献数: 6

2021-01-01 09:40:17
3 + 5 Twitter

1 0 0 0 OA Single-channel talker localization based on separation of the acoustic transfer function using hidden Markov model and its classification

著者: Ryoichi Takashima Tetsuya Takiguchi Yasuo Ariki
出版者: ACOUSTICAL SOCIETY OF JAPAN
雑誌: Acoustical Science and Technology (ISSN:13463969)
巻号頁・発行日: vol.34, no.3, pp.176-186, 2013-03-01 (Released:2013-05-01)
参考文献数: 21

This paper presents a talker localization method using only a single microphone, where phoneme hidden Markov models (HMMs) of clean speech are introduced to estimate the acoustic transfer function from the user's position. In our previous work, we proposed a Gaussian mixture model (GMM) separation for estimation of the user's position, where the observed speech is separated into the acoustic transfer function and the clean speech GMM. In this paper, we propose an improved method using phoneme HMMs for separation of the acoustic transfer function. This method expresses the speech signal as a network of phoneme HMMs, while our previous method expresses it as a GMM without considering the temporal phonetic changes of the speech signal. The support vector machine (SVM) for classifying the user's position is trained using the separated frame sequences of the acoustic transfer function. Then, for each test data set, the acoustic transfer function is separated, and the position is estimated by discriminating the acoustic transfer function. The effectiveness of this method has been confirmed by talker localization experiments performed in a room environment.

2013-05-24 12:06:36
1 + 0 Twitter