著者
Taichi Fukawa Kenya Jin'no
出版者
The Institute of Electronics, Information and Communication Engineers
雑誌
Nonlinear Theory and Its Applications, IEICE (ISSN:21854106)
巻号頁・発行日
vol.13, no.2, pp.277-281, 2022 (Released:2022-04-01)
参考文献数
15

For an indefinite length spectrogram sequence of phonemes, we experimentally verified two methods of obtaining speaker embedding by transforming it to fixed length: adding padding and time stretching. We confirmed that both methods can maintain the extraction performance. We also confirm that the fixed frame length does not affect the results.