著者
中澤 和司 近藤 和弘
出版者
一般社団法人 電気学会
雑誌
電気学会論文誌C(電子・情報・システム部門誌) (ISSN:03854221)
巻号頁・発行日
vol.143, no.8, pp.830-841, 2023-08-01 (Released:2023-08-01)
参考文献数
41
被引用文献数
2

In this paper, we made improvements and evaluated our proposed model for non-reference speech intelligibility estimation on reverberant speech, attempting to improve the estimation accuracy significantly. Our proposed method consists of a DNN for speech enhancement and a separate DNN for intelligibility estimation. The latter uses features obtained from enhanced and degraded speech to estimate intelligibility. Although previous studies have effectively estimated intelligibility for speech degraded by additive noise using similar models, they did not consider the degradation of distortion caused by reverberation. They also did not quantify the effect of various speech enhancement DNN models, the structure of the intelligibility prediction DNN, and the selection of parameters during feature calculation on estimation accuracy. Accordingly, we compared two top-of-the-line speech enhancement DNN models and used their output to train intelligibility prediction DNNs for reverberant speech while also varying the parameters used in the feature calculation. Consequently, the linear correlation coefficient between subjective and estimated intelligibility came to 0.801 with the best combination.