著者
樋口 寛晃 旭 健作 佐川 雄二 杉江 昇
出版者
一般社団法人 電気学会
雑誌
電気学会論文誌C(電子・情報・システム部門誌) (ISSN:03854221)
巻号頁・発行日
vol.124, no.12, pp.2439-2445, 2004 (Released:2005-03-01)
参考文献数
18
被引用文献数
1 2

We propose a method for separating speeches using two spectrograms. First, two spectrograms are generated from voices recorded with a pair of microphones. The onsets and the offsets of the frequency components are extracted as the features using image processing techniques. Then the correspondences of the features between the spectrograms are determined and the intermicrophone time differences are calculated. Each of frequency components with the common onset/offset occurrences and time difference are grouped together as originating one of the speech signals. A set of band-pass filters are generated corresponding to each group of frequency components. Finally, each of the separated speech signals is extracted by applying the set of band-pass filters to the voice signal recorded by a microphone. Experiments were conducted with the mixture of a male speech sound and a female speech sound consisting of Japanese vowel and contain consonant. The evaluation results demonstrated that the separation was done reasonably well with the proposed method.