A Statistical Language Model for Utterance Splitting in the Speech Recognition Process

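Authors: 中嶋秀治, 山本博史
Publisher: Information Processing Society of Japan (一般社団法人情報処理学会)
Journal: IPSJ Journal (情報処理学会論文誌) (ISSN: 1882-7764)
Volume, issue, pages, and publication date: vol.42, no.11, pp.2681-2688, 2001-11-15
Number of references: 15
Number of citing works: 2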

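In natural spoken dialog, a single utterance (one stretch of speech) often contains several sentences. Speech recognizers process one utterance at a time, but it is difficult to run language processing such as understanding, translation, or summarization on an utterance that contains several sentences as a single unit, so the utterance must be split into sentences or similar units after speech recognition or before language processing. This paper therefore proposes a method that splits utterances containing multiple sentences into individual sentences by recognizing the period, which marks a sentence boundary, in the same way as an ordinary word. In evaluation experiments, splitting performance reached at best 94% recall and 100% precision. In addition, including periods in the language model caused no degradation in the recognition rate of words other than periods, which confirms the effectiveness of the method.

In spontaneous dialogs, there are utterances containing several sentences. Although speech recognizers process utterances one by one, language processing such as understanding, translation or summarization needs to split utterances into sentences. This paper presents utterance splitting by recognizing periods, i.e., sentence boundaries, as well as usual words. We evaluate the performance of the model in terms of splitting and word (except for periods) accuracy. Experimental results show high recall/precision rates of splitting (the highest scores are 94%/100%) and no reduction of other word accuracy, proving the applicability of the proposed method.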
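
The splitting step and its recall/precision scoring can be illustrated with a minimal sketch. It is not the authors' implementation: the boundary token `<period>`, the function names, and the scoring by word position are all assumptions made for illustration, and the abstract does not describe how the paper actually computed its scores. The sketch also assumes the reference and the recognizer output contain the same non-period words, so boundaries can be compared by position without alignment.

```python
from typing import List, Set, Tuple

# Hypothetical token that the recognizer emits for a sentence boundary.
PERIOD = "<period>"


def split_utterance(tokens: List[str], period: str = PERIOD) -> List[List[str]]:
    """Split a recognized token sequence into sentences at period tokens."""
    sentences: List[List[str]] = []
    current: List[str] = []
    for tok in tokens:
        if tok == period:
            if current:  # ignore empty sentences from repeated periods
                sentences.append(current)
                current = []
        else:
            current.append(tok)
    if current:  # trailing words with no final period
        sentences.append(current)
    return sentences


def boundary_positions(tokens: List[str], period: str = PERIOD) -> Set[int]:
    """Return, for each period, the number of ordinary words preceding it."""
    positions: Set[int] = set()
    n_words = 0
    for tok in tokens:
        if tok == period:
            positions.add(n_words)
        else:
            n_words += 1
    return positions


def splitting_scores(
    reference: List[str], hypothesis: List[str], period: str = PERIOD
) -> Tuple[float, float]:
    """Recall and precision of hypothesized boundaries against the reference."""
    ref = boundary_positions(reference, period)
    hyp = boundary_positions(hypothesis, period)
    correct = len(ref & hyp)
    recall = correct / len(ref) if ref else 1.0
    precision = correct / len(hyp) if hyp else 1.0
    return recall, precision


reference = ["yes", PERIOD, "two", "nights", "please", PERIOD]
hypothesis = ["yes", "two", "nights", "please", PERIOD]
sentences = split_utterance(reference)
recall, precision = splitting_scores(reference, hypothesis)
```

In this toy case the hypothesis misses the first boundary, so every boundary it proposes is correct but only half of the reference boundaries are found. With real recognizer output containing word errors, the two sequences would first need to be aligned before boundaries are compared.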

2019-03-09 07:45:24
https://ci.nii.ac.jp/naid/110002726049

Mentions

Twitter (1 user, 1 post, 0 favorites)

How about this paper? A Statistical Language Model for Utterance Splitting in the Speech Recognition Process (中嶋秀治 et al.), 2001 https://t.co/bbqxvrrrG7

Collected URLs

https://ci.nii.ac.jp/naid/110002726049 (1)