著者
河原 英紀 栃内 香次 永田 邦一
出版者
一般社団法人 日本音響学会
雑誌
日本音響学会誌 (ISSN:03694232)
巻号頁・発行日
vol.33, no.9, pp.470-479, 1977-09-01 (Released:2017-06-02)

This paper describes a speech analysis method for linear predictor coefficients and formant frequencies and bandwidths estimation using a portion of one pitch period of speech waveform. This method is based on the fact that estimation errors together with prediction errors vanish when excitation-free segments are utilized (Eq. 6). In the case of actual speech analysis, however, prediction errors may remain, even if they are minimum (Fig. 1). Therefore, it is necessary to formulate estimation errors in order to evaluate the performance of the proposed method. Theoretical studies on estimation errors are carried out. Estimation errors in linear predictor coefficients are dependent on the unknown component of excitation (see Eqs. 7 through 11). Expected values of estimation errors are derived, based on the assumption of the statistical property of excitation (see Eqs. 13 through 15) . A similar expression of estimation errors in formant frequencies and bandwidths is derived in the same manner (see Eqs. 16 through 18). Finally, two kinds of error estimates represented in terms of observed values are introduced as criteria for the determination of analysis conditions of our method and for the experimental comparison between the proposed method and the usual linear predictive analysis (Eq. 19). Simulation studies on these error estimates are performed using several kinds of synthetic speech. Their formant frequencies and bandwidths are given in Table 1. The relevance of these error estimates is indicated in the case of our method and in the case of usual linear predictive analysis (Figs. 2 through 5 and Tables 2 and 3). The length of the analysis segment must be properly chosen when these error estimates are employed in the error evaluation of the usual linear predictive analysis of periodic speech (Eq. 20 and Fig. 6). The results of simulation studies indicate both adequacy and validity of these error estimates. Experimental studies have been done to evaluate the performance of our method in the case of actual speech analysis. The minimum values of these error estimates of our method are as small as one half to one eighth of those of usual linear predictive analysis (Figs. 7 through 9 and Tables 4 and 5). In addition, the glottal source waveform estimated by using the result of our method seems more plausible than that obtained by using the result of the usual linear predictive analysis (Fig. 10). The results of these studies indicate that the proposed method yields more accurate estimates of parameters than those obtained by the usual linear predictive analysis, especially in the case of low-pitched speech.
著者
荒木 健治 栃内 香次 永田 邦一
出版者
一般社団法人情報処理学会
雑誌
情報処理学会論文誌 (ISSN:18827764)
巻号頁・発行日
vol.28, no.4, pp.412-421, 1987-04-15
被引用文献数
6

べた書き文のかな漢字変換方式においては 計算機で文を分割する際に一般に極めて多数の候補が出現し それを一つに決定することが大きな問題となる.本論文は この問題に対して 他の語と重なりあうことなく確実に分離できる部分(キーワード)から順次 段階的に単語のあてはめを行う手法を提案し さらに本方式の前提となるキーワードの存在とその有効性を確認し その上で本方式を用いた実験システムを開発し その性能評価実験を特定専門分野の学術文献を対象に行った結果について述べたものである.本方式では 最初にキーワードを用いてべた書き文を分割し ついでそのキーワードによるあてはめの拡張を行い 以後カタカナ語のあてはめ 文節端の助詞候補の検出 連接情報を用いた単語のあてはめ 助詞候補の評価 接辞の処理 最後に一字漢字語のあてはめを行う.実験により500語程度のキーワードで平均3?4文字程度にべた書き文が分割できることがわかり 本方式によるべた書き文のかな漢字変換システムを開発した.さらに 工学に関する3分野の学術文献を資料として 性能評価実験を行った結果 90%以上の変換精度が得られることがわかり 本方式の有効性を示すことができた.