著者
Xuping Huang Nobutaka Ono Akira Nishimura Isao Echizen
出版者
Information Processing Society of Japan
雑誌
Journal of Information Processing (ISSN:18826652)
巻号頁・発行日
vol.25, pp.469-476, 2017 (Released:2017-07-15)
参考文献数
28
被引用文献数
4

Reversible audio information hiding and sample-scanning methods are proposed for digital audio content to achieve detailed detection and localization of tampered positions in each frame. The method proposed in this study allows detecting multiple tampering and reusing reliable content as well as avoiding false detection which were impossible for other methods to simultaneously achieve. In the proposed method, the original signal is partitioned into fixed-length frames and then transformed into discrete cosine transform (DCT) coefficients by the integer modified DCT (intDCT). Expansion of the DCT coefficients is applied to embed a content-based hash as a payload. The integer DCT algorithm ensures the reversibility of the transform so that the original data and embedded payload can be perfectly restored to enable blind verification of the data integrity. The perceptual evaluation of speech quality (PESQ) with the listening quality objective mean opinion (MOSLQO), the segmental signal to noise ratio (segSNR), and subjective evaluation results show that the proposed algorithm provides good sound quality (MOSLQO and segSNR are respectively 4.41 and 23.31dB on average for a capacity of 8, 000bps). Detection and localization are accurate in terms of correctly localizing tampered frames in case of insertion or deletion.

言及状況

外部データベース (DOI)

Twitter (8 users, 9 posts, 25 favorites)

音声電子透かしを用いた改竄検出手法の研究紹介ビデオをアップロードしました. https://t.co/TWiI31PKXd 整数離散コサイン変換に基づいて高周波数成分を拡張し,ペイロードを埋め込む手法です. 2012年楽天Tech ConfのLTで話した内容で,かなりの簡潔版 論文は↓ https://t.co/ftt8ZAwu2n
今日のTLの8割がフーリエ変換の話しで、某読み会で音声の周波数変換が話題になったのか 音声データの周波数が高くなるにつれて振幅が低くなる特徴がある。応用としては、高周波数領域のDCT係数の拡張による電子透かし手法が下記より提案された https://t.co/8Ucg2u85z0 第一著者とは超親しいです

収集済み URL リスト