Detection Method of Homograph Internationalized Domain Names with OCR

doi:10.2197/ipsjjip.27.536

9 0 0 0 OA Detection Method of Homograph Internationalized Domain Names with OCR

著者: Yuta Sawabe Daiki Chiba Mitsuaki Akiyama Shigeki Goto
出版者: 一般社団法人情報処理学会
雑誌: Journal of Information Processing (ISSN:18826652)
巻号頁・発行日: vol.27, pp.536-544, 2019 (Released:2019-09-15)
参考文献数: 29
被引用文献数: 2

Currently, many attacks are targeting legitimate domain names. In homograph attacks, attackers exploit human visual misrecognition, thereby leading users to visit different (fake) sites. These attacks involve the generation of new domain names that appear similar to an existing legitimate domain name by replacing several characters in the legitimate name with others that are visually similar. Specifically, internationalized domain names (IDNs), which may contain non-ASCII characters, can be used to generate/register many similar IDNs (homograph IDNs) for their application as phishing sites. A conventional method of detecting such homograph IDNs uses a predefined mapping between ASCII and similar non-ASCII characters. However, this approach has two major limitations: (1) it cannot detect homograph IDNs comprising characters that are not defined in the mapping and (2) the mapping must be manually updated. Herein, we propose a new method for detecting homograph IDNs using optical character recognition (OCR). By focusing on the idea that homograph IDNs are visually similar to legitimate domain names, we leverage OCR techniques to recognize such similarities automatically. Further, we compare our approach with a conventional method in evaluations employing 3.19 million real (registered) and 10, 000 malicious IDNs. Results reveal that our method can automatically detect homograph IDNs that cannot be detected when using the conventional approach.

2023-03-15 10:15:11
9 + 21 Twitter

言及状況

外部データベース (DOI)

Twitter (9 users, 9 posts, 21 favorites)

GPT-4の画像（画面スクリーンショット）と言語解析を組み合わせれば、IDNホモグラフ攻撃対策が強化できそうシンプルなOCRではなく、前後の文脈や確率に基づいて判定できるのが利点（OCRを用いた対策は下記） CAPTCHAをどの程度解析できるか気になるところ https://t.co/Ca1TQjJxV9

2 @hiroara @__nyoshi__

弊社アナリストの澤部祐太が情報処理学会論文誌 Journal of Information Processing(JIP) に投稿した「Detection Method of Homograph Internationalized Domain Names with OCR」がSpecially Selected Paperに選定されました。論文はこちらからご覧いただけます。https://t.co/2Dz7vGwczb

7 @konannai @fkshom @taka_tana_sec @mo_32_ko @ka0com @you_and_i @bori_so1

17 @0xiso_ @bori_so1 @evoyag @foomur @guaranasour @kiyo1482 @koki310dm @konannai @maple1st @mo_32_ko @ninoseki @s6969 @skunkworks7171 @taku888infinity @TKM_Sec @wavedoor @yoko8man

収集済み URL リスト

https://www.jstage.jst.go.jp/article/ipsjjip/27/0/27_536/_article (9)