著者
財津 亘 金 明哲
出版者
日本行動計量学会
雑誌
行動計量学 (ISSN:03855481)
巻号頁・発行日
vol.45, no.1, pp.39-47, 2018 (Released:2018-11-03)
参考文献数
23

This study examined the accuracy for author identification by text mining. We conducted 16 analyses (four writing styles × four multivariate analyses) across texts of 100 Bloggers, written by approximately 1,000 characters. Specifically, we conducted (1) principal components analysis, (2) correspondence analysis, (3) multi-dimensional scaling, and (4) hierarchical cluster analysis on each writing style: (1) rate of usage of non-independent words, (2) bigram of parts-of-speech, (3) bigram of postpositional particles, and (4) positioning of commas. We obtained high accuracy: 100% on sensitivity and 95.1% on specificity. Furthermore, the results showed no effects of age and gender against accuracy for author identification.

言及状況

外部データベース (DOI)

Twitter (3 users, 3 posts, 5 favorites)

J-STAGE Articles - テキストマイニングによる筆者識別の正確性ならびに判定手続きの標準化 https://t.co/UDZhY4FutQ

収集済み URL リスト