著者
金 明哲
出版者
The Behaviormetric Society of Japan
雑誌
行動計量学 (ISSN:03855481)
巻号頁・発行日
vol.36, no.2, pp.89-103, 2009

In this research, as a basis of studies regarding when certain works were written, an estimation was attempted using the works of Ryunosuke Akutagawa. In the experiment, two types of data sets were created from the text with part-of-speech tagging, and a comparative analysis was performed using three methods: Linear Regression, Support Vector Regression, and Random Forest Regression. As a result, when the works were written was estimated with rather high accuracy. The average of absolute value of estimation error and standard deviation was approximately 1.4 years. The order of high accuracy of estimation was Random Forest Regression, Support Vector Regression, and Linear Regression.

言及状況

外部データベース (DOI)

Twitter (8 users, 8 posts, 11 favorites)

芥川龍之介は自殺する前の作品では助詞の「は」の使用頻度が増えて,「が」の使用頻度が減ったらしい.なんでだろう CiNii 論文 -  文章の執筆時期の推定 —— 芥川龍之介の作品を例として —— https://t.co/kZfTiYVAyV #CiNii

収集済み URL リスト