著者
中崎 寛之 川場 真理子 横本 大輔 宇津呂 武仁 福原 知宏
出版者
一般社団法人 人工知能学会
雑誌
人工知能学会論文誌 (ISSN:13460714)
巻号頁・発行日
vol.25, no.5, pp.613-622, 2010 (Released:2010-08-06)
参考文献数
12
被引用文献数
1

The overall goal of this paper is to cross-lingually analyze multilingual blogs collected with a topic keyword. The framework of collecting multilingual blogs with a topic keyword is designed as the blog feed retrieval procedure. In this paper, we take an approach of collecting blog feeds rather than blog posts, mainly because we regard the former as a larger information unit in the blogosphere and prefer it as the information source for cross-lingual blog analysis. In the blog feed retrieval procedure, we also regard Wikipedia as a large scale ontological knowledge base for conceptually indexing the blogosphere. The underlying motivation of employing Wikipedia is in linking a knowledge base of well known facts and relatively neutral opinions with rather raw, user generated media like blogs, which include less well known facts and much more radical opinions. In our framework, first, in order to collect candidates of blog feeds for a given query, we use existing Web search engine APIs, which return a ranked list of blog posts, given a topic keyword. Next, we re-rank the list of blog feeds according to the number of hits of the topic keyword as well as closely related terms extracted from the Wikipedia entry in each blog feed. We compare the proposed blog feed retrieval method to existing Web search engine APIs and achieve significant improvement. We then apply the proposed blog distillation framework to the task of cross-lingually analyzing multilingual blogs collected with a topic keyword. Here, we cross-lingually and cross-culturally compare less well known facts and opinions that are closely related to a given topic. Results of cross-lingual blog analysis support the effectiveness of the proposed framework.
著者
阿部 佑亮 中崎 寛之 横本 大輔 宇津呂 武仁 河田 容英 福原 知宏
出版者
人工知能学会
雑誌
人工知能学会全国大会論文集 (ISSN:13479881)
巻号頁・発行日
vol.24, 2010

本研究では,ブログ空間の情報や知識を類型化するための方式の一つとして, 「ブロガーの立場」に着目する.そして,事例研究として,「詐欺」,「イン ターネット犯罪」の分野を対象として,日英ブログサイトの収集を行い,ブロ グでの記述内容を被害者・ニュース記事引用・防止対策に類型化した結果を報 告する.さらに,それらの類型のうち,特に被害者によるブログ記事の自動収 集手法を提案する.