Authors
Tetsuji Kuboyama, Kouichi Hirata, Hisashi Kashima, Kiyoko F. Aoki-Kinoshita, Hiroshi Yasuda
Publisher
The Japanese Society for Artificial Intelligence
Journal
Transactions of the Japanese Society for Artificial Intelligence (ISSN:13460714)
Volume, issue, pages, and publication date
vol.22, no.2, pp.140-147, 2007 (Released:2007-01-25)
Number of references
17
Number of citations
5 11 27

Learning from tree-structured data has received increasing interest with the rapid growth of tree-encodable data on the World Wide Web, in biology, and in other areas. Our kernel function measures the similarity between two trees by counting the number of shared sub-patterns called tree q-grams, and runs, in effect, in time linear in the number of tree nodes. We apply our kernel function with a support vector machine (SVM) to classify biological data, namely the glycans of several blood components. The experimental results show that our kernel function performs as well as a kernel tailored exclusively to glycan properties.
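The idea of counting shared sub-patterns can be illustrated with a minimal sketch. It assumes, as a simplification, that a q-gram is a downward path of q node labels; the paper's tree q-grams may be defined differently. The tree representation (a `(label, children)` pair) and the function names `path_qgrams` and `qgram_kernel` are hypothetical and chosen only for this illustration.

```python
from collections import Counter

# Assumption: a tree is a (label, [children]) pair, and a "q-gram" is a
# downward path of exactly q node labels. This is a simplified stand-in,
# not necessarily the paper's definition of tree q-grams.


def path_qgrams(tree, q):
    """Count every downward label path of exactly q nodes in the tree."""
    counts = Counter()

    def visit(node, ancestors):
        label, children = node
        # Keep only the last q labels of the path ending at this node.
        path = (ancestors + (label,))[-q:]
        if len(path) == q:
            counts[path] += 1
        for child in children:
            visit(child, path)

    visit(tree, ())
    return counts


def qgram_kernel(t1, t2, q=2):
    """Spectrum-style kernel: sum over shared q-grams of the count products."""
    c1, c2 = path_qgrams(t1, q), path_qgrams(t2, q)
    return sum(n * c2[gram] for gram, n in c1.items())


t1 = ("a", [("b", []), ("c", [("b", [])])])
t2 = ("a", [("b", []), ("b", [])])
print(qgram_kernel(t1, t2, q=2))
```

For a fixed q, each node is visited once and does a constant amount of work, which is one way a kernel of this kind can stay linear in the number of nodes.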
Authors
Shin-ichi Minato, Kimihito Ito
Publisher
The Japanese Society for Artificial Intelligence
Journal
Transactions of the Japanese Society for Artificial Intelligence (ISSN:13460714)
Volume, issue, pages, and publication date
vol.22, no.2, pp.156-164, 2007 (Released:2007-01-25)
Number of references
17
Number of citations
2 4

In this paper, we present a method for finding symmetric items in a combinatorial item set database. Techniques for finding symmetric variables in Boolean functions have long been studied in the area of VLSI logic design, and BDD (Binary Decision Diagram)-based methods have been presented to solve this problem. Recently, we developed an efficient method for handling databases using ZBDDs (Zero-suppressed BDDs), a particular type of BDD. In our ZBDD-based data structure, symmetric item sets can be found efficiently, as is the case for Boolean functions. We implemented a program for symmetric item set mining and applied it to actual biological data on the amino acid sequences of influenza viruses. We found a number of symmetric items in the database, some of which indicate interesting relationships in the amino acid mutation patterns. The result shows that our method is helpful for extracting interesting hidden information from real-life databases.
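What it means for two items to be symmetric can be shown with a brute-force sketch. It assumes, by analogy with symmetric variables of Boolean functions, that items a and b are symmetric when exchanging them in every item set leaves the collection of item sets unchanged, and it treats the database as a set of distinct item sets. It uses plain Python sets rather than ZBDDs, so it does not reflect the paper's algorithm or its efficiency; all function names are hypothetical.

```python
from itertools import combinations

# Assumption: items a and b are "symmetric" if swapping them in every item
# set maps the database (viewed as a set of distinct item sets) onto itself.
# Brute force with Python sets; this is not the ZBDD-based method.


def swap_items(itemset, a, b):
    """Return the item set with items a and b exchanged."""
    swapped = set()
    for item in itemset:
        if item == a:
            swapped.add(b)
        elif item == b:
            swapped.add(a)
        else:
            swapped.add(item)
    return frozenset(swapped)


def is_symmetric(database, a, b):
    """Check whether exchanging a and b leaves the database unchanged."""
    family = {frozenset(s) for s in database}
    return {swap_items(s, a, b) for s in family} == family


def symmetric_pairs(database):
    """List all symmetric item pairs by testing every pair of items."""
    items = sorted(set().union(*database))
    return [(a, b) for a, b in combinations(items, 2)
            if is_symmetric(database, a, b)]


db = [{"a", "c"}, {"b", "c"}, {"a", "b"}, {"c"}]
print(symmetric_pairs(db))
```

In this made-up database only "a" and "b" are interchangeable: swapping "a" with "c" would turn the item set {"c"} into {"a"}, which is not present.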
Authors
Shin-ichi Minato, Hiroki Arimura
Publisher
The Japanese Society for Artificial Intelligence
Journal
Transactions of the Japanese Society for Artificial Intelligence (ISSN:13460714)
Volume, issue, pages, and publication date
vol.22, no.2, pp.165-172, 2007 (Released:2007-01-25)
Number of references
11
Number of citations
1 5 9

Frequent item set mining is one of the fundamental techniques for knowledge discovery and data mining. In the last decade, a number of efficient algorithms for frequent item set mining have been presented, but most of them focused only on enumerating the item set patterns that satisfy the given conditions; how to store and index the resulting patterns for efficient data analysis was treated as a separate matter. Recently, we proposed a fast algorithm for extracting all frequent item set patterns from transaction databases and simultaneously indexing the huge number of resulting patterns using Zero-suppressed BDDs (ZBDDs). That method, ZBDD-growth, not only enumerates the patterns efficiently but also indexes the output compactly in memory so that it can be analyzed with various algebraic operations. In this paper, we present a variation of the ZBDD-growth algorithm that generates frequent closed item sets. It is quite a simple modification of ZBDD-growth, and the additional computation cost is relatively small compared with the original algorithm for generating all patterns. Our method can conveniently be used in an environment of ZBDD-based pattern indexing.
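The difference between all frequent item sets and frequent closed item sets can be seen in a small brute-force sketch. It uses the standard definition that a frequent item set is closed when no proper superset has the same support. It enumerates candidates exhaustively with plain Python sets, so it is not ZBDD-growth and only suits tiny inputs; the function names and the sample transactions are invented for illustration.

```python
from itertools import combinations

# Brute-force illustration only; this is not the ZBDD-growth algorithm.
# Transactions are assumed to be Python sets of hashable items.


def support(itemset, transactions):
    """Number of transactions that contain every item of the item set."""
    return sum(1 for t in transactions if itemset <= t)


def frequent_itemsets(transactions, min_support):
    """Map each non-empty item set with support >= min_support to its support."""
    items = sorted(set().union(*transactions))
    result = {}
    for size in range(1, len(items) + 1):
        for combo in combinations(items, size):
            candidate = frozenset(combo)
            count = support(candidate, transactions)
            if count >= min_support:
                result[candidate] = count
    return result


def closed_itemsets(frequent):
    """Keep the item sets that have no proper superset with equal support."""
    return {
        s: c for s, c in frequent.items()
        if not any(s < other and c == oc for other, oc in frequent.items())
    }


transactions = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"a"}]
frequent = frequent_itemsets(transactions, min_support=2)
closed = closed_itemsets(frequent)
print(len(frequent), len(closed))
```

Checking only frequent supersets is sufficient, because any superset with the same support as a frequent item set is itself frequent. In this example five frequent item sets reduce to three closed ones, which hints at why closed item sets give a more compact result.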
Authors
Nozomi Kobayashi, Kentaro Inui, Yuji Matsumoto
Publisher
The Japanese Society for Artificial Intelligence
Journal
Transactions of the Japanese Society for Artificial Intelligence (ISSN:13460714)
Volume, issue, pages, and publication date
vol.22, no.2, pp.227-238, 2007 (Released:2007-01-25)
Number of references
19
Number of citations
3 17 38

The task of opinion extraction and structurization is the key component of opinion mining, which allows Web users to retrieve and summarize people's opinions scattered over the Internet. Our aim is to develop a method for extracting opinions that represent evaluations of consumer products in a structured form. To achieve this goal, we need to consider two issues relevant to the extraction task: how the task of opinion extraction and structurization should be designed, and how to extract the opinions as we define them. We define an opinion unit as a quadruple consisting of the opinion holder, the subject being evaluated, the part or attribute with respect to which it is evaluated, and the evaluation that expresses a positive or negative assessment. In this task, we focus on two subtasks: (a) extracting subject/aspect-evaluation relations, and (b) extracting subject/aspect-aspect relations. We approach each extraction subtask using a machine learning-based method. In this paper, we discuss how customer reviews in web documents can best be structured. We also report on the results of our experiments and discuss future directions.
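The opinion unit described above can be pictured as a small record type. The sketch below is one possible encoding, not the authors' data format: the class name `OpinionUnit`, the field names, the optional fields, and the example sentence are all assumptions made for illustration.

```python
from dataclasses import astuple, dataclass
from typing import Optional

# Hypothetical encoding of the quadruple from the abstract; the class name,
# field names, and the choice of optional fields are assumptions.


@dataclass(frozen=True)
class OpinionUnit:
    holder: Optional[str]   # who expresses the opinion, if stated
    subject: str            # the product or entity being evaluated
    aspect: Optional[str]   # the part or attribute being evaluated, if any
    evaluation: str         # the expression carrying the assessment


# Invented example sentence:
# "I think the battery life of this camera is excellent."
unit = OpinionUnit(
    holder="writer",
    subject="this camera",
    aspect="battery life",
    evaluation="excellent",
)
print(astuple(unit))
```

Under this encoding, subtask (a) would link `evaluation` to a `subject` or `aspect`, and subtask (b) would link an `aspect` to its `subject` or to a parent aspect.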