文献一覧: 人工知能 (雑誌)

1 0 0 0 OA 「ロボットは東大に入れるか」歴史科目の自動解答

著者: 狩野芳伸|川添愛|渋木英潔|藤田彬
雑誌: 人工知能
巻号頁・発行日: vol.31, no.6, 2016-11-01

2022-03-03 00:03:02
1 + 0 Twitter

http://id.nii.ac.jp/1004/00008656/

1 0 0 0 OA MarcoPolo : 報酬獲得と環境同定のトレードオフを考慮した強化学習システム

著者: 宮崎和光山村雅幸小林重信
出版者: 一般社団法人人工知能学会
雑誌: 人工知能 (ISSN:21882266)
巻号頁・発行日: vol.12, no.1, pp.78-89, 1997-01-01 (Released:2020-09-29)

Reinforcement learning is a kind of machine learning. It aims to adapt an agent to a given environment with a clue to rewards. Profit sharing (PS) can get rewards efficiently at an initial learning phase. However, it can not always learn an optimum policy that maximizes rewards per an action. Though Q-learning is guaranteed to obtain an optimum policy, it needs numerous trials to learn it. On Markov decision processes (MDPs), if a correct environment model is identified, we can derive an optimum policy by applying Policy Iteration Algorithm (PIA). As an efficient method for identifying MDPs, k-Certainty Exploration Method has been proposed. We consider that ideal reinforcement learning systems are to get some rewards even at an initial learning phase and to get mere rewards as the identification of environments proceeds. In this paper, we propose a unified learning system : MarcoPolo which considers both getting rewards by PS or PIA and identifying the environment by k-Certainty Exploration Method. MarcoPolo can realize any tradeoff between exploitation and exploration through the whole learning process. By applying MarcoPolo to an example, its basic performance is shown. Moreover, by applying it to Sutton's maze problem and its modified version, its feasibility on more realistic domains is shown.

2022-02-22 18:39:07
1 + 0 Twitter

1 0 0 0 OA 行為の同型性に基づく強化学習法

著者: 山口智浩野村勇治田中康祐谷内田正彦
出版者: 一般社団法人人工知能学会
雑誌: 人工知能 (ISSN:21882266)
巻号頁・発行日: vol.12, no.6, pp.870-880, 1997-11-01 (Released:2020-09-29)

The advantage of emergence is that various solutions are emerged. However, it takes large computation cost to emerge them since it requires the numbers of iterations of simulation. So we try to reduces the computation cost without losing variety of solutions by introducing the abstraction technique in Artificial Intelligence. This paper presents Isomorphism Based Reinforcement Learning by Isomorphism of Actions that reduces the learning cost without losing variety of solutions. Isomorphism is one of the concepts in Enumerative Combinatorics of mathematics. First we explain Isomorphism of Actions, then explain Isomorphism of Behaviors. Isomorphic behaviors those perform the same task can be obtained by transforming the learning result of the task by "the appropriate permutation". However, a priori knowledge that represents "the appropriate permutation" is not always given, so this paper uses the generate & test method that first generates the isomorphic learning results by transforming the learning result of reinforcement learning for a task by the combinatorial permutations, then tests to select two kinds of the behaviors performing the following tasks ; (1) isomorphic behaviors those perform the same task ; (2) discovery of the behaviors those are converged to the new task state. Since the acquired learning results are isomorphic each other, the merits of our method are those the time cost for generating various learning results is small and also the space cost is small too because it needs only the original learning result and the set of permutations for it. For these reasons, this method is significant for realizing the learning various behaviors for the dynamic environment or multiagent.

2022-02-21 22:27:09
1 + 2 Twitter

1 0 0 0 OA 人工知能の倫理と社会

著者: 村上祐子
出版者: 一般社団法人人工知能学会
雑誌: 人工知能 (ISSN:21882266)
巻号頁・発行日: vol.34, no.2, pp.176-181, 2019-03-01 (Released:2020-09-29)

2022-02-14 11:41:34
1 + 0 Twitter

1 0 0 0 OA 言外の意味のコミュニケーション : 語用論概説(<シリーズ>AI研究者が学ぶ言語学の新展開(第1回))

著者: 内海彰
出版者: 一般社団法人人工知能学会
雑誌: 人工知能 (ISSN:21882266)
巻号頁・発行日: vol.18, no.3, pp.337-345, 2003-05-01 (Released:2020-09-29)

2022-02-13 15:02:02
1 + 0 Twitter

1 0 0 0 OA 博士論文アブストラクト

出版者: 一般社団法人人工知能学会
雑誌: 人工知能 (ISSN:21882266)
巻号頁・発行日: vol.37, no.1, pp.57-67, 2022-01-01 (Released:2022-01-01)

2022-02-01 14:04:29
1 + 0 Twitter

1 0 0 0 レクチャーシリーズ:「人工知能の今」〔第11 回〕AI 倫理指針における課題

著者: 中川裕志
出版者: 一般社団法人人工知能学会
雑誌: 人工知能 (ISSN:21882266)
巻号頁・発行日: vol.35, no.6, pp.845-854, 2020-11-01 (Released:2020-11-02)

2022-01-21 23:39:20
1 + 0 Twitter

1 0 0 0 OA 機械学習の解釈性

著者: 増井紀貞
雑誌: 人工知能
巻号頁・発行日: vol.33, 2018-09-01

2022-01-14 04:29:56
1 + 0 Twitter

http://id.nii.ac.jp/1004/00009334/

1 0 0 0 OA アーティクル:表紙解説「—人工生命との遭遇— 」

著者: 井上昂治
出版者: 一般社団法人人工知能学会
雑誌: 人工知能 (ISSN:21882266)
巻号頁・発行日: vol.37, no.1, pp.111-112, 2022-01-01 (Released:2022-01-01)

2022-01-04 11:36:58
1 + 10 Twitter

1 0 0 0 人間社会の複雑現象の構成論的モデル

著者: 笹原和俊
出版者: 一般社団法人人工知能学会
雑誌: 人工知能 (ISSN:21882266)
巻号頁・発行日: vol.37, no.1, pp.50-55, 2022-01-01 (Released:2022-01-01)

2022-01-02 01:38:35
1 + 5 Twitter

1 0 0 0 OA 汎用エージェントの理論的枠組み : Marcus Hutterが提唱するAIXIの紹介(<特集>汎用人工知能(AGI)への招待)

著者: 小林亮太相澤彰子
出版者: 一般社団法人人工知能学会
雑誌: 人工知能 (ISSN:21882266)
巻号頁・発行日: vol.29, no.3, pp.234-238, 2014-05-01 (Released:2020-09-29)

2021-12-26 14:45:06
1 + 0 Twitter

1 0 0 0 OA システムの研究に魅せられて : 分散OSからユビキタスコンピューティングシステムへ(<レクチャーシリーズ>つながりが創発するイノベーション〔第2回〕)

著者: 徳田英幸
出版者: 一般社団法人人工知能学会
雑誌: 人工知能 (ISSN:21882266)
巻号頁・発行日: vol.30, no.4, pp.532-538, 2015-07-01 (Released:2020-09-29)

2021-12-10 20:51:04
1 + 0 Twitter

1 0 0 0 ディジタル史料批判と歴史学における新発見

著者: 西村陽子|北本朝展北本朝展
出版者: 一般社団法人人工知能学会
雑誌: 人工知能 (ISSN:21882266)
巻号頁・発行日: vol.31, no.6, pp.769-774, 2016-11-01

2021-12-06 16:35:37
1 + 0 Twitter

1 0 0 0 OA 一般化状態空間モデルと自己組織化の方法(<論文特集>「情報論的学習理論(IBIS2000)」)

著者: 北川源四郎
出版者: 一般社団法人人工知能学会
雑誌: 人工知能 (ISSN:21882266)
巻号頁・発行日: vol.16, no.2, pp.300-307, 2001-03-01 (Released:2020-09-29)

For automatic extraction of essential information and discovery from massive time series, it is necessary to develop a method which is flexible enough to handle actual phenomena in real world.That can be achieved by the use of general state space model, and it provides us with a unified tool for analyzing complex time series.To apply these general state space models, development of practical filtering and smoothing algorithms is indispensable.In this article, the non-Gaussian filter/smooother, Monte Carlo filter/smoother and self-organizing state space model are shown.As applications of the method, problems of detecting sudden changes of the trend and nonlinear smoothing are shown.