An Adaptive State Space Construction Method for Reinforcement Learning (強化学習における適応的状態空間構成法)

doi:10.3902/jnns.6.144

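Authors: Kazuyuki Samejima (鮫島和行), Takashi Omori (大森隆司)
Publisher: Japanese Neural Network Society (日本神経回路学会)
Journal: Journal of the Japanese Neural Network Society (日本神経回路学会誌) (ISSN: 1340-766X)
Volume, issue, pages, and publication date: vol. 6, no. 3, pp. 144-154, 1999-09-05 (Released: 2011-01-17)
Number of references: 21
Number of citing works: 4 6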

To apply reinforcement learning to real-world problems, an internal state space has to be constructed from a high-dimensional observation space. The algorithm presented here constructs the internal state space while learning desirable actions, and assigns local basis functions adaptively according to the task requirements. The internal state space initially has a single basis function covering the entire observation space, and that basis is progressively divided into smaller ones based on the statistical properties of the locally weighted temporal difference error. The algorithm was applied to a collision avoidance problem for an autonomous robot, and its evaluation showed, for instance, that it needs fewer basis functions than other methods.
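The abstract gives only the outline of the algorithm, so the following is a minimal sketch of the general idea, not the authors' implementation. It assumes a one-dimensional observation space, normalized Gaussian basis functions, a TD(0) value update, and a split rule that fires when the activation-weighted variance of the TD error under a basis exceeds a threshold. The names `Basis`, `td_update`, and `split_if_needed`, and the parameters `var_threshold` and `min_samples`, are hypothetical and do not come from the paper.

```python
import math


class Basis:
    """Gaussian basis with running, activation-weighted TD-error statistics."""

    def __init__(self, center, width):
        self.center = center
        self.width = width
        self.weight = 0.0    # contribution of this basis to the value estimate
        self.sum_w = 0.0     # accumulated normalized activation
        self.sum_wd = 0.0    # activation-weighted sum of TD errors
        self.sum_wd2 = 0.0   # activation-weighted sum of squared TD errors

    def activation(self, x):
        return math.exp(-0.5 * ((x - self.center) / self.width) ** 2)

    def td_variance(self):
        """Locally weighted variance of the TD error seen by this basis."""
        if self.sum_w == 0.0:
            return 0.0
        mean = self.sum_wd / self.sum_w
        return max(self.sum_wd2 / self.sum_w - mean * mean, 0.0)


def normalized_activations(bases, x):
    acts = [b.activation(x) for b in bases]
    total = sum(acts) or 1.0  # guard against numerical underflow to zero
    return [a / total for a in acts]


def value(bases, x):
    acts = normalized_activations(bases, x)
    return sum(a * b.weight for a, b in zip(acts, bases))


def td_update(bases, x, reward, x_next, alpha=0.1, gamma=0.9):
    """One TD(0) step that also accumulates the per-basis error statistics."""
    delta = reward + gamma * value(bases, x_next) - value(bases, x)
    for a, b in zip(normalized_activations(bases, x), bases):
        b.weight += alpha * delta * a
        b.sum_w += a
        b.sum_wd += a * delta
        b.sum_wd2 += a * delta * delta
    return delta


def split_if_needed(bases, var_threshold=0.05, min_samples=20.0):
    """Replace each high-variance basis by two narrower children.

    The variance threshold is an assumed criterion standing in for the
    statistical test used in the paper, which the abstract does not specify.
    """
    result = []
    for b in bases:
        if b.sum_w >= min_samples and b.td_variance() > var_threshold:
            half = b.width / 2.0
            for c in (b.center - half, b.center + half):
                child = Basis(c, half)
                child.weight = b.weight  # children inherit the parent's value
                result.append(child)
        else:
            result.append(b)
    return result
```

Starting from `bases = [Basis(0.5, 0.5)]`, a learner would call `td_update` on each observed transition and `split_if_needed` periodically, so that basis functions multiply only in regions where a single basis cannot explain the TD error.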


Mentions

External databases (DOI)

Twitter (3 users, 4 posts, 3 favorites)

@kosukesa https://t.co/cvpazqShMN It's in Japanese.

1 @KikumotoAtsushi

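So, it turns out that what I was doing 20 years ago, playing with toys while trying to apply reinforcement learning to real-world problems, has evolved quite a bit since then (shameless plug) https://t.co/cvpazqShMN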

1 @shigejisoga

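Since it relates to the agenda of today's meeting, I am rereading an old paper by Prof. @KazuSamejima; this research is interesting because it might also connect to an explanatory concept for ASD https://t.co/byQgu3lVDN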

2 @sion519 @terashimahiroki


List of collected URLs