著者
浅田 稔 野田 彰一 俵積田 健 細田 耕
出版者
The Robotics Society of Japan
雑誌
日本ロボット学会誌 (ISSN:02891824)
巻号頁・発行日
vol.13, no.1, pp.68-74, 1995-01-15
被引用文献数
49 18

We propose a method which acquires a purposive behavior for a mobile robot to shoot a ball into the goal by using the Q-learning, one of the reinforcement learning methods. A mobile robot (an agent) does not need to know any parameters of the 3-D environment or its kinematics/dynamics. Information about the changes of the environment is only the image captured from a single TV camera mounted on the robot. Image positions of the ball and the goal are used as a state variable which shows the effect of an action taken by the robot during the learning process. After the learning process, the robot tries to carry a ball near the goal and to shoot it. Computer simulation is used not only to check the validity of the method but also to save the learning time on the real robot. The real robot succeeded in shooting a ball into the goal using the learned policy transferred to it.

言及状況

外部データベース (DOI)

Twitter (2 users, 5 posts, 1 favorites)

こんな論文どうですか? 視覚に基づく強化学習によるロボットの行動獲得(浅田 稔ほか),1995 http://t.co/Cq5w1r2bYf
こんな論文どうですか? 視覚に基づく強化学習によるロボットの行動獲得(浅田 稔ほか),1995 http://t.co/Fy1nFIaO
こんな論文どうですか? 視覚に基づく強化学習によるロボットの行動獲得(浅田 稔ほか),1995 http://t.co/Fy1nFIaO
こんな論文どうですか? 視覚に基づく強化学習によるロボットの行動獲得(浅田 稔ほか),1995 http://t.co/fJ71RTE

収集済み URL リスト