- 著者
-
植田 聡史
伊藤 琢博
坂井 真一郎
- 出版者
- 公益社団法人 計測自動制御学会
- 雑誌
- 計測自動制御学会論文集 (ISSN:04534654)
- 巻号頁・発行日
- vol.58, no.3, pp.194-201, 2022 (Released:2022-04-07)
- 参考文献数
- 12
Space missions require “resilience” to flexibly complete the mission in response to changes in the environment and system characteristics. The present study proposes a method for autonomously planning a corrective control law for lunar landing trajectory control to cope with off-nominal conditions and reflecting it in resilience improvement measures by utilizing reinforcement learning. The proposed method employs a reinforcement learning problem in which an agent is additionally placed in the control loop and the corrective control input as an action output by the agent is added to the original closed-loop control input. The results and insights are summarized for the resultant agent's characteristics which autonomously detect off-nominal conditions and proactively implement recovery measures, while verifying the capability and effectiveness of the proposed design framework enabled by a reinforcement learning architecture in a realistic and specific lunar landing sequence.