著者
Ihor Lubashevsky Kosuke Hijikata
出版者
システム制御情報学会ストカスティックシステムシンポジウム
雑誌
Proceedings of the ISCIE International Symposium on Stochastic Systems Theory and its Applications (ISSN:21884730)
巻号頁・発行日
vol.2017, pp.190-196, 2017-05-31 (Released:2017-11-01)
参考文献数
16

Within the paradigm of human intermittent control over unstable systems human behavior admits the interpretation as a sequence of point-like moments when the operator makes decision on activating or deactivating the control. These decision-making events are assumed to be governed by the information about the state of system under control which the operator accumulates continuously. In the present work we propose the concept of reinforcement learning with decision inertia (the status quo bias) that opens a gate to applying the formalism of reinforcement learning to describing human intermittent control. The basic feature of such reinforcement learning is that human behavior in a sequence of selecting available options exhibits quasicontinuous dynamics. Numerical simulation based on a fairly simple model demonstrates that the proposed formalism does possess the required properties of quasicontinuous behavior.