搜索结果: 1-1 共查到“管理学 Restless”相关记录1条 . 查询时间(0.044 秒)
We consider the restless Markov bandit problem, in which the state of each arm evolves according to a Markov process independently of the learner's actions. We suggest an algorithm that after $T$ step...