方略学科导航

搜索结果: 1-10 共查到“统计学 reinforcement”相关记录10条 . 查询时间(0.062 秒)

Reinforcement Learning for the Soccer Dribbling Task Reinforcement Learning Soccer Dribbling Task 2013/6/17

We propose a reinforcement learning solution to the \emph{soccer dribbling task}, a scenario in which a soccer agent has to go from the beginning to the end of a region keeping possession of the ball,...

存档附件原文地址

Cover Tree Bayesian Reinforcement Learning Cover Tree Bayesian Learning 2013/6/14

This paper proposes an online tree-based Bayesian approach for reinforcement learning. For inference, we employ a generalised context tree model. This defines a distribution on multivariate Gaussian p...

存档附件原文地址

Regret Bounds for Reinforcement Learning with Policy Advice Regret Bounds Reinforcement LearningPolicy Advice 2013/6/13

In some reinforcement learning problems an agent may be provided with a set of input policies, perhaps learned from prior experience or provided by advisors. We present a reinforcement learning with p...

存档附件原文地址

ABC Reinforcement Learning ABC Reinforcement Learning 2013/4/28

This paper introduces a simple, general framework for likelihood-free Bayesian reinforcement learning, through Approximate Bayesian Computation (ABC). The main advantage is that we only require a prio...

存档附件原文地址

Efficient Reinforcement Learning for High Dimensional Linear Quadratic Systems Efficient Reinforcement Learning High Dimensional Linear Quadratic Systems 2013/4/28

We study the problem of adaptive control of a high dimensional linear quadratic (LQ) system. Previous work established the asymptotic convergence to an optimal controller for various adaptive control ...

存档附件原文地址

A Greedy Approximation of Bayesian Reinforcement Learning with Probably Optimistic Transition Model Reinforcement Learning Uncertain Knowledge Probabilistic Reasoning Optimal Behavior in Polynomial Time 2013/5/2

Bayesian Reinforcement Learning (RL) is capable of not only incorporating domain knowledge, but also solving the exploration-exploitation dilemma in a natural way. As Bayesian RL is intractable except...

存档附件原文地址

Monte-Carlo utility estimates for Bayesian reinforcement learning Monte-Carlo estimates Bayesian reinforcement learning 2013/5/2

This paper introduces a set of algorithms for Monte-Carlo Bayesian reinforcement learning. Firstly, Monte-Carlo estimation of upper bounds on the Bayes-optimal value function is employed to construct ...

存档附件原文地址

Bayesian multitask inverse reinforcement learning Bayesian inference multitask learning inverse reinforce-ment learning 2011/7/6

We generalise the problem of inverse reinforcement learning to multiple tasks, from a set of demonstrations. Each demonstration may represent one expert trying to solve a different task.

存档附件原文地址

Optimal Reinforcement Learning for Gaussian Systems Optimal Reinforcement Learning Gaussian Systems 2011/7/5

The exploration-exploitation tradeoff is among the central challenges of reinforcement learning. A hypothetical exact Bayesian learner would provide the optimal solution, but is intractable in general...

存档附件原文地址

A survey of random processes with reinforcement urn model urn scheme Pólya’s urn stochastic approximation dynamical system exchangeability Lyapunov function reinforced random walk ERRW VRRW learning agent-based model evolutionary game theory self-avoiding walk 2009/5/18

The models surveyed include generalized Polya urns, reinforced random walks, interacting urn models, and continuous reinforced processes. Emphasis is on methods and results, with sketches provided of ...

存档附件原文地址

中国研究生教育排行榜-条

正在加载...

中国学术期刊排行榜-条

正在加载...

世界大学科研机构排行榜-条

正在加载...

中国大学排行榜-条

正在加载...

人　物-篇

正在加载...

课　件-篇

正在加载...

视听资料-篇

正在加载...

研招资料 -篇

正在加载...

知识要闻-篇

正在加载...

国际动态-篇

正在加载...

会议中心-篇

正在加载...

学术指南-篇

正在加载...

学术站点-篇

正在加载...

中国研究生教育排行榜-条

中国学术期刊排行榜-条

世界大学科研机构排行榜-条

中国大学排行榜-条

人 物-篇

课 件-篇

视听资料-篇

知识库-篇

研招资料 -篇

知识要闻-篇

国际动态-篇

会议中心-篇

学术指南-篇

学术站点-篇

人　物-篇

课　件-篇