How can I apply reinforcement learning to continuous action spaces?


Problem description


I'm trying to get an agent to learn the mouse movements necessary to best perform some task in a reinforcement learning setting (i.e. the reward signal is the only feedback for learning).


I'm hoping to use the Q-learning technique, but while I've found a way to extend this method to continuous state spaces, I can't seem to figure out how to accommodate a problem with a continuous action space.


I could just force all mouse movement to be of a certain magnitude and in only a certain number of different directions, but any reasonable way of making the actions discrete would yield a huge action space. Since standard Q-learning requires the agent to evaluate all possible actions, such an approximation doesn't solve the problem in any practical sense.
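To see why discretizing mouse movement scales badly, here is a rough back-of-the-envelope calculation (the specific numbers are illustrative assumptions, not part of the original question):

    # Hypothetical discretization of mouse moves (numbers are assumptions).
    directions = 16          # allowed movement directions
    magnitudes = 10          # allowed step sizes per direction
    coarse_actions = directions * magnitudes
    print(coarse_actions)    # 160 actions -- already large for tabular Q-learning

    # Pixel-precise moves on a 1920x1080 screen, if every (dx, dy) is its own action:
    pixel_actions = 1920 * 1080
    print(pixel_actions)     # 2,073,600 actions to evaluate at every step

Even the coarse grid loses most of the precision of real mouse control, while the pixel-precise version is far too large for a method that must compare every action at every step.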

Recommended answer


The common way of dealing with this problem is with actor-critic methods, which extend naturally to continuous action spaces. Basic Q-learning can diverge when combined with function approximation; however, if you still want to use it, you can try combining it with a self-organizing map, as done in "Applications of the self-organising map to reinforcement learning". That paper also contains some further references you may find useful.
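As a concrete illustration of the actor-critic idea, below is a minimal sketch of a one-step actor-critic with a Gaussian policy over a continuous 2-D action. The toy cursor task, linear/quadratic features, and hyperparameters are my own assumptions for illustration; they are not from the answer or the referenced paper.

    # Minimal one-step actor-critic sketch for a continuous 2-D action
    # (hypothetical toy task: move the cursor toward a target point).
    import numpy as np

    rng = np.random.default_rng(0)

    SIGMA = 0.2          # fixed std of the Gaussian policy (exploration noise)
    ALPHA_ACTOR = 1e-3   # actor step size
    ALPHA_CRITIC = 1e-2  # critic step size
    GAMMA = 0.95         # discount factor

    def features(s):
        # simple quadratic critic features; enough to represent a -||s||^2-shaped value
        return np.array([s[0] * s[0], s[1] * s[1], 1.0])

    W = np.zeros((2, 2))   # actor: mean action = W @ s
    w = np.zeros(3)        # critic: V(s) = w @ features(s)

    for episode in range(2000):
        s = rng.uniform(-1.0, 1.0, size=2)             # random initial offset from the target
        for t in range(20):
            mu = W @ s
            a = mu + SIGMA * rng.standard_normal(2)    # sample a continuous 2-D action
            s_next = s + a                             # cursor moves by the chosen displacement
            r = -np.dot(s_next, s_next)                # reward: negative squared distance to target
            done = np.dot(s_next, s_next) < 1e-3

            # TD error doubles as the advantage estimate
            v_next = 0.0 if done else w @ features(s_next)
            delta = r + GAMMA * v_next - w @ features(s)

            # critic: semi-gradient TD(0) update
            w += ALPHA_CRITIC * delta * features(s)

            # actor: policy-gradient step; grad of log N(a; mu, sigma^2 I) w.r.t. mu is (a - mu) / sigma^2
            grad_log_pi = np.outer((a - mu) / SIGMA**2, s)
            W += ALPHA_ACTOR * delta * grad_log_pi

            s = s_next
            if done:
                break

The key point is that the actor outputs the parameters of a continuous distribution (here, the mean of a Gaussian), so no enumeration of discrete actions is ever needed; the critic's TD error serves as the learning signal for both updates.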

