通过可变动作进行强化学习 [英] Reinforcement Learning With Variable Actions

查看：170 发布时间：2020/5/4 9:35:27 machine-learning reinforcement-learning planning

本文介绍了通过可变动作进行强化学习的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

所有强化学习算法通常都应用于具有固定数量的单个代理动作.是否有任何强化学习算法可在考虑可变数量的动作的同时做出决定?例如，您如何在玩家控制N名士兵，并且每个士兵根据其状况随机选择动作的计算机游戏中应用RL算法?您无法为全球决策者(即将军")制定固定数量的行动，因为随着士兵的创造和死亡，可用的行动不断变化.而且您不能在士兵级别制定固定数量的动作，因为士兵的动作是根据其直接环境而定的.如果一个士兵没有看到对手，那么它可能只能走路，而如果看到10个对手，那么它将有10种可能的新动作，攻击10个对手中的1个.

All the reinforcement learning algorithms I've read about are usually applied to a single agent that has a fixed number of actions. Are there any reinforcement learning algorithms for making a decision while taking into account a variable number of actions? For example, how would you apply a RL algorithm in a computer game where a player controls N soldiers, and each soldier has a random number of actions based its condition? You can't formulate fixed number of actions for a global decision maker (i.e. "the general") because the available actions are continually changing as soldiers are created and killed. And you can't formulate a fixed number of actions at the soldier level, since the soldier's actions are conditional based on its immediate environment. If a soldier sees no opponents, then it might only be able to walk, whereas if it sees 10 opponents, then it has 10 new possible actions, attacking 1 of the 10 opponents.

通过可变动作进行强化学习 [英] Reinforcement Learning With Variable Actions

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录关闭

通过可变动作进行强化学习 [英] Reinforcement Learning With Variable Actions

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录 关闭

登录关闭