Implementations of Hierarchical Reinforcement Learning


Question

Can anyone recommend a reinforcement learning library or framework that can handle large state spaces by abstracting them?

I'm attempting to implement the intelligence for a small agent in a game world. The agent is represented by a small two-wheeled robot that can move forward and backward and turn left and right. It has a pair of sensors for detecting a boundary on the ground, a pair of ultrasonic sensors for detecting objects far away, and a pair of bump sensors for detecting contact with an object or opponent. It can also do some simple dead reckoning to estimate its position in the world, using its starting position as a reference. So all the state features available to it are:

edge_detected=0|1
edge_left=0|1
edge_right=0|1
edge_both=0|1
sonar_detected=0|1
sonar_left=0|1
sonar_left_dist=near|far|very_far
sonar_right=0|1
sonar_right_dist=near|far|very_far
sonar_both=0|1
contact_detected=0|1
contact_left=0|1
contact_right=0|1
contact_both=0|1
estimated_distance_from_edge_in_front=near|far|very_far
estimated_distance_from_edge_in_back=near|far|very_far
estimated_distance_from_edge_to_left=near|far|very_far
estimated_distance_from_edge_to_right=near|far|very_far
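
For concreteness, one full observation under this scheme could be bundled as a plain Python dict (a hypothetical encoding that just restates the feature list above; the particular values are arbitrary):

state = {
    "edge_detected": 1, "edge_left": 1, "edge_right": 0, "edge_both": 0,
    "sonar_detected": 1, "sonar_left": 1, "sonar_left_dist": "far",
    "sonar_right": 0, "sonar_right_dist": "very_far", "sonar_both": 0,
    "contact_detected": 0, "contact_left": 0, "contact_right": 0, "contact_both": 0,
    "estimated_distance_from_edge_in_front": "near",
    "estimated_distance_from_edge_in_back": "far",
    "estimated_distance_from_edge_to_left": "far",
    "estimated_distance_from_edge_to_right": "very_far",
}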

The goal is to identify the state where the reward signal is received and learn a policy that acquires that reward as quickly as possible. In a traditional Markov model, this state space, represented discretely, would have 2,985,984 possible values, far too many to explore one by one using something like Q-learning or SARSA.
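
As a quick check, that count follows directly from the feature list: 12 binary flags and 6 three-valued features.

n_binary, n_ternary = 12, 6   # 12 0|1 flags, 6 near|far|very_far features
print(2 ** n_binary * 3 ** n_ternary)  # -> 2985984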

Can anyone recommend a reinforcement learning library appropriate for this domain (preferably with Python bindings), or an unimplemented algorithm that I could potentially implement myself?

Answer

Your actual state is the robot's position and orientation in the world. Using these sensor readings is an approximation, since it is likely to render many states indistinguishable.

Now, if you go down this road, you could use linear function approximation. Then this is just 24 binary features (the 12 0|1 flags, plus 2 bits for each of the 6 near|far|very_far features). That is such a small number that you could even use all pairwise products of features for learning. Farther down this road is online discovery of feature dependencies (see Alborz Geramifard's paper, for example), which is directly related to your interest in hierarchical learning.
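
Here is a minimal sketch of what that could look like, assuming the hypothetical state dict shown earlier and an assumed four-action set; the 2-bits-per-distance ("thermometer") encoding and the epsilon-greedy Q-learning update are standard choices, not something prescribed by the answer:

import random

# Assumed action set for the two-wheeled robot (not specified in the post).
ACTIONS = ["forward", "backward", "turn_left", "turn_right"]

BINARY_KEYS = [
    "edge_detected", "edge_left", "edge_right", "edge_both",
    "sonar_detected", "sonar_left", "sonar_right", "sonar_both",
    "contact_detected", "contact_left", "contact_right", "contact_both",
]
TERNARY_KEYS = [
    "sonar_left_dist", "sonar_right_dist",
    "estimated_distance_from_edge_in_front",
    "estimated_distance_from_edge_in_back",
    "estimated_distance_from_edge_to_left",
    "estimated_distance_from_edge_to_right",
]

def featurize(state):
    # 12 raw flags, plus 2 "thermometer" bits per three-valued feature
    # (at least far?  very far?) -- the 24 binary features mentioned above.
    phi = [float(state[k]) for k in BINARY_KEYS]
    for k in TERNARY_KEYS:
        phi.append(1.0 if state[k] in ("far", "very_far") else 0.0)
        phi.append(1.0 if state[k] == "very_far" else 0.0)
    return phi  # length 24

N_FEATURES = len(BINARY_KEYS) + 2 * len(TERNARY_KEYS)  # 24

# One weight vector per action; Q(s, a) is the dot product w_a . phi(s).
weights = {a: [0.0] * N_FEATURES for a in ACTIONS}

def q_value(phi, action):
    return sum(w * x for w, x in zip(weights[action], phi))

def choose_action(phi, epsilon=0.1):
    # epsilon-greedy exploration
    if random.random() < epsilon:
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q_value(phi, a))

def q_update(phi, action, reward, phi_next, alpha=0.05, gamma=0.95):
    # One Q-learning step with linear function approximation:
    # w_a += alpha * (r + gamma * max_a' Q(s', a') - Q(s, a)) * phi(s)
    td_error = (reward
                + gamma * max(q_value(phi_next, a) for a in ACTIONS)
                - q_value(phi, action))
    w = weights[action]
    for i, x in enumerate(phi):
        w[i] += alpha * td_error * x

Taking all pairwise products of features, as suggested above, would grow the feature vector from 24 to 24 + (24 * 23) / 2 = 300 entries, which is still easily manageable for linear methods.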

An alternative is to use a conventional algorithm to track the robot's position and use the position as input to RL.
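
A rough sketch of that alternative, assuming a differential-drive dead-reckoning model (WHEEL_BASE, the cell size, and the heading resolution are made-up constants for illustration); the pose update is standard dead reckoning, and the discretizer turns the continuous pose into a small index usable with an ordinary Q-table:

import math

WHEEL_BASE = 0.10  # assumed distance between the two wheels, in meters

def update_pose(x, y, theta, d_left, d_right):
    # Standard dead-reckoning update from per-wheel distances traveled,
    # using the average heading over the step (small-motion approximation).
    d_center = (d_left + d_right) / 2.0
    d_theta = (d_right - d_left) / WHEEL_BASE
    x += d_center * math.cos(theta + d_theta / 2.0)
    y += d_center * math.sin(theta + d_theta / 2.0)
    return x, y, theta + d_theta

def discretize_pose(x, y, theta, cell=0.05, n_headings=8):
    # Bucket the continuous pose into a coarse grid cell plus one of
    # n_headings heading sectors -- small enough for a tabular method.
    heading = int((theta % (2 * math.pi)) / (2 * math.pi) * n_headings) % n_headings
    return (int(x // cell), int(y // cell), heading)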

