通过强化学习训练神经网络 [英] Training a Neural Network with Reinforcement learning

查看:603
本文介绍了通过强化学习训练神经网络的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我了解前馈神经网络的基础知识,以及如何使用反向传播算法对其进行训练,但是我正在寻找一种算法,可以用来通过强化学习在线训练ANN.

I know the basics of feedforward neural networks, and how to train them using the backpropagation algorithm, but I'm looking for an algorithm than I can use for training an ANN online with reinforcement learning.

例如,购物车杆向上摆动问题是我想用一个人工神经网络解决.在那种情况下,我不知道应该怎么做来控制摆,我只知道我离理想位置有多近.我需要让ANN基于奖励和惩罚来学习.因此,监督学习不是一种选择.

For example, the cart pole swing up problem is one I'd like to solve with an ANN. In that case, I don't know what should be done to control the pendulum, I only know how close I am to the ideal position. I need to have the ANN learn based on reward and punishment. Thus, supervised learning isn't an option.

另一种情况是蛇游戏,反馈被延迟了,并且仅限于目标和反目标,而不是奖励.

Another situation is something like the snake game, where feedback is delayed, and limited to goals and anti-goals, rather than reward.

我可以想到一些针对第一种情况的算法,例如爬山算法或遗传算法,但是我猜它们都会很慢.它们也可能适用于第二种情况,但速度非常慢,并且不利于在线学习.

I can think of some algorithms for the first situation, like hill-climbing or genetic algorithms, but I'm guessing they would both be slow. They might also be applicable in the second scenario, but incredibly slow, and not conducive to online learning.

我的问题很简单:是否存在用于通过强化学习训练人工神经网络的简单算法?我主要对实时奖励情况感兴趣,但是如果基于目标的算法情况还可以,甚至更好.

My question is simple: Is there a simple algorithm for training an artificial neural network with reinforcement learning? I'm mainly interested in real-time reward situations, but if an algorithm for goal-based situations is available, even better.

推荐答案

有一些关于该主题的研究论文:

There are some research papers on the topic:

  • Efficient Reinforcement Learning Through Evolving Neural Network Topologies (2002)
  • Reinforcement Learning Using Neural Networks, with Applications to Motor Control
  • Reinforcement Learning Neural Network To The Problem Of Autonomous Mobile Robot Obstacle Avoidance

和一些代码:

  • Code examples for neural network reinforcement learning.

这些只是该主题上的一些Google顶级搜索结果.尽管我还没有亲自阅读过,但前几篇论文看起来还不错.我想,如果您在Google Scholar上进行快速搜索,您将会发现更多关于具有强化学习功能的神经网络的信息.

Those are just some of the top google search results on the topic. The first couple of papers look like they're pretty good, although I haven't read them personally. I think you'll find even more information on neural networks with reinforcement learning if you do a quick search on Google Scholar.

这篇关于通过强化学习训练神经网络的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆