通过强化学习训练神经网络 [英] Training a Neural Network with Reinforcement learning

查看：603 发布时间：2020/4/27 3:45:29 algorithm language-agnostic machine-learning neural-network reinforcement-learning

本文介绍了通过强化学习训练神经网络的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我了解前馈神经网络的基础知识，以及如何使用反向传播算法对其进行训练，但是我正在寻找一种算法，可以用来通过强化学习在线训练ANN.

I know the basics of feedforward neural networks, and how to train them using the backpropagation algorithm, but I'm looking for an algorithm than I can use for training an ANN online with reinforcement learning.

例如，购物车杆向上摆动问题是我想用一个人工神经网络解决.在那种情况下，我不知道应该怎么做来控制摆，我只知道我离理想位置有多近.我需要让ANN基于奖励和惩罚来学习.因此，监督学习不是一种选择.

For example, the cart pole swing up problem is one I'd like to solve with an ANN. In that case, I don't know what should be done to control the pendulum, I only know how close I am to the ideal position. I need to have the ANN learn based on reward and punishment. Thus, supervised learning isn't an option.

另一种情况是蛇游戏，反馈被延迟了，并且仅限于目标和反目标，而不是奖励.

Another situation is something like the snake game, where feedback is delayed, and limited to goals and anti-goals, rather than reward.

我可以想到一些针对第一种情况的算法，例如爬山算法或遗传算法，但是我猜它们都会很慢.它们也可能适用于第二种情况，但速度非常慢，并且不利于在线学习.

I can think of some algorithms for the first situation, like hill-climbing or genetic algorithms, but I'm guessing they would both be slow. They might also be applicable in the second scenario, but incredibly slow, and not conducive to online learning.

我的问题很简单:是否存在用于通过强化学习训练人工神经网络的简单算法?我主要对实时奖励情况感兴趣，但是如果基于目标的算法情况还可以，甚至更好.

My question is simple: Is there a simple algorithm for training an artificial neural network with reinforcement learning? I'm mainly interested in real-time reward situations, but if an algorithm for goal-based situations is available, even better.

通过强化学习训练神经网络 [英] Training a Neural Network with Reinforcement learning

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录关闭

通过强化学习训练神经网络 [英] Training a Neural Network with Reinforcement learning

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录 关闭

登录关闭