How does Adagrad work in Keras? What does self.weights mean in a Keras Optimizer?

Problem description

For example, the implementation of Keras' Adagrad is:

class Adagrad(Optimizer):
    """Adagrad optimizer.

    It is recommended to leave the parameters of this optimizer
    at their default values.

    # Arguments
        lr: float >= 0. Learning rate.
        epsilon: float >= 0.
        decay: float >= 0. Learning rate decay over each update.

    # References
        - [Adaptive Subgradient Methods for Online Learning and Stochastic Optimization](http://www.jmlr.org/papers/volume12/duchi11a/duchi11a.pdf)
    """

    def __init__(self, lr=0.01, epsilon=1e-8, decay=0., **kwargs):
        super(Adagrad, self).__init__(**kwargs)
        self.lr = K.variable(lr)
        self.epsilon = epsilon
        self.decay = K.variable(decay)
        self.initial_decay = decay
        self.iterations = K.variable(0.)

    def get_updates(self, params, constraints, loss):
        grads = self.get_gradients(loss, params)
        shapes = [K.get_variable_shape(p) for p in params]
        accumulators = [K.zeros(shape) for shape in shapes]
        self.weights = accumulators
        self.updates = []

        lr = self.lr
        if self.initial_decay > 0:
            lr *= (1. / (1. + self.decay * self.iterations))
            self.updates.append(K.update_add(self.iterations, 1))

        for p, g, a in zip(params, grads, accumulators):
            new_a = a + K.square(g)  # update accumulator
            self.updates.append(K.update(a, new_a))
            new_p = p - lr * g / (K.sqrt(new_a) + self.epsilon)
            # apply constraints
            if p in constraints:
                c = constraints[p]
                new_p = c(new_p)
            self.updates.append(K.update(p, new_p))
        return self.updates
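
For reference, the loop above applies the standard Adagrad rule from the Duchi et al. paper cited in the docstring. A minimal NumPy restatement of a single step (my own sketch, not part of Keras) is:

import numpy as np

def adagrad_step(p, g, a, lr=0.01, epsilon=1e-8):
    # a is the running sum of squared gradients for parameter p
    a_new = a + np.square(g)                          # matches new_a = a + K.square(g)
    p_new = p - lr * g / (np.sqrt(a_new) + epsilon)   # matches new_p above
    return p_new, a_new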

The function get_updates() looks like a single update step. But shouldn't the accumulators store the history information? Why are they initialized to zeros on each call? How can they act as accumulators over the whole training process?

And what does this line do?

self.weights = accumulators

It seems self.weights is never used again afterwards.

Recommended answer

You are correct: for every optimizer in Keras, get_updates() implements the tensor logic for one step of updates. It is called only once per model.fit(), from _make_train_function(), which builds the training function by passing the update ops as updates=. That same update rule is then applied iteration after iteration to update the model parameters and the optimizer's other parameters.
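
In other words, the K.zeros() initialization runs only once, when the update graph is built; after that, the same accumulator tensors are updated in place on every iteration. A rough NumPy analogy (my own sketch, not Keras code):

import numpy as np

p = np.array([1.0, -2.0])          # a model parameter
a = np.zeros_like(p)               # accumulator, created once (like K.zeros in get_updates)
lr, epsilon = 0.01, 1e-8

for step in range(3):              # stands in for successive training iterations
    g = 2.0 * p                    # pretend gradient of loss = sum(p ** 2)
    a += np.square(g)              # the accumulator keeps growing across iterations
    p -= lr * g / (np.sqrt(a) + epsilon)
    print(step, a, p)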

self.weights of an optimizer class holds its internal parameters. It is not used for training; it only keeps the optimizer's state (a list of pointers to the parameter/accumulator tensors). When model.save is called, that state is saved as well via get_weights(), and it is loaded back via set_weights() when the model is loaded again.
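
As a rough illustration of how that state is exposed and saved (a sketch assuming a Keras release from the same era as the snippet above; the file name model.h5 is arbitrary):

import numpy as np
from keras.models import Sequential, load_model
from keras.layers import Dense
from keras.optimizers import Adagrad

model = Sequential([Dense(1, input_dim=4)])
model.compile(optimizer=Adagrad(lr=0.01), loss='mse')
model.fit(np.random.rand(8, 4), np.random.rand(8, 1), epochs=1, verbose=0)

# The optimizer state kept in self.weights (here, the Adagrad accumulators)
print(model.optimizer.get_weights())

model.save('model.h5')              # optimizer state is written out via get_weights()
restored = load_model('model.h5')   # ...and restored via set_weights()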
