在 TensorFlow 中定义自定义梯度时使用操作输入 [英] Using op inputs when defining custom gradients in TensorFlow

查看：24 发布时间：2021/9/5 20:11:41 python tensorflow

本文介绍了在 TensorFlow 中定义自定义梯度时使用操作输入的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在尝试为我的自定义 TF 操作定义渐变方法.我在网上找到的大多数解决方案似乎都基于 gisthttps://github.com/harpone" rel="nofollow noreferrer">竖琴.我不愿意使用这种方法，因为它使用了不能在 GPU 上运行的 py_func.我找到了另一个解决方案这里使用 tf.identity() 看起来更优雅，我认为将在 GPU 上运行.但是，我在访问自定义梯度函数中的操作输入时遇到了一些问题.这是我的代码:

I'm trying to define a gradient method for my custom TF operation. Most of the solutions I have found online seem to based on a gist by harpone. I'm reluctant to use that approach as it uses py_func which won't run on GPU. I found another solution here that uses tf.identity() that looks more elegant and I think will run on GPU. However, I have some problems accessing inputs of the ops in my custom gradient function. Here's my code:

@tf.RegisterGradient('MyCustomGradient')
def _custom_gradient(op, gradients):
    x = op.inputs[0]
    return(x)

def my_op(w):
    return tf.pow(w,3)


var_foo = tf.Variable(5, dtype=tf.float32)
bar = my_op(var_foo)


g = tf.get_default_graph()
with g.gradient_override_map({'Identity': 'MyCustomGradient'}):
    bar = tf.identity(bar)
    g = tf.gradients(bar, var_foo)

with tf.Session() as sess:

    sess.run(tf.global_variables_initializer())
    print(sess.run(g))

我期待 _custom_gradient() 将输入返回给 op(在本例中为 5)，但它似乎返回了 op output x gradient.我的自定义 my_op 将具有不可微分的操作，例如 tf.sign，我想根据输入定义我的自定义渐变.我究竟做错了什么?

I was expecting _custom_gradient() to return the input to the op (5 in this example) but instead it seems to return op output x gradient. My custom my_op will have non-differentiable operations like tf.sign and I'd like to define my custom gradient based on the inputs. What am I doing wrong?

在 TensorFlow 中定义自定义梯度时使用操作输入 [英] Using op inputs when defining custom gradients in TensorFlow

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

在 TensorFlow 中定义自定义梯度时使用操作输入 [英] Using op inputs when defining custom gradients in TensorFlow

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭