我如何计算在TensorFlow的急切执行模式下的梯度w.r.t.a是不可变的？

import numpy as np import tensorflow as tf tf.enable_eager_execution() a = tf.convert_to_tensor(np.array([1., 2., 3.]), dtype=tf.float32) b = tf.constant([1., 2., 3.]) c = tf.Variable([1., 2., 3.], trainable=False) d = tf.Variable([1., 2., 3.], trainable=True) with tf.GradientTape() as tape: result = a + b + c + d grads = tape.gradient(result, [a, b, c, d])

1条回答

网友

1楼 · 发布于 2024-04-26 21:10:08

^{}文档揭示了一个简单的解决方案：

Trainable variables (created by tf.Variable or tf.get_variable, where trainable=True is default in both cases) are automatically watched. Tensors can be manually watched by invoking the watch method on this context manager.

在这种情况下

with tf.GradientTape() as tape:
    tape.watch(a)
    tape.watch(b)
    tape.watch(c)
    result = a + b + c + d

grads = tape.gradient(result, [a, b, c, d])

将导致print(grads)：

^{pr2}$

编程相关推荐

java如何格式化servlet响应以生成HTML中“accept”参数可接受的媒体类型？
java如何使用JasperReports为单个报表传递多个结果集？
EclipseVBA到JAVA链接
java如何为Gradle中的不同配置配置PMD规则集？
在给出正确答案之前，是否要求回答？Java Eclipse
java查询SearchView崩溃（尝试实现SearchView操作栏）
java为什么跳过我的IF语句？
java循环以获取与输入值最接近的对象
java默认构造函数真正做什么？
java我需要测试类中的测试方法吗

相关问题更多 >

编程相关推荐

热门问题

热门文章

我如何计算在TensorFlow的急切执行模式下的梯度w.r.t.a是不可变的？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >