获取TensorF中动态的最后一个输出

3条回答

网友

1楼 · 编辑于 2024-04-25 22:35:17

从以下两个来源

http://www.wildml.com/2016/08/rnns-in-tensorflow-a-practical-guide-and-undocumented-features/

outputs, last_states = tf.nn.dynamic_rnn(
cell=cell,
dtype=tf.float64,
sequence_length=X_lengths,
inputs=X)

或者https://github.com/ageron/handson-ml/blob/master/14_recurrent_neural_networks.ipynb

显然，最后的状态可以直接从动态调用的第二个输出中提取。它将为您提供跨所有层的最后一个状态（在LSTM中，它是从LSTMStateTuple合成的），而输出包含最后层中的所有状态。

网友

2楼 · 编辑于 2024-04-25 22:35:17

好吧-看来实际上是一个更简单的解决方案。正如@Shao Tang和@Rahul提到的，最好的方法是访问最终的细胞状态。原因如下：

如果您查看GRUCell源代码（如下），您将看到该单元维护的“状态”实际上是隐藏的权重本身。因此，当tf.nn.dynamic_rnn返回最终状态时，实际上是返回您感兴趣的最终隐藏权重。为了证明这一点，我调整了你的设置并得到了结果：

GRUCell调用（rnn_cell_impl.py）：

def call(self, inputs, state):
"""Gated recurrent unit (GRU) with nunits cells."""
if self._gate_linear is None:
      bias_ones = self._bias_initializer
if self._bias_initializer is None:
        bias_ones = init_ops.constant_initializer(1.0, dtype=inputs.dtype)
with vs.variable_scope("gates"):  # Reset gate and update gate.
self._gate_linear = _Linear(
            [inputs, state],
2 * self._num_units,
True,
bias_initializer=bias_ones,
kernel_initializer=self._kernel_initializer)
    value = math_ops.sigmoid(self._gate_linear([inputs, state]))
    r, u = array_ops.split(value=value, num_or_size_splits=2, axis=1)
    r_state = r * state
if self._candidate_linear is None:
with vs.variable_scope("candidate"):
self._candidate_linear = _Linear(
            [inputs, r_state],
self._num_units,
True,
bias_initializer=self._bias_initializer,
kernel_initializer=self._kernel_initializer)
    c = self._activation(self._candidate_linear([inputs, r_state]))
    new_h = u * state + (1 - u) * c
return new_h, new_h

解决方案：

import numpy as np
import tensorflow as tf

batch = 2
dim = 3
hidden = 4

lengths = tf.placeholder(dtype=tf.int32, shape=[batch])
inputs = tf.placeholder(dtype=tf.float32, shape=[batch, None, dim])
cell = tf.nn.rnn_cell.GRUCell(hidden)
cell_state = cell.zero_state(batch, tf.float32)
output, state = tf.nn.dynamic_rnn(cell, inputs, lengths, initial_state=cell_state)

inputs_ = np.asarray([[[0, 0, 0], [1, 1, 1], [2, 2, 2], [3, 3, 3]],
                    [[6, 6, 6], [7, 7, 7], [8, 8, 8], [9, 9, 9]]],
                    dtype=np.int32)
lengths_ = np.asarray([3, 1], dtype=np.int32)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    output_, state_ = sess.run([output, state], {inputs: inputs_, lengths: lengths_})
    print (output_)
    print (state_)

输出：

[[[ 0.          0.          0.          0.        ]
  [-0.24305521 -0.15512943  0.06614969  0.16873555]
  [-0.62767833 -0.30741733  0.14819752  0.44313088]
  [ 0.          0.          0.          0.        ]]

 [[-0.99152333 -0.1006391   0.28767768  0.76360202]
  [ 0.          0.          0.          0.        ]
  [ 0.          0.          0.          0.        ]
  [ 0.          0.          0.          0.        ]]]
[[-0.62767833 -0.30741733  0.14819752  0.44313088]
 [-0.99152333 -0.1006391   0.28767768  0.76360202]]

对于正在使用LSTMCell（另一个流行的选项）的其他读者来说，情况有点不同。LSTMCell以不同的方式维护状态-cell state是实际cell state和hidden state的元组或连接版本。因此，要访问最终隐藏权重，可以在单元格初始化期间设置（is_state_tuple到True），最终状态将是一个元组：（final cell state，final hidden weights）。所以，在这种情况下
_2;，（2;，h）=tf.nn.dynamic_rnn（单元，输入，长度，初始状态=单元状态）

会给你最后的重量。

参考文献： c_state and m_state in Tensorflow LSTM https://github.com/tensorflow/tensorflow/blob/438604fc885208ee05f9eef2d0f2c630e1360a83/tensorflow/python/ops/rnn_cell_impl.py#L308 https://github.com/tensorflow/tensorflow/blob/438604fc885208ee05f9eef2d0f2c630e1360a83/tensorflow/python/ops/rnn_cell_impl.py#L415

网友
3楼 · 编辑于 2024-04-25 22:35:17

这就是gather_nd的目的！

def extract_axis_1(data, ind):
    """
    Get specified elements along the first axis of tensor.
    :param data: Tensorflow tensor that will be subsetted.
    :param ind: Indices to take (one for each element along axis 0 of data).
    :return: Subsetted tensor.
    """

    batch_range = tf.range(tf.shape(data)[0])
    indices = tf.stack([batch_range, ind], axis=1)
    res = tf.gather_nd(data, indices)

    return res

就你而言：

output = extract_axis_1(output, lengths - 1)

现在output是维度[batch_size, num_cells]的张量。

相关问题更多 >

编程相关推荐

热门问题

热门文章

获取TensorF中动态的最后一个输出

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >