内存缩减Tensorflow TPU v2/v3 bfloat16

1条回答

网友

1楼 · 发布于 2024-04-24 20:10:24

您可以将bfloat16与tpu一起使用。主要有两件事要做：

将输入转换为输入管道中的bfloat16
在bfloat16范围内环绕您的网络，并将输出转换为F32以进行进一步计算。在

下面是一个代码片段，说明了必要的更改：

def input_fn():

  def dataset_parser(self, value):
    """Parse an ImageNet record from a serialized string Tensor."""
    image = self.image_preprocessing_fn(
        image_bytes=image_bytes,
        is_training=self.is_training,
    )

    if self.use_bfloat16:
      image = tf.cast(image, tf.bfloat16)

    return image, label


def resnet_model_fn(features, labels, mode, params):
  """The model_fn for ResNet to be used with TPUEstimator."""

  # This nested function allows us to avoid duplicating the logic which
  # builds the network, for different values of  precision.
  def build_network():
    network = resnet_model.resnet_v1(
        resnet_depth=FLAGS.resnet_depth,
        num_classes=LABEL_CLASSES,
        data_format=FLAGS.data_format)
    return network(
        inputs=features, is_training=(mode == tf.estimator.ModeKeys.TRAIN))

  if FLAGS.precision == 'bfloat16':
    with bfloat16.bfloat16_scope():
      logits = build_network()
    logits = tf.cast(logits, tf.float32)
  elif FLAGS.precision == 'float32':
    logits = build_network()

您还可以看到this TPU model中说明的第二个条件。在

相关问题更多 >

编程相关推荐

热门问题

热门文章

内存缩减Tensorflow TPU v2/v3 bfloat16

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >