Extracting targets from a TensorFlow PrefetchDataset

> for example in tf_test:
>     print(example[0].numpy())
>     print(example[1].numpy())
>     exit()
$ [[-0.31 -0.94 -1.12 ... 0.18 -0.27]
   [-0.22 -0.54 -0.14 ... 0.33 -0.55]
   [-0.60 -0.02 -1.41 ... 0.21 -0.63]
   ...
   [-0.03 -0.91 -0.12 ... 0.77 -0.23]
   [-0.76 -1.48 -0.15 ... 0.38 -0.35]
   [-0.55 -0.08 -0.69 ... 0.44 -0.36]]
  [0 0 1 0 1 0 0 0 1 0 1 1 0 1 0 0 0 ... 0 1 1 0]

> y_pred = model.predict(tf_test)
> print(y_pred)
$ [[0.01]
   [0.14]
   [0.00]
   ...
   [0.32]
   [0.03]
   [0.00]]
> y_pred_list = [int(x[0] >= 0.5) for x in y_pred] # a value >= 0.5 is a positive prediction
> y_true = [] # what I need help with
> print(sklearn.metrics.confusion_matrix(y_true, y_pred_list))

> labels = [] # what I need help with
> predictions = y_pred_list # could we just use a tensor?
> print(tf.math.confusion_matrix(labels, predictions))
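A side note on turning sigmoid outputs into class labels: `int()` truncates toward zero, so it maps every probability below 1.0 to 0, and an explicit comparison against the threshold is needed instead. A minimal sketch, using made-up probabilities in the same nested shape that `model.predict` returns (the values and the `THRESHOLD` name are assumptions for illustration):

```python
# Hypothetical probabilities shaped like model.predict output: one [p] per sample.
y_pred = [[0.01], [0.14], [0.97], [0.50], [0.32]]

# int() truncates toward zero, so every value below 1.0 becomes 0.
truncated = [int(x[0]) for x in y_pred]

# Comparing against the threshold gives the intended 0/1 labels.
THRESHOLD = 0.5  # assumed cut-off, matching the ">= 0.5 is positive" comment
thresholded = [int(x[0] >= THRESHOLD) for x in y_pred]

print(truncated)    # [0, 0, 0, 0, 0]
print(thresholded)  # [0, 0, 1, 1, 0]
```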

3 answers

User

#1 · edited 2024-04-25 07:02:30

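You can convert it to a list with list(ds) and then rebuild it as an ordinary dataset with tf.data.Dataset.from_tensor_slices(list(ds)). From there your nightmare begins again, but at least it is a nightmare other people have been through before.

Note that for more complex datasets (for example nested dictionaries) you will need more preprocessing after calling list(ds), but this should work for the example you asked about.

This is far from a satisfying answer, but unfortunately the class is entirely undocumented and none of the standard Dataset tricks work.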
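A rough sketch of the list-then-rebuild idea on a toy dataset. The toy data, the batch size, and the step of concatenating features and labels separately before calling `from_tensor_slices` are assumptions for illustration, not something spelled out in the answer:

```python
import numpy as np
import tensorflow as tf

# Toy stand-in for a prefetched (features, labels) dataset.
features = np.arange(12, dtype="float32").reshape(6, 2)
targets = np.array([0, 1, 0, 1, 1, 0], dtype="int32")
ds = tf.data.Dataset.from_tensor_slices((features, targets)).batch(4).prefetch(1)

# list(ds) materialises every element; here each one is a (features, labels) batch.
batches = list(ds)

# Join the batches back together, keeping features and labels separate.
x_all = tf.concat([x for x, _ in batches], axis=0)
y_all = tf.concat([y for _, y in batches], axis=0)

# Rebuild an ordinary, unbatched dataset from the two tensors.
rebuilt = tf.data.Dataset.from_tensor_slices((x_all, y_all))

print(len(batches), x_all.shape, y_all.numpy())
```

Everything is held in memory at once here, so this only suits datasets that fit in RAM.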

User

#2 · edited 2024-04-25 07:02:30

If you want to either keep the batches or extract all labels as a single tensor, you can use the following function:


import tensorflow as tf

def get_labels_from_tfdataset(tfdataset, batched=False):
    # Each element is a (features, labels) pair; keep only the labels.
    labels = [x[1] for x in tfdataset]

    if not batched:
        # Join the per-batch label tensors into a single tensor.
        return tf.concat(labels, axis=0)

    return labels
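To tie this back to the confusion matrix in the question, here is a minimal sketch. The toy `tf_test` dataset and the `y_prob` values are made up and stand in for the real test set and for `model.predict(tf_test)`:

```python
import numpy as np
import tensorflow as tf

# Toy stand-in for tf_test: 10 samples, 4 features, binary labels, batches of 4.
features = np.random.rand(10, 4).astype("float32")
targets = np.array([0, 0, 1, 0, 1, 0, 0, 0, 1, 1], dtype="int32")
tf_test = tf.data.Dataset.from_tensor_slices((features, targets)).batch(4).prefetch(1)

# Collect the label component of every batch, then join the batches.
y_true = tf.concat([y for _, y in tf_test], axis=0)

# Hypothetical probabilities standing in for model.predict(tf_test).
y_prob = np.array([0.1, 0.7, 0.9, 0.2, 0.8, 0.3, 0.1, 0.4, 0.2, 0.6])
y_pred = (y_prob >= 0.5).astype("int32")

# Rows are true labels, columns are predicted labels.
cm = tf.math.confusion_matrix(y_true, y_pred, num_classes=2)
print(cm.numpy())
```

One caveat: if the dataset is shuffled and reshuffles on every pass (the default for `Dataset.shuffle`), labels collected in a separate loop will not line up with the order `model.predict` saw. In that case it is safer to build the test dataset without shuffling.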

User

#3 · edited 2024-04-25 07:02:30

You can use map to pick either the input or the label from each (input, label) pair and turn the result into a list:

import tensorflow as tf
import numpy as np

inputs = np.random.rand(100, 99)
targets = np.random.rand(100)

ds = tf.data.Dataset.from_tensor_slices((inputs, targets))

X_train = list(map(lambda x: x[0], ds))
y_train = list(map(lambda x: x[1], ds))
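The snippet above works on an unbatched dataset, where each element is a single sample. On a batched dataset each element is a whole batch, so a per-element list ends up holding batches rather than samples. A sketch of one way around that, assuming a batch size of 32 chosen only for illustration, is to call `unbatch()` first:

```python
import numpy as np
import tensorflow as tf

inputs = np.random.rand(100, 99)
targets = np.random.rand(100)

# Same toy data as above, but batched the way a test pipeline usually is.
ds = tf.data.Dataset.from_tensor_slices((inputs, targets)).batch(32)

# Iterating a batched dataset yields one (inputs, labels) pair per batch.
batch_sizes = [int(y.shape[0]) for _, y in ds]

# unbatch() restores per-sample elements; as_numpy_iterator() yields NumPy values.
y_all = np.array([y for _, y in ds.unbatch().as_numpy_iterator()])

print(batch_sizes)  # [32, 32, 32, 4]
print(y_all.shape)  # (100,)
```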
