TensorFlow 2.0 (Keras): classification with restricted classes

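3 answers

User

Answer 1 · edited 2024-04-25 08:43:41

Assuming you also have to pass the class restriction matrix at inference time:

You can build the restriction operation on the logits by hand inside a simple Lambda layer. Then apply softmax to the restricted logits and use the standard cross-entropy loss function.

Here is a dummy example in which the class mask/restrictions are given in binary format: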

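import numpy as np
import tensorflow as tf
from tensorflow.keras.layers import Input, Dense, Lambda, Activation
from tensorflow.keras.models import Model

n_class = 8
n_sample = 10
X = np.random.uniform(0, 1, (n_sample, 30))
y = np.random.randint(0, n_class, (n_sample,))
# mask[i, j] == 1 means class j is forbidden for sample i
mask = np.random.randint(0, 2, (n_sample, n_class))

def mask_logits(logits, mask):
    # Replace the logits of forbidden classes with a large negative value
    restrictions = (mask > 0)
    return tf.where(restrictions, -1000.0 * tf.ones_like(logits), logits)

inp_x = Input((X.shape[-1],))
inp_mask = Input((n_class,))
logits = Dense(n_class)(inp_x)
# Pass both tensors as a list so the restriction mask is an ordinary input
# rather than the layer's reserved `mask` argument
out = Lambda(lambda t: mask_logits(t[0], t[1]))([logits, inp_mask])
out = Activation('softmax')(out)
model = Model([inp_x, inp_mask], out)
model.compile('adam', 'sparse_categorical_crossentropy')

model.fit([X, mask], y, epochs=3)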

At inference time, the predictions can be retrieved as follows:

pred = model.predict([X, mask])

Finally, we run a couple of simple checks:

>>> pred.sum(1)
array([1., 1., 1., 1., 1., 1., 1., 1., 1., 1.], dtype=float32)

The predicted probabilities sum to 1 in every row.

>>> pred == 0
array([[ True,  True, False,  True, False, False,  True, False],
       [ True,  True,  True,  True, False, False,  True,  True],
       [False, False, False,  True, False, False,  True,  True],
       [False, False,  True, False, False,  True,  True, False],
       [False, False,  True, False, False,  True,  True, False],
       [False, False,  True,  True, False, False, False, False],
       [False,  True, False,  True, False, False,  True,  True],
       [ True, False, False, False, False,  True, False,  True],
       [False,  True,  True, False, False, False,  True, False],
       [False,  True,  True, False,  True, False, False, False]])

Some predicted probabilities are equal to 0, exactly where our binary mask says the class is forbidden.
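The zeros in the check above follow from how softmax treats a logit of -1000. As a minimal, dependency-free sketch (plain Python instead of TensorFlow; `masked_softmax` and its `fill` parameter are hypothetical names, not part of Keras), the same masking rule can be reproduced on a single row:

```python
import math

def masked_softmax(logits, mask, fill=-1000.0):
    """Softmax over one row; entries with mask > 0 are forbidden classes."""
    # Same rule as mask_logits: forbidden classes get a very negative logit.
    masked = [fill if m > 0 else z for z, m in zip(logits, mask)]
    # Subtract the maximum for numerical stability before exponentiating.
    top = max(masked)
    exps = [math.exp(z - top) for z in masked]
    total = sum(exps)
    return [e / total for e in exps]

# Made-up logits for 4 classes; classes 1 and 3 are forbidden.
probs = masked_softmax([0.3, -1.2, 2.0, 0.5], [0, 1, 0, 1])
print(probs)
```

The exponential of a value near -1000 underflows to zero, so forbidden classes receive probability 0 and the remaining classes still sum to 1. If every class in a row is forbidden, all masked logits are equal and the output becomes uniform, so such rows would need separate handling.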

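User

Answer 2 · edited 2024-04-25 08:43:41

But how is this supposed to work at inference time? Do you know the class restrictions for a new row when you run inference?

If the answer is "yes":

I think you should not feed the whole class restriction matrix as an input. Instead, use concatenation to supply the class restriction vector. So rather than feeding row with shape (n,), feed row_plus_class_restrictions with shape (n+20,):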

row_feature_0
row_feature_1
...
row_feature_n
0
1
.
.
.
1

This way you do not need any manual correction step either: the model learns from the classification loss what it should output.
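The concatenation described above could look roughly like this. It is only an illustration in plain Python: `append_restrictions` is a hypothetical helper, and the feature values and restriction flags are made up.

```python
def append_restrictions(row_features, class_restrictions):
    """Return one flat input vector: the features followed by the 0/1 restriction flags."""
    return [float(v) for v in row_features] + [float(r) for r in class_restrictions]

row = [0.12, 0.80, 0.33]      # n = 3 made-up feature values
restrictions = [0, 1] * 10    # 20 made-up restriction flags, one per class
row_plus_class_restrictions = append_restrictions(row, restrictions)
print(len(row_plus_class_restrictions))
```

For a whole batch held in NumPy arrays, `np.concatenate([X, mask], axis=1)` builds the same layout in one call. Note that with this approach nothing forces the output probability of a restricted class to be exactly 0; the model only learns that tendency from the data.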

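If the answer is "no":

Then your model does not make much sense. The training data is a set of (row, class_restrictions, class_it_should_be) with dimension (nb_row_features + 20 + 20), right? What are you trying to train, and in the actual application, what data will you have in your rows? If the answer is no, I do not understand what you want.

User

Answer 3 · edited 2024-04-25 08:43:41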

Your loss function can be implemented in exactly the same way:

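import tensorflow as tf

def getLoss(logits, y, restrictions):
    # Restricted classes get a large negative logit before the softmax cross-entropy
    logits = tf.where(restrictions, -1000.0 * tf.ones_like(y, dtype=tf.float32), logits)
    return tf.nn.softmax_cross_entropy_with_logits(logits=logits, labels=y)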

The model can then be defined as follows:

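from tensorflow.keras.layers import Input, Dense
from tensorflow.keras.models import Model

x_input = Input(shape=(100,))
y_true = Input(shape=(20,))  # one-hot labels, fed as an input so the loss can use them
restrictions = Input(shape=(20,), dtype=tf.bool)
# ... model definition here
y_pred = Dense(20)(x_input)  # raw logits, no softmax applied
model = Model([x_input, restrictions, y_true], y_pred)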

To compile the model, add the loss as follows:

model.add_loss(getLoss(y_pred, y_true, restrictions))
model.compile(optimizer='rmsprop')

Finally, the model can be trained with its fit method. For example:

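import numpy as np

x = np.random.random((1000, 100))
# The restrictions input is declared with dtype=tf.bool, so cast the 0/1 draws to bool
restrictions = np.random.binomial(1, p=0.5, size=(1000, 20)).astype(bool)
y = np.random.randint(20, size=1000)
y_onehot = np.eye(20)[y]
# The loss was attached with add_loss, so the labels go in as an input and no targets are passed
model.fit([x, restrictions, y_onehot], epochs=10, batch_size=10)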
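One thing to keep in mind with this setup: the model's output y_pred is the raw, unrestricted logits, because the restriction is applied only inside the loss. At prediction time the same masking therefore has to be repeated before picking a class. A minimal sketch in plain Python, assuming one row of logits and one row of boolean restrictions (`restricted_argmax` is a hypothetical helper, not a Keras function):

```python
def restricted_argmax(logits, restrictions, fill=-1000.0):
    """Index of the highest-scoring class among those that are not restricted.

    restrictions[j] is True when class j is forbidden, matching getLoss.
    """
    masked = [fill if r else z for z, r in zip(logits, restrictions)]
    return max(range(len(masked)), key=masked.__getitem__)

# Made-up logits for 4 classes; classes 0 and 3 are forbidden.
logits = [2.5, 0.1, 1.7, -0.4]
restrictions = [True, False, False, True]
print(restricted_argmax(logits, restrictions))
```

Since the model defined above also takes y_true as an input, calling predict on it would additionally need a placeholder array for that input; that part is left out of the sketch.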
