有没有办法不用一个热编码器就可以训练RNN？

def to_categorical(y, num_classes=None, dtype='float32'): """Converts a class vector (integers) to binary class matrix. E.g. for use with categorical_crossentropy. # Arguments y: class vector to be converted into a matrix (integers from 0 to num_classes). num_classes: total number of classes. dtype: The data type expected by the input, as a string (`float32`, `float64`, `int32`...) # Returns A binary matrix representation of the input. The classes axis is placed last. # Example ```python # Consider an array of 5 labels out of a set of 3 classes {0, 1, 2}: > labels array([0, 2, 1, 2, 0]) # `to_categorical` converts this into a matrix with as many # columns as there are classes. The number of rows # stays the same. > to_categorical(labels) array([[ 1., 0., 0.], [ 0., 0., 1.], [ 0., 1., 0.], [ 0., 0., 1.], [ 1., 0., 0.]], dtype=float32) ``` """ y = np.array(y, dtype='int') input_shape = y.shape if input_shape and input_shape[-1] == 1 and len(input_shape) > 1: input_shape = tuple(input_shape[:-1]) y = y.ravel() if not num_classes: num_classes = np.max(y) + 1 n = y.shape[0] categorical = np.zeros((n, num_classes), dtype=dtype) categorical[np.arange(n), y] = 1 output_shape = input_shape + (num_classes,) categorical = np.reshape(categorical, output_shape) return categorical

2条回答

网友

1楼 · 编辑于 2024-04-23 08:20:17

您必须具有类一致性，否则您的模型将无法正常工作。你知道吗

如果数字在数字上有意义，你可以用数字代替一个热的。但既然你说它们是课堂，那就没什么意义了。你知道吗

您可以尝试将一些train类分离为未知类，并将它们分组为单个热编码。然后所有新类都将接收相同的编码。你知道吗

但并不能保证这个模型能给你带来好的效果。你知道吗

网友

2楼 · 编辑于 2024-04-23 08:20:17

关于@Dainel关于类一致性的回答，您可以用np.nan替换训练序列中没有出现的任何值，并使用pd.get_dummies，如下所示。你知道吗

train_seq = np.array([1,2,3,4,5])
test_seq = np.array([1,2,3,4,5,6,7,8,9,10], dtype=np.float32)

test_seq[~np.isin(test_seq, train_seq)] = np.nan

df = pd.get_dummies(test_seq, dummy_na=True)
print(df)

它为看不见的数据生成一个单独的类。你知道吗

   1.0  2.0  3.0  4.0  5.0  NaN
0    1    0    0    0    0    0
1    0    1    0    0    0    0
2    0    0    1    0    0    0
3    0    0    0    1    0    0
4    0    0    0    0    1    0
5    0    0    0    0    0    1
6    0    0    0    0    0    1
7    0    0    0    0    0    1
8    0    0    0    0    0    1
9    0    0    0    0    0    1

相关问题更多 >

编程相关推荐

热门问题

热门文章