多层感知器的目标变量为数组而非单一值

0 投票
1 回答
1434 浏览
提问于 2025-04-17 23:05

我刚接触深度学习,正在尝试使用theano库来训练我的数据。这里的MLP教程是针对一个标量输出值的,而我的情况是输出是一个数组,里面有一个1对应于输出中显示的值。

举个例子(假设可能的标量值是0,1,2,3,4,5),

0 = [1,0,0,0,0,0]
1 = [0,1,0,0,0,0]
2 = [0,0,1,0,0,0]

我只修改了代码,以便读取我的输入和输出(现在的输出是一个二维数组,或者在theano的术语中叫矩阵)。其他部分的代码和上面的MLP教程是一样的。

我遇到的错误出现在以下函数中:

test_model = theano.function(inputs=[index],
            outputs=classifier.errors(y),
            givens={
                x: test_set_x[index * batch_size:(index + 1) * batch_size],
                y: test_set_y[index * batch_size:(index + 1) * batch_size]}) //line 286

错误堆栈:

  Traceback (most recent call last):
  File "mlp.py", line 398, in <module>
    test_mlp()
  File "mlp.py", line 286, in test_mlp
    y: test_set_y[index * batch_size:(index + 1) * batch_size]})
  File "/usr/local/lib/python2.7/dist-packages/theano/compile/function.py", line 223, in function
    profile=profile)
  File "/usr/local/lib/python2.7/dist-packages/theano/compile/pfunc.py", line 490, in pfunc
    no_default_updates=no_default_updates)
  File "/usr/local/lib/python2.7/dist-packages/theano/compile/pfunc.py", line 241, in rebuild_collect_shared
    cloned_v = clone_v_get_shared_updates(outputs, copy_inputs_over)
  File "/usr/local/lib/python2.7/dist-packages/theano/compile/pfunc.py", line 92, in clone_v_get_shared_updates
    clone_a(v.owner, copy_inputs_over)
  File "/usr/local/lib/python2.7/dist-packages/theano/compile/pfunc.py", line 131, in clone_a
    clone_v_get_shared_updates(i, copy_inputs_over)
  File "/usr/local/lib/python2.7/dist-packages/theano/compile/pfunc.py", line 92, in clone_v_get_shared_updates
    clone_a(v.owner, copy_inputs_over)
  File "/usr/local/lib/python2.7/dist-packages/theano/compile/pfunc.py", line 131, in clone_a
    clone_v_get_shared_updates(i, copy_inputs_over)
  File "/usr/local/lib/python2.7/dist-packages/theano/compile/pfunc.py", line 92, in clone_v_get_shared_updates
    clone_a(v.owner, copy_inputs_over)
  File "/usr/local/lib/python2.7/dist-packages/theano/compile/pfunc.py", line 135, in clone_a
    strict=rebuild_strict)
  File "/usr/local/lib/python2.7/dist-packages/theano/gof/graph.py", line 213, in clone_with_new_inputs
    new_inputs[i] = curr.type.filter_variable(new)
  File "/usr/local/lib/python2.7/dist-packages/theano/tensor/type.py", line 205, in filter_variable
    self=self)<br><br>
  TypeError: Cannot convert Type TensorType(int64, matrix) (of Variable Subtensor{int64:int64:}.0) into Type TensorType(int32, vector). You can try to manually convert Subtensor{int64:int64:}.0 into a TensorType(int32, vector).

我想知道如何修改这个theano.function,以便让y值可以作为一个矩阵来使用。

1 个回答

2

你需要把 y 定义为 T.imatrix(),而不是 T.lvector()

撰写回答