训练3dconv神经网络失败；损失收敛于0.6931

# Gets the label of the image, the label determines how tensorflow will classify the image def get_label(file_path): # Convert the path to a list of path components parts = tf.strings.split(file_path, os.path.sep) # The fourth last is the class-directory return float(parts[-4] == "class1") # Reads the data from a .nii file and returns a NumPy ndarray that is compatible with tensorflow def decode_img(img): img = nib.load(img.numpy().decode('utf-8')) # convert the compressed string to a NumPy ndarray data = img.get_fdata() # Resize img data = np.resize(data, imgshape) # Normalize max = np.amax(data) min = np.amin(data) data = ((data-min)/(max-min)) return data # Processes a path to return a image data and label pair def process_path(file_path): # Gets the files label label = get_label(file_path) img = decode_img(file_path) return img, label

def configure_for_performance(ds): #ds = ds.cache(filename='cachefile') ds = ds.cache() ds = ds.shuffle(buffer_size=1000) ds = ds.repeat() ds = ds.batch(BATCH_SIZE) ds = ds.prefetch(buffer_size=tf.data.experimental.AUTOTUNE) return ds

# Create a sequential network model = tf.keras.Sequential([ tf.keras.layers.Convolution3D( 4, 4, padding='same', data_format="channels_last", input_shape=imgshape, activation='tanh'), tf.keras.layers.MaxPooling3D(padding='same'), tf.keras.layers.Dropout(0.2), tf.keras.layers.Convolution3D(4, 4, padding='same', activation='tanh'), tf.keras.layers.MaxPooling3D(padding='same'), tf.keras.layers.Dropout(0.2), tf.keras.layers.Convolution3D(4, 4, padding='same', activation='tanh'), tf.keras.layers.MaxPooling3D(padding='same'), tf.keras.layers.Dropout(0.2), tf.keras.layers.Convolution3D(4, 4, padding='same', activation='tanh'), tf.keras.layers.MaxPooling3D(padding='same'), tf.keras.layers.Dropout(0.2), tf.keras.layers.Flatten(), tf.keras.layers.Dense(2048, activation='tanh'), tf.keras.layers.Dense(1024, activation='tanh'), tf.keras.layers.Dense(512, activation='tanh'), tf.keras.layers.Dense(256, activation='tanh'), tf.keras.layers.Dense(1, activation='sigmoid') ]) model.compile(optimizer='adam', loss=tf.keras.losses.BinaryCrossentropy(from_logits=True), metrics=['accuracy']) model.summary() model.fit( train_ds, validation_data=val_ds, epochs=500, steps_per_epoch=BATCH_SIZE, validation_steps=BATCH_SIZE )

2条回答

网友

1楼 · 编辑于 2024-04-23 15:48:47

您获取图像的代码看起来不错，但是我无法亲自测试，因为我不确定您的数据是如何存储的。此外，您的模型将开始训练这一事实表明错误可能不在这里。如果要确保可以使用matplotlib显示图像，以确保正确加载图像

首先，我会让你的模型尽可能简单，并且仍然有效，测试它是否仍然收敛到0.6931或其他数字。然后尝试使用不同的激活功能ie relu。另一种方法可能是使用一些批量标准化。我的理论是，在tanh函数中有非常大或非常小的值，这会导致每次输出接近0或1。这也阻止了进一步的训练，因为训练时有很小的坡度。更改为relu可能会绕过大值的问题，但可能不是小值。使用批量归一化将使您的值远离tanh输出仅为0或1的极值

网友

2楼 · 编辑于 2024-04-23 15:48:47

如果您一直在收敛到完全相同的损失，那么根据我的经验，只有一种解释——您对数据加载器的编码不正确。发生的情况是图像和标签不匹配。它试图学习纯粹的随机性。在这种情况下，它只需尽其所能输出“平均”正确答案。我怀疑0.69值来自您的数据标签，例如您有69%的1级和31%的0级

相关问题更多 >

编程相关推荐

热门问题

热门文章