How should image preprocessing and data augmentation be done for semantic segmentation?

from keras.preprocessing.image import ImageDataGenerator

datagen = ImageDataGenerator(
    rotation_range=20,        # maximum random rotation, in degrees (0–180)
    width_shift_range=0.2,    # range for random horizontal translation
    height_shift_range=0.2,   # range for random vertical translation
    shear_range=0.2,          # range for random shearing transformations
    zoom_range=0.2,           # range for random zooming inside pictures
    horizontal_flip=True,     # randomly flip half the images horizontally
    fill_mode='nearest',      # how to fill pixels created by a rotation or a width/height shift
    featurewise_center=True,
    featurewise_std_normalization=True)

datagen.fit(X_train)

1 answer

User

#1 · Posted on 2024-05-29 10:49:50

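The augmentation and preprocessing stages always depend on the problem you are solving. You should consider every augmentation that could enlarge your dataset, but the most important rule is to avoid extreme augmentations that produce training samples which could never occur in real data. If you do not expect real inputs to be horizontally flipped, do not apply horizontal flipping, because it feeds your model misleading information. Think about all the variations that can appear in your input images and try to generate new images artificially from the existing ones. Keras has many built-in functions you can use for this, but check each one to make sure it does not generate examples that are unlikely to appear in your model's inputs.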
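
For segmentation specifically, every geometric augmentation has to be applied identically to the image and to its label mask, otherwise the labels no longer line up with the pixels. The following is a minimal NumPy sketch of that idea, not the code from the question: `augment_pair`, `crop_size`, and `allow_hflip` are hypothetical names chosen for illustration, and only a flip and a crop are shown.

```python
import numpy as np


def augment_pair(image, mask, rng, crop_size, allow_hflip=True):
    """Apply the same random flip and crop to an image and its label mask.

    image: array of shape (H, W, C); mask: array of shape (H, W).
    rng: a numpy.random.Generator; crop_size: (crop_height, crop_width).
    """
    h, w = mask.shape[:2]
    ch, cw = crop_size
    if image.shape[:2] != (h, w):
        raise ValueError("image and mask must share height and width")
    if ch > h or cw > w:
        raise ValueError("crop_size must fit inside the image")

    # Flip only if flipped inputs are plausible for the task at hand.
    if allow_hflip and rng.random() < 0.5:
        image = image[:, ::-1]
        mask = mask[:, ::-1]

    # Draw one crop window and use it for both arrays.
    top = int(rng.integers(0, h - ch + 1))
    left = int(rng.integers(0, w - cw + 1))
    image = image[top:top + ch, left:left + cw]
    mask = mask[top:top + ch, left:left + cw]
    return image, mask
```

Setting `allow_hflip=False` is the code-level version of the advice above: turn off any transformation that real inputs would never show. Intensity changes such as normalization would apply to the image only, never to the mask.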

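As you said, there is no one-size-fits-all solution, because everything depends on the data. Analyze your data and build everything else around what you find.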
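
One simple way to start analyzing segmentation data is to measure how many pixels each class occupies, which shows how imbalanced the labels are. This is a small sketch under the assumption that masks are integer arrays with class ids from 0 to `num_classes - 1`; the helper name `class_pixel_fractions` is hypothetical.

```python
import numpy as np


def class_pixel_fractions(masks, num_classes):
    """Return the fraction of pixels belonging to each class across all masks."""
    counts = np.zeros(num_classes, dtype=np.int64)
    for mask in masks:
        labels = np.asarray(mask).ravel()
        per_class = np.bincount(labels, minlength=num_classes)
        if per_class.size > num_classes:
            raise ValueError("mask contains a label >= num_classes")
        counts += per_class
    total = counts.sum()
    if total == 0:
        raise ValueError("no pixels to count")
    return counts / total
```

A class that covers only a tiny fraction of the pixels is a hint that plain per-pixel losses may be dominated by the background.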

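Regarding small objects: one direction worth checking is loss functions that emphasize the target volume relative to the background. Take a look at Dice loss or Generalized Dice loss.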
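
To make the two losses concrete, here is a minimal NumPy sketch of their usual formulations. It is meant for illustration rather than training (a real model would use the differentiable ops of its framework); the function names and the smoothing constant `eps` are assumptions, and the Generalized Dice version uses the common inverse-squared-volume class weights.

```python
import numpy as np


def dice_loss(probs, target, eps=1e-6):
    """Soft Dice loss for one binary mask; probs and target share a shape."""
    probs = np.asarray(probs, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    intersection = np.sum(probs * target)
    denom = np.sum(probs) + np.sum(target)
    return 1.0 - (2.0 * intersection + eps) / (denom + eps)


def generalized_dice_loss(probs, onehot, eps=1e-6):
    """Generalized Dice loss; both arrays have the class axis last."""
    num_classes = np.shape(onehot)[-1]
    probs = np.asarray(probs, dtype=np.float64).reshape(-1, num_classes)
    onehot = np.asarray(onehot, dtype=np.float64).reshape(-1, num_classes)
    # Weight each class by the inverse of its squared pixel count,
    # so small classes contribute as much as large ones.
    weights = 1.0 / (onehot.sum(axis=0) ** 2 + eps)
    numer = np.sum(weights * np.sum(probs * onehot, axis=0))
    denom = np.sum(weights * np.sum(probs + onehot, axis=0))
    return 1.0 - 2.0 * numer / (denom + eps)
```

With these definitions, predicting pure background on an image that contains a few foreground pixels still gives a Dice loss close to 1, even though almost every pixel is classified correctly. That is why these losses help with small objects.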
