计算与数据相对应的光流torch.utils.data公司.数据加载

1条回答

网友

1楼 · 发布于 2024-04-20 04:25:07

您必须使用自己的数据加载器类来动态计算光流。这个类的思想是获取包含视频序列的当前和下一帧文件名的文件名元组列表（curr image，next image），而不是简单的文件名列表。这允许在填充文件名列表后获得正确的图像对。下面的代码为您提供了一个非常简单的示例实现：

from torch.utils.data import Dataset
import cv2
import numpy as np

class FlowDataLoader(Dataset):
def __init__(self,
             filename_tuples):

    random.shuffle(filename_tuples)
    self.lines = filename_tuples

def __getitem__(self, index):
    img_filenames = self.lines[index]
    curr_img = cv2.cvtColor(cv2.imread(img_filenames[0]), cv2.BGR2GRAY)
    next_img = cv2.cvtColor(cv2.imread(img_filenames[1]), cv2.BGR2GRAY)
    flow = cv2.calcOpticalFlowFarneback(curr_img, next_img, ... [parameter])

    # code for loading the class label
    # label = ...
    #
    # this is a very simple data normalization
    curr_img= curr_img.astype(np.float32) / 255
    next_img = next_img .astype(np.float32) / 255
    # you can return the image and flow seperatly 
    return curr_img, flow, label
    # or stacked as follows
    # return np.dstack((curr_img,flow)), label

# at this place you need a function that create a list of training sample filenames
# that look like this
training_filelist = [(img000.png, img001.png), 
                     (img001.png, img002.png),
                     (img002.png, img003.png)] 

training_data = FlowDataLoader(training_filelist)
train_loader = torch.utils.data.DataLoader(
        training_data,
        batch_size=8,
        shuffle=True,
        num_workers=4,
        pin_memory=True)

这只是FlowDataLoader的一个简单示例。理想情况下，这应该扩展，以便当前图像输出包含标准化的rgb值，光流也被标准化和剪裁。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章

计算与数据相对应的光流torch.utils.data公司.数据加载

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >