PyTorch Lightning：在检查点文件中包含一些张量对象

checkpoint_callback = ModelCheckpoint( monitor='val_acc', dirpath='checkpoints/', filename='{epoch:02d}-{val_acc:.2f}', save_top_k=5, mode='max', )

class SampleNet(pl.LightningModule): def __init__(self): super().__init__() self.save_hyperparameters() self.layer = torch.nn.Linear(100, 1) self.loss = torch.nn.CrossEntropy() self.some_data = None # Initialize as None def training_step(self, batch): x, t = batch out = self.layer(x) loss = self.loss(out, t) results = {'loss': loss} return results def training_epoch_end(self, outputs): self.some_data = some_tensor_object

2条回答

网友

1楼 · 编辑于 2024-04-20 05:29:22

似乎不可能直接提取参数，因为最有可能使用^{}。这种方法只提取实际视为参数的张量值。因此，在这种情况下，解决方法是将数据保存为参数（请参见docs）：

self.some_data = torch.nn.parameter.Parameter(your_data)

网友

2楼 · 编辑于 2024-04-20 05:29:22

只需将模型类挂钩on_save_checkpoint()和on_load_checkpoint()用于所有要与默认属性一起保存的对象

def on_save_checkpoint(self, checkpoint) -> None:
    "Objects to include in checkpoint file"
    checkpoint["some_data"] = self.some_data

def on_load_checkpoint(self, checkpoint) -> None:
    "Objects to retrieve from checkpoint file"
    self.some_data= checkpoint["some_data"]

See module docs

相关问题更多 >

编程相关推荐

热门问题

热门文章