
PyTorch: What's the purpose of saving the optimizer state?

PyTorch is capable of saving and loading the state of an optimizer. An example is shown in the PyTorch tutorial. I'm currently just saving and loading the model state, but not the optimizer. So what's the point of saving and loading the optimizer state, besides not having to remember the optimizer's params such as the learning rate? And what's contained in the optimizer state?

I believe that saving the optimizer's state is an important aspect of logging and reproducibility. It stores many details about the optimizer's settings: the kind of optimizer used, learning rate, weight decay, type of scheduler used (I find this very useful personally), etc. Moreover, it can be used in a similar fashion to loading pre-trained weights into your current model via .load_state_dict(), so that you can pass a stored optimizer setting/configuration into your current optimizer using the same method: optimizer.load_state_dict(some_good_optimizer.state_dict()).
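
As a small sketch of that last point (hypothetical model and optimizer names, just to illustrate the mechanism): a freshly constructed optimizer can take over the settings of a previously tuned one through the same state_dict round-trip.

    import torch

    model = torch.nn.Linear(4, 2)  # hypothetical toy model
    some_good_optimizer = torch.optim.SGD(model.parameters(), lr=0.05, momentum=0.9)

    # A fresh optimizer over the same parameters, then copy the stored configuration into it
    optimizer = torch.optim.SGD(model.parameters(), lr=0.001)
    optimizer.load_state_dict(some_good_optimizer.state_dict())

    print(optimizer.param_groups[0]['lr'], optimizer.param_groups[0]['momentum'])  # 0.05 0.9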

You should save the optimizer state if you want to resume model training later. This is especially true if Adam is your optimizer. Adam is an adaptive learning rate method, which means it computes individual learning rates for various parameters; those per-parameter statistics live in the optimizer state, so they are lost if you only save the model.
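
To see concretely what that state contains, here is a minimal sketch with a hypothetical toy model; after one optimizer step, Adam's state_dict holds both its hyperparameters and its per-parameter running moments.

    import torch

    model = torch.nn.Linear(4, 2)  # hypothetical toy model
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

    # One training step so Adam populates its per-parameter running statistics
    loss = model(torch.randn(8, 4)).sum()
    loss.backward()
    optimizer.step()

    sd = optimizer.state_dict()
    print(sd.keys())              # dict_keys(['state', 'param_groups'])
    print(sd['param_groups'][0])  # lr, betas, eps, weight_decay, ...
    print(sd['state'][0].keys())  # step, exp_avg, exp_avg_sq (Adam's moment estimates)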

It is not required if you only want to use the saved model for inference.

However, it's best practice to save both the model state and the optimizer state. You can also save the loss history and other running metrics if you want to plot them later.

I'd do it like this:

    torch.save({
            'epoch': epochs,
            'model_state_dict': model.state_dict(),
            'optimizer_state_dict': optimizer.state_dict(),
            'train_loss_history': loss_history,
            }, PATH)
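
And the matching load-side sketch to resume training later (assuming the same PATH and variable names as above):

    checkpoint = torch.load(PATH)
    model.load_state_dict(checkpoint['model_state_dict'])
    optimizer.load_state_dict(checkpoint['optimizer_state_dict'])
    epochs = checkpoint['epoch']
    loss_history = checkpoint['train_loss_history']

    model.train()  # switch back to training mode before resuming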
