
Model.train() with pre-trained weights makes results all 0 while model.eval() is fine

Original question posted 2021-06-14 03:56:19. Tags: python, pytorch, batch-normalization

Thanks for your attention on this matter.

I want to continue training a model starting from its pre-trained weights. When I evaluate this pre-trained model with model.eval(), everything is fine and the model generates reasonable results. But when I want to train it further and set the mode with model.train(), the problem occurs: during the forward pass after the model.train() statement (batchsize = 1), all generated results are zero.

Any ideas on why this happens?

Thanks a lot.

1 Answer

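Batch normalization works by normalizing all activations according to the mean and variance estimated from the current batch.
When batchsize=1, what do you expect these values to be?

Increase your batchsize and see if the problem still occurs.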
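To make the point about batch statistics concrete, here is a minimal sketch in plain Python of the normalization step for a single feature. It is an illustration of the formula only, not PyTorch's actual implementation: the function names `batch_norm_train` and `batch_norm_eval`, the sample activations, and the running statistics are all hypothetical, and the learnable scale and shift are left out.

```python
import math


def batch_norm_train(batch, eps=1e-5):
    """Normalize one feature using the statistics of the current batch
    (what a batch-norm layer does in training mode)."""
    mean = sum(batch) / len(batch)
    var = sum((x - mean) ** 2 for x in batch) / len(batch)  # biased variance
    return [(x - mean) / math.sqrt(var + eps) for x in batch]


def batch_norm_eval(batch, running_mean, running_var, eps=1e-5):
    """Normalize one feature using stored running statistics
    (what a batch-norm layer does in eval mode)."""
    return [(x - running_mean) / math.sqrt(running_var + eps) for x in batch]


# Hypothetical values, chosen only for illustration.
single = [3.7]

# Batch of one: the mean equals the sample and the variance is 0,
# so the normalized value is 0 no matter what the input was.
print(batch_norm_train(single))

# Eval mode uses stored statistics, so the same input stays informative.
print(batch_norm_eval(single, running_mean=1.0, running_var=4.0))

# A larger batch gives meaningful batch statistics in training mode.
print(batch_norm_train([3.7, 1.2, -0.5, 2.0]))
```

With a single value per feature, the training-mode output collapses to zero for any input, which matches the symptom in the question. In PyTorch the per-channel statistics are also pooled over spatial positions, so the exact behavior depends on the input shape.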

Related questions

Which PyTorch modules are affected by model.eval() and model.train()?

The differences of BatchNorm layer backpropagation at mode of model.train() and model.eval() in Pytorch?

How to add additional classes to a pre-trained object detection model and train it to detect all of the classes (pre-trained + new)?

How to know which scopes to exclude or to train when fine tuning a pre-trained model with TensorFlow slim?

Using a pre-trained model to train a new model in tensor flow

How to access and visualize the weights in a pre-trained TensorFlow 2 model?

Using pre-trained weights in Alexnet model in Keras

Cannot Train Keras Pre-trained Model in Tensorflow Estimator

Train new dataset on transfer learning pre-trained model

What makes a pre-trained model in pytorch misclassify an image


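Notice: technical posts on this site are licensed under CC BY-SA 4.0. If you repost, please credit this site's URL or the original address. For any questions, contact: yoyou2525@163.com.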


粤ICP备18138465号 © 2020-2024 STACKOOM.COM