In PyTorch, how do I train a model with two or more outputs?
output_1, output_2 = model(x)
loss = cross_entropy_loss(output_1, target_1)
loss.backward()
optimizer.step()
loss = cross_entropy_loss(output_2, target_2)
loss.backward()
optimizer.step()
But when I run this code, I get this error:
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.FloatTensor [1, 4]], which is output 0 of TBackward, is at version 2; expected version 1 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).
So I would really like to know what I should do to train a model with 2 or more outputs.
The whole premise of PyTorch (and other DL frameworks) is backpropagation of the gradient of a scalar loss function. In your code, the first `optimizer.step()` updates the model's parameters in place between the two `backward()` calls, invalidating tensors that the second backward pass still needs, which is exactly what the error message is complaining about.
In your case, you have a vector (dim=2) loss function:
[cross_entropy_loss(output_1, target_1), cross_entropy_loss(output_2, target_2)]
You need to decide how to combine these two losses into a single scalar loss.
For example:
weight = 0.5 # relative weight
loss = weight * cross_entropy_loss(output_1, target_1) + (1. - weight) * cross_entropy_loss(output_2, target_2)
# now loss is a scalar
loss.backward()
optimizer.step()
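Putting it together, here is a minimal runnable sketch of a training step with a two-headed model. The model architecture (`TwoHeadNet`, a shared linear backbone with two linear heads) and all tensor shapes are hypothetical stand-ins, not from the question; the point is the single combined scalar loss and single `backward()`/`step()` per iteration.

```python
import torch
import torch.nn as nn

class TwoHeadNet(nn.Module):
    """Hypothetical model with one shared backbone and two output heads."""
    def __init__(self, in_dim=8, n_classes=4):
        super().__init__()
        self.backbone = nn.Linear(in_dim, 16)
        self.head_1 = nn.Linear(16, n_classes)  # produces output_1
        self.head_2 = nn.Linear(16, n_classes)  # produces output_2

    def forward(self, x):
        h = torch.relu(self.backbone(x))
        return self.head_1(h), self.head_2(h)

model = TwoHeadNet()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
cross_entropy_loss = nn.CrossEntropyLoss()

# dummy batch: 5 samples, 8 features, 4-class targets for each head
x = torch.randn(5, 8)
target_1 = torch.randint(0, 4, (5,))
target_2 = torch.randint(0, 4, (5,))

weight = 0.5  # relative weight of the first task
for _ in range(3):
    optimizer.zero_grad()
    output_1, output_2 = model(x)
    # combine both losses into ONE scalar before backpropagating
    loss = (weight * cross_entropy_loss(output_1, target_1)
            + (1. - weight) * cross_entropy_loss(output_2, target_2))
    loss.backward()   # single backward pass on the combined scalar
    optimizer.step()  # single parameter update per iteration
```

Because the parameters are only updated once per forward pass, no tensor in the graph is modified in place before its gradient is computed, so the `RuntimeError` disappears.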