How to transform output of NN, while still being able to train?
I have a neural network which outputs output. I want to transform output before the loss and backpropagation happen.
Here is my general code:
with torch.set_grad_enabled(training):
    outputs = net(x_batch[:, 0], x_batch[:, 1])  # the prediction of the NN
    # My issue is here:
    outputs = transform_torch(outputs)
    loss = my_loss(outputs, y_batch)

    if training:
        scheduler.step()
        loss.backward()
        optimizer.step()
Following the advice in How to transform output of neural network and still train?, I have a transformation function which I put my output through:
def transform_torch(predictions):
    new_tensor = []
    for i in range(int(len(predictions))):
        arr = predictions[i]
        a = arr.clone().detach()
        # My transformation, which results in a positive first element, and
        # the other elements represent decrements of the first positive element.
        b = torch.negative(a)
        b[0] = abs(b[0])
        new_tensor.append(torch.cumsum(b, dim=0))
        # new_tensor[i].requires_grad = True
    new_tensor = torch.stack(new_tensor, 0)
    return new_tensor
Note: In addition to clone().detach(), I also tried the methods described in Pytorch preferred way to copy a tensor, with similar results.
My problem is that no training actually happens with this transformed tensor.
If I try to modify the tensor in-place (e.g., directly modify arr), then Torch complains that I can't modify a tensor in-place with a gradient attached to it.
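For reference, here is a minimal repro of that kind of error, constructed for illustration (not taken from the original post):

import torch

a = torch.randn(3, requires_grad=True)
a[0] = a[0].abs()
# RuntimeError: a leaf Variable that requires grad is being used in an in-place operation.

(For non-leaf tensors inside a graph, the complaint instead surfaces at backward() time, as "one of the variables needed for gradient computation has been modified by an inplace operation".)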
Any suggestions?
Calling detach on your predictions stops gradient propagation to your model. Nothing you do after that can change your parameters.
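A quick toy illustration of why (made up here, not the asker's code): detach returns a tensor that is cut off from the graph, so anything computed from it is invisible to autograd:

import torch

x = torch.randn(3, requires_grad=True)
y = (x * 2).detach()    # y is severed from the graph
print(y.requires_grad)  # False -- gradients can no longer reach x through y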
How about modifying your code to avoid this:
def transform_torch(predictions):
    # Keep the first element positive, negate the rest, then cumsum --
    # all differentiable ops, so no clone/detach is needed.
    b = torch.cat([predictions[:, :1, ...].abs(), -predictions[:, 1:, ...]], dim=1)
    new_tensor = torch.cumsum(b, dim=1)
    return new_tensor
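Because the transformed tensor is built entirely from differentiable ops (abs, negation, cat, cumsum), autograd can see through it. A quick sanity check with dummy data, assuming the function above:

import torch

predictions = torch.randn(4, 3, requires_grad=True)
out = transform_torch(predictions)
out.sum().backward()                 # backprop through cumsum/cat/abs
print(predictions.grad is not None)  # True -- the graph is intact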
How about extracting the grad from the output tensor, with something like

grad = output.grad

and, after the transformation, assigning the same gradient to the new tensor? See the sketch below.
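One way to read this suggestion, sketched under the assumption that the transform has to break the graph (if it is differentiable, as in the answer above, none of this is needed): detach, run the transform on a fresh leaf, then hand the resulting gradient back to the original output via backward(gradient=...). The model and loss here are stand-ins, and transform_torch is the version from the answer above:

import torch

model = torch.nn.Linear(4, 3)                     # stand-in for the asker's net
outputs = model(torch.randn(8, 4))

detached = outputs.detach().requires_grad_(True)  # fresh leaf; the graph is cut here
transformed = transform_torch(detached)           # transform from the answer above
loss = transformed.pow(2).mean()                  # stand-in loss
loss.backward()                                   # fills detached.grad
outputs.backward(detached.grad)                   # route that gradient back into the model
print(model.weight.grad is not None)              # True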