model.parameters() not updating in Linear Regression with Pytorch

I'm a newbie in Deep Learning with PyTorch. I am using the Housing Prices dataset from Kaggle here. I tried sampling with the first 50 rows. But model.parameters() is not updating as I perform the training. Can anyone help?

import torch
import numpy as np
from torch.utils.data import TensorDataset
import torch.nn as nn
from torch.utils.data import DataLoader
import torch.nn.functional as F

inputs = np.array(label_X_train[:50])
targets = np.array(train_y[:50])

# Tensors
inputs = torch.from_numpy(inputs)
targets = torch.from_numpy(targets)
targets = targets.view(-1, 1)
train_ds = TensorDataset(inputs, targets)
batch_size = 5
train_dl = DataLoader(train_ds, batch_size, shuffle=True)

model = nn.Linear(10, 1)
# Define Loss func
loss_fn = F.mse_loss
# Optimizer
opt = torch.optim.SGD(model.parameters(), lr = 1e-5)


num_epochs = 100
model.train()         
for epoch in range(num_epochs):
    # Train with batches of data
    for xb, yb in train_dl:

        # 1. Generate predictions
        pred = model(xb.float())

        # 2. Calculate loss
        loss = loss_fn(pred, yb.float())
    
        # 3. Compute gradients
        loss.backward()

        # 4. Update parameters using gradients
        opt.step()

        # 5. Reset the gradients to zero
        opt.zero_grad()
     
    if (epoch+1) % 10 == 0:
        print('Epoch [{}/{}], Loss: {:.4f}'.format(epoch +
                                                   1, num_epochs, 
                                                   loss.item()))  

The weights do update, but you weren't capturing them correctly. model.weight.data is a torch tensor, but a variable name is only a reference, so setting w = model.weight.data does not create a copy, just another reference to the same object. Hence changing model.weight.data changes w too.

So setting w = model.weight.data and w_new = model.weight.data in different parts of the loop means you're assigning two references to the same object, making their values equal at all times.
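To illustrate the reference semantics, here is a minimal standalone sketch (not taken from your code; model here is a fresh nn.Linear):

import torch
import torch.nn as nn

model = nn.Linear(10, 1)

w = model.weight.data                # just another reference to the same tensor
w_copy = model.weight.data.clone()   # an independent copy

# Simulate an in-place update such as opt.step()
model.weight.data += 1.0

print(torch.equal(w, model.weight.data))       # True  -- w changed along with the weight
print(torch.equal(w_copy, model.weight.data))  # False -- the clone kept the old values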

To verify that the model weights are changing, either print(model.weight.data) before and after the loop (since you have a single linear layer with 10 parameters, that is still practical) or simply set w = model.weight.data.clone(). In that case the output of w == model.weight.data will be:

tensor([[False, False, False, False, False, False, False, False, False, False]])

Every False entry means the corresponding weight no longer matches the snapshot, i.e. it did change. Here's a full example that shows your weights are changing:

import torch
import numpy as np
from torch.utils.data import TensorDataset
import torch.nn as nn
from torch.utils.data import DataLoader
import torch.nn.functional as F

inputs = np.random.rand(50, 10)
targets = np.random.randint(0, 2, 50)

# Tensors
inputs = torch.from_numpy(inputs)
targets = torch.from_numpy(targets)
targets = targets.view(-1, 1)
train_ds = TensorDataset(inputs, targets)  # keep targets as (N, 1) so yb matches pred's shape in mse_loss
batch_size = 5
train_dl = DataLoader(train_ds, batch_size, shuffle=True)

model = nn.Linear(10, 1)
# Define Loss func
loss_fn = F.mse_loss
# Optimizer
opt = torch.optim.SGD(model.parameters(), lr = 1e-1)


num_epochs = 100
model.train()
w = model.weight.data.clone()         
for epoch in range(num_epochs):
    # Train with batches of data
    for xb, yb in train_dl:

        # 1. Generate predictions
        pred = model(xb.float())

        # 2. Calculate loss
        loss = loss_fn(pred, yb.float())
    
        # 3. Compute gradients
        loss.backward()

        # 4. Update parameters using gradients
        opt.step()

        # 5. Reset the gradients to zero
        opt.zero_grad()

    if (epoch+1) % 10 == 0:
        print('Epoch [{}/{}], Loss: {:.4f}'.format(epoch +
                                                   1, num_epochs, 
                                                   loss.item()))
print(w == model.weight.data)
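As a side note, the same check generalizes to every parameter (weight and bias). The following is a minimal sketch, not part of the original answer, that snapshots all parameters with detach().clone() and compares them with torch.allclose after one optimizer step on random data:

import torch
import torch.nn as nn
import torch.nn.functional as F

model = nn.Linear(10, 1)
opt = torch.optim.SGD(model.parameters(), lr=1e-1)

# Snapshot every parameter before training
before = [p.detach().clone() for p in model.parameters()]

# One training step on random data
x = torch.randn(5, 10)
y = torch.randn(5, 1)
loss = F.mse_loss(model(x), y)
loss.backward()
opt.step()
opt.zero_grad()

# Prints False for each parameter once it has actually moved
for old, new in zip(before, model.parameters()):
    print(torch.allclose(old, new.detach()))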
