Torch neural network does not train

Question

I have implemented a very simple neural network in the torch framework

def mlp(sizes, activation, output_activation=torch.nn.Identity):
layers = []
for j in range(len(sizes)-1):
    act = activation if j < len(sizes)-1 else output_activation
    layers += [torch.nn.Linear(sizes[j], sizes[j+1]), act()]
return torch.nn.Sequential(*layers)

In order to train a network to make regression on the function y=sin(x)

x = torch.linspace(-math.pi, math.pi, 2000, device=device, dtype=dtype)
y = torch.sin(x)

the training code is here

size = [1,20,20,1]
activation = torch.nn.ReLU
model = mlp(size, activation)

optimizer = torch.optim.SGD(model.parameters(), lr=0.002)

n_epoch = 600
mse_loss = torch.nn.MSELoss()
X = x.unsqueeze(-1)
for i in range(n_epoch):
    y_pred = model(X)
    step_loss = mse_loss(y_pred, y)
    optimizer.zero_grad()
    step_loss.backward()
    optimizer.step()

Unfortunately, the network only learn an almost constant function $y=0$. I have already tried many things

Change Hyperparameters of the network
Add mini batches in training
Change the number of epochs and learning rate

But nothing seems to work. The problem is so simple that I think there is an error in the code.

Answer 1

I am not sure if this is the main cause, but the statement

act = activation if j < len(sizes)-1 else output_activation

appears to be logically incorrect. In the loop, j can take values from 0 to len(sizes)-1 , so the condition is always true. This means that your network has a ReLU right at the end, and so can only ever give non-negative outputs. This can be corrected by changing that statement to:

act = activation if j < len(sizes)-2 else output_activation

Torch neural network does not train

Question

1 answers

solution1
2 2022-05-25 20:13:19

Torch neural network does not train

Question

1 answers

solution1 2 2022-05-25 20:13:19

solution1
2 2022-05-25 20:13:19