
Using nn.ModuleList over Python list dramatically slows down training

I'm training a very simple model that takes the number of hidden layers as a parameter. I originally stored these hidden layers in a vanilla Python list [], but when I converted that list to an nn.ModuleList, training slowed down dramatically, by at least an order of magnitude!

AdderNet

class AdderNet(nn.Module):
    def __init__(self, num_hidden, hidden_width):
        super(AdderNet, self).__init__()
        self.relu = nn.ReLU()

        self.hiddenLayers = []
        self.inputLayer = nn.Linear(2, hidden_width)
        self.outputLayer = nn.Linear(hidden_width, 1)

        for i in range(num_hidden):
            self.hiddenLayers.append(nn.Linear(hidden_width, hidden_width))

        self.hiddenLayers = nn.ModuleList(self.hiddenLayers)  # <--- causes DRAMATIC slowdown!

    def forward(self, x):
        out = self.inputLayer(x)
        out = self.relu(out)

        for layer in self.hiddenLayers:
            out = layer(out)
            out = self.relu(out)

        return self.outputLayer(out)

Training

for epoch in range(num_epochs):
    for i in range(0,len(data)):
        out = model(data[i].x)  # call the module itself rather than .forward() so hooks run
        loss = lossFunction(out, data[i].y)

        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

That's because when you use a normal Python list, the parameters are not added to the model's parameter list, but when you use an nn.ModuleList, they are. So in the original scenario you were never actually training the hidden layers, which is why it was faster. (Print out model.parameters() in each case and see what happens!)
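The difference is easy to verify directly. The sketch below (hypothetical module names, small layer sizes chosen for illustration) contrasts the two storage choices: layers held in a plain Python list are invisible to parameters(), so the optimizer never receives them, while nn.ModuleList registers each layer as a submodule.

```python
import torch.nn as nn

class PlainList(nn.Module):
    def __init__(self):
        super().__init__()
        # Plain Python list: these Linear layers are NOT registered as submodules
        self.layers = [nn.Linear(4, 4) for _ in range(3)]

class Registered(nn.Module):
    def __init__(self):
        super().__init__()
        # nn.ModuleList: each layer is registered, so its parameters are tracked
        self.layers = nn.ModuleList(nn.Linear(4, 4) for _ in range(3))

print(len(list(PlainList().parameters())))   # 0 -> optimizer would get an empty parameter list
print(len(list(Registered().parameters())))  # 6 -> a weight and a bias for each of the 3 layers
```

With the plain list, optimizer.step() has nothing to update for the hidden layers, which is exactly why the original run appeared faster: most of the network was frozen.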
