PyTorch：nn.LSTM 對同一批中的相同輸入輸出不同的結果

Question

我嘗試使用torch.nn.LSTM實現兩層雙向 LSTM。

我做了一個玩具示例：一批 3 個張量，它們完全相同（見下面我的代碼）。 我希望 BiLSTM 的輸出在批次維度上是相同的，即out[:,0,:] == out[:,1,:] == out[:, 2, :] 。

但情況似乎並非如此。 根據我的實驗，20%~40% 的時間，輸出是不一樣的。 所以我想知道我哪里錯了。

# Python 3.6.6, Pytorch 0.4.1
import torch

def test(hidden_size, in_size):
    seq_len, batch = 4, 3
    bilstm = torch.nn.LSTM(input_size=in_size, hidden_size=hidden_size, 
                            num_layers=2, bidirectional=True)

    # create a batch with 3 exactly the same tensors
    a = torch.rand(seq_len, 1, in_size)  # (seq_len, 1, in_size)
    x = torch.cat((a, a, a), dim=1)

    out, _ = bilstm(x)  # (seq_len, batch, n_direction * hidden_size)

    # expect the output should be the same along the batch dimension
    assert torch.equal(out[:, 0, :], out[:, 1, :])  
    assert torch.equal(out[:, 1, :], out[:, 2, :])

if __name__ == '__main__':
    count, total = 0, 0
    for h_size in range(1, 51):
        for in_size in range(1, 51):
            total += 1
            try:
                test(h_size, in_size)
            except AssertionError:
                count += 1
    print('percentage of assertion error:', count / total)

Answer 1

使您困惑的是浮點精度。 浮點運算有些不准確，並且可能相差很小。請改用以下方法：

torch.set_default_dtype(torch.float64)

然后，您將看到它們在批處理暗處應該是相同的。

感謝您糾正一些英語語法錯誤。

Answer 2

我對GRU有同樣的問題，以下為我解決了這個問題。
在測試之前設置手動種子並將模型設置為評估模式：

torch.manual_seed(42)
bilstm.eval()  # or: bilstm.train(false)

來源： LSTMcell 和 LSTM 返回不同的輸出

此外，我必須在每次調用模型之前（在測試期間）設置相同的種子。 在你的情況下：

torch.manual_seed(42)
out, _ = bilstm(x)  # (seq_len, batch, n_direction * hidden_size)

PyTorch：nn.LSTM 對同一批中的相同輸入輸出不同的結果

問題描述

2 個解決方案

解決方案1
0 2019-02-20 21:06:32

解決方案2
0 2021-08-11 13:32:19

PyTorch：nn.LSTM 對同一批中的相同輸入輸出不同的結果

問題描述

2 個解決方案

解決方案1 0 2019-02-20 21:06:32

解決方案2 0 2021-08-11 13:32:19

解決方案1
0 2019-02-20 21:06:32

解決方案2
0 2021-08-11 13:32:19