Keras 密集输入大小错误，展平返回（无，无）

Question

I'm trying to implement this model to generate midi music but I'm getting an error我正在尝试实现此模型以生成 MIDI 音乐，但出现错误

The last dimension of the inputs to `Dense` should be defined. Found `None`.

Here's my code这是我的代码

model = Sequential()
model.add(Bidirectional(LSTM(512, return_sequences=True), input_shape=(network_input.shape[1], network_input.shape[2])))
model.add(SeqSelfAttention(attention_activation='sigmoid'))
model.add(Dropout(0.3))
model.add( LSTM(512, return_sequences=True))
model.add(Dropout(0.3))
model.add(Flatten())
model.summary()
model.add(Dense(note_variants_count))
model.compile(loss='categorical_crossentropy', optimizer='adam')

And here's the summary before the dense layer这是密集层之前的总结


Model: "sequential_17"
_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
bidirectional_19 (Bidirectio (None, 100, 1024)         2105344   
_________________________________________________________________
seq_self_attention_20 (SeqSe (None, None, 1024)        65601     
_________________________________________________________________
dropout_38 (Dropout)         (None, None, 1024)        0         
_________________________________________________________________
lstm_40 (LSTM)               (None, None, 512)         3147776   
_________________________________________________________________
dropout_39 (Dropout)         (None, None, 512)         0         
_________________________________________________________________
flatten_14 (Flatten)         (None, None)              0         
=================================================================
Total params: 5,318,721
Trainable params: 5,318,721
Non-trainable params: 0
_________________________________________________________________

I think that the Flatten layer is causing the problem but I have no idea why it's returning a (None, None) shape.我认为 Flatten 层导致了问题，但我不知道为什么它返回 (None, None) 形状。

Answer 1

您必须为 Flatten 层提供除批次维度之外的所有维度。

Answer 2

Using RNN and 'return_sequence'使用 RNN 和“return_sequence”

For RNNs like LSTM, there is an option to either return the whole sequence or just the results.对于像 LSTM 这样的 RNN，可以选择返回整个序列或仅返回结果。 In this case you want just the results.在这种情况下，您只需要结果。 Changing just this one line只改变这一行

model.add(Dropout(0.3))
model.add(LSTM(512, return_sequences=False)) # <== change to False
model.add(Dropout(0.3))
model.add(Flatten())
model.summary()

Returns the following network shape from summary从摘要中返回以下网络形状

_________________________________________________________________
lstm_4 (LSTM)                (None, 512)               3147776   
_________________________________________________________________
dropout_3 (Dropout)          (None, 512)               0         
_________________________________________________________________
flatten_1 (Flatten)          (None, 512)               0         
=================================================================

Notice the reduction of rank of the output tensor from Rank 3 to Rank 2. This is because this output is just that, the output, and not the whole sequence under consideration with all the hidden states.注意输出张量的秩从 Rank 3 降低到 Rank 2。这是因为这个输出只是那个输出，而不是考虑到所有隐藏状态的整个序列。

Keras 密集输入大小错误，展平返回（无，无）

问题描述

2 个解决方案

解决方案1
0 2020-11-09 08:25:12

解决方案2
0 2020-11-24 20:46:04

Using RNN and 'return_sequence'使用 RNN 和“return_sequence”

Keras 密集输入大小错误，展平返回（无，无）

问题描述

2 个解决方案

解决方案1 0 2020-11-09 08:25:12

解决方案2 0 2020-11-24 20:46:04

Using RNN and 'return_sequence'使用 RNN 和“return_sequence”

解决方案1
0 2020-11-09 08:25:12

解决方案2
0 2020-11-24 20:46:04