用于LSTM模型的Keras多输出数据重塑

Question

我有一个包含多个输出的Keras LSTM模型。 该模型定义如下：

outputs=[]

main_input = Input(shape= (seq_length,feature_cnt), name='main_input')
lstm = LSTM(32,return_sequences=True)(main_input)
for _ in range((output_branches)): #output_branches is the number of output branches of the model
    prediction = LSTM(8,return_sequences=False)(lstm)
    out = Dense(1)(prediction)
    outputs.append(out)

model = Model(inputs=main_input, outputs=outputs)
model.compile(optimizer='rmsprop',loss='mse')

重塑输出数据时出现问题。 重塑输出数据的代码是：

y=y.reshape((len(y),output_branches,1))

我收到以下错误：

ValueError：检查模型目标时出错：传递给模型的Numpy数组列表不是模型预期的大小。 预计会看到5个数组，但获得了以下1个数组的列表：[array（[[[[0.29670931]，[0.16652206]，[0.25114482]，[0.36952324]，[0.09429612]]，
  [[0.16652206], [0.25114482], [0.36952324], [0.09429612],... 

如何正确调整输出数据的形状？

Answer 1

这取决于y最初的结构。 在这里，我假设y是批处理中每个序列的单值标签。

当有多个输入/输出时， model.fit()期望给出相应的输入/输出列表。 在下面的完全可重现的示例中np.split(y, output_branches, axis=-1)确实做到了这一点-对于每个批次，将单个输出列表拆分为一个单独的输出列表，其中每个输出（在这种情况下）为1个元素列表：

import tensorflow as tf
import numpy as np

tf.enable_eager_execution()

batch_size = 100
seq_length = 10
feature_cnt = 5
output_branches = 3

# Say we've got:
# - 100-element batch
# - of 10-element sequences
# - where each element of a sequence is a vector describing 5 features.
X = np.random.random_sample([batch_size, seq_length, feature_cnt])

# Every sequence of a batch is labelled with `output_branches` labels.
y = np.random.random_sample([batch_size, output_branches])
# Here y.shape() == (100, 3)

# Here we split the last axis of y (output_branches) into `output_branches` separate lists.
y = np.split(y, output_branches, axis=-1)
# Here y is not a numpy matrix anymore, but a list of matrices.
# E.g. y[0].shape() == (100, 1); y[1].shape() == (100, 1) etc...

outputs = []

main_input = tf.keras.layers.Input(shape=(seq_length, feature_cnt), name='main_input')
lstm = tf.keras.layers.LSTM(32, return_sequences=True)(main_input)
for _ in range(output_branches):
    prediction = tf.keras.layers.LSTM(8, return_sequences=False)(lstm)
    out = tf.keras.layers.Dense(1)(prediction)
    outputs.append(out)

model = tf.keras.models.Model(inputs=main_input, outputs=outputs)
model.compile(optimizer='rmsprop', loss='mse')

model.fit(X, y)

由于未指定数据的外观，因此您可能需要使用轴。

编辑：由于作者正在从官方渠道中寻找答案，因此这里提到了（尽管没有明确提及，它仅提及数据集应产生的内容，因此-什么样的输入结构model.fit()期望）：

当使用数据集对象调用fit时，它应产生一个列表元组，例如([title_data, body_data, tags_data], [priority_targets, dept_targets])或一个字典元组，例如({'title': title_data, 'body': body_data, 'tags': tags_data}, {'priority': priority_targets, 'department': dept_targets}) 。

Answer 2

由于您的输出量等于output_branches ，因此您的输出数据必须是具有相同数量数组的list 。

基本上，如果输出数据在中间维度上（如您的reshape所示），则：

y = [ y[:,i] for i in range(output_branches)]

用于LSTM模型的Keras多输出数据重塑

问题描述

2 个解决方案

解决方案1
3 2019-07-29 00:16:44

解决方案2
1 2019-08-03 01:59:53

用于LSTM模型的Keras多输出数据重塑

问题描述

2 个解决方案

解决方案1 3 2019-07-29 00:16:44

解决方案2 1 2019-08-03 01:59:53

解决方案1
3 2019-07-29 00:16:44

解决方案2
1 2019-08-03 01:59:53