Keras train_on_batch() does not train the model vs fit()
I have a dataset that is too large to fit in RAM, so I opted to use train_on_batch to train my model incrementally. To test whether this approach works, I took a subset of my large dataset to run some preliminary tests.

However, I have been having issues training the model: the accuracy gets stuck at 10% when training with train_on_batch(). With fit(), I get an accuracy of 95% at 40 epochs. I have also tried fit_generator() and encountered similar issues.
Using fit():
results = model.fit(x_train,y_train,batch_size=128,nb_epoch=40)
Using train_on_batch():
# 386 has been chosen so that each batch size is 128
splitSize = len(y_train) // 386
for j in range(20):
    print('epoch: ' + str(j) + ' ----------------------------')
    np.random.shuffle(x_train)
    np.random.shuffle(y_train)
    xb = np.array_split(x_train, 386)
    yb = np.array_split(y_train, 386)
    sumAcc = 0
    index = list(range(386))
    random.shuffle(index)
    for i in index:
        results = model.train_on_batch(xb[i], yb[i])
        sumAcc += results[1]
    print(sumAcc / 386)
The shuffle you are using is incorrect, because y_train no longer matches x_train after the shuffle: calling np.random.shuffle on each array separately shuffles them in different orders, so the labels are decoupled from their samples and the model trains on randomly mismatched pairs. Shuffle a single index array instead and apply it to both:
length = x_train.shape[0]
idxs = np.arange(0, length)
np.random.shuffle(idxs)
x_train = x_train[idxs]
y_train = y_train[idxs]
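To illustrate, here is a minimal self-contained sketch of the fix applied to the original batching loop. The arrays are toy stand-ins for the real dataset (an assumption for demonstration), and the model.train_on_batch call is left as a comment since no compiled model is defined here:

```python
import numpy as np

# Toy data standing in for the real dataset: row i of x_train starts
# with 2*i, and label y_train[i] == i, so we can verify pairing later.
x_train = np.arange(20, dtype=float).reshape(10, 2)
y_train = np.arange(10)

# Shuffle ONE index array and apply it to both, so sample/label pairs
# stay aligned (unlike shuffling each array independently).
idxs = np.arange(x_train.shape[0])
np.random.shuffle(idxs)
x_train = x_train[idxs]
y_train = y_train[idxs]

# The pairing survives the shuffle: row k still matches label k.
assert all(x_train[k, 0] == 2 * y_train[k] for k in range(10))

# Aligned batches, ready for incremental training.
xb = np.array_split(x_train, 5)
yb = np.array_split(y_train, 5)
for x_batch, y_batch in zip(xb, yb):
    pass  # model.train_on_batch(x_batch, y_batch)
```

Because both arrays are indexed by the same permutation, every batch fed to train_on_batch keeps each sample next to its own label, which is what the independent shuffles in the question were breaking.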