简体繁体中英

What does train_on_batch() do in keras model?

原文 2018-01-31 19:41:02 0 3 python/ tensorflow/ machine-learning/ keras/ artificial-intelligence

I saw a sample of code (too big to paste here) where the author used model.train_on_batch(in, out) instead of model.fit(in, out) . The official documentation of Keras says:

Single gradient update over one batch of samples.

But I don't get it. Is it the same as fit() , but instead of doing many feed-forward and backprop steps, it does it once? Or am I wrong?

3 answers

Yes, train_on_batch trains using a single batch only and once.

While fit trains many batches for many epochs. (Each batch causes an update in weights).

The idea of using train_on_batch is probably to do more things yourself between each batch.

It is used when we want to understand and do some custom changes after each batch training.

A more precide use case is with the GANs. You have to update discriminator but during update the GAN network you have to keep the discriminator untrainable. so you first train the discriminator and then train the gan keeping discriminator untrainable. see this for more understanding: https://medium.com/datadriveninvestor/generative-adversarial-network-gan-using-keras-ce1c05cfdfd3

The method fit of the model train the model for one pass through the data you gave it, however because of the limitations in memory (especially GPU memory), we can't train on a big number of samples at once, so we need to divide this data into small piece called mini-batches (or just batchs). The methode fit of keras models will do this data dividing for you and pass through all the data you gave it.

However, sometimes we need more complicated training procedure we want for example to randomly select new samples to put in the batch buffer each epoch (eg GAN training and Siamese CNNs training ...), in this cases we don't use the fancy an simple fit method but instead we use the train_on_batch method. To use this methode we generate a batch of inputs and a batch of outputs(labels) in each iteration and pass it to this method and it will train the model on the whole samples in the batch at once and gives us the loss and other metrics calculated with respect to the batch samples.

Keras train_on_batch() does not train the model vs fit()

Using Checkpoint saving with train_on_batch in Keras

TensorFlow Keras: tf.keras.Model train_on_batch vs make_train_function - Why is one slower than the other?

Problem in sample_weight in Keras when trying train_on_batch for a model with multiple outputs

How to revert keras model to previous epoch weights after train_on_batch nan update

Why does Keras' train_on_batch produce zero loss and accuracy at the second epoch?

How do I use “reduceLROnplateau” callback method with “train_on_batch” in keras

Keras An operation has None for gradient when train_on_batch

Keras: is there sample code for train_on_batch which has history + progress?

Is Keras Sequential fit the same as several train_on_batch calls?

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Keras train_on_batch() does not train the model vs fit() Using Checkpoint saving with train_on_batch in Keras TensorFlow Keras: tf.keras.Model train_on_batch vs make_train_function - Why is one slower than the other? Problem in sample_weight in Keras when trying train_on_batch for a model with multiple outputs How to revert keras model to previous epoch weights after train_on_batch nan update Why does Keras' train_on_batch produce zero loss and accuracy at the second epoch? How do I use “reduceLROnplateau” callback method with “train_on_batch” in keras Keras An operation has None for gradient when train_on_batch Keras: is there sample code for train_on_batch which has history + progress? Is Keras Sequential fit the same as several train_on_batch calls?

Related Tags

What does train_on_batch() do in keras model?

Question

3 answers

solution1
15 ACCPTED 2018-01-31 20:21:50

solution2
3 2019-06-07 08:06:53

solution3
0 2020-06-18 14:39:06

What does train_on_batch() do in keras model?

Question

3 answers

solution1 15 ACCPTED 2018-01-31 20:21:50

solution2 3 2019-06-07 08:06:53

solution3 0 2020-06-18 14:39:06

solution1
15 ACCPTED 2018-01-31 20:21:50

solution2
3 2019-06-07 08:06:53

solution3
0 2020-06-18 14:39:06