Slow training of a BERT model with Hugging Face
I am training a binary classifier using a BERT model implemented with the Hugging Face library:
from transformers import TrainingArguments

training_args = TrainingArguments(
    "deleted_tweets_trainer",      # output directory
    num_train_epochs = 1,
    #logging_steps=100,
    evaluation_strategy = 'steps', # evaluate every `eval_steps` (defaults to 500)
    remove_unused_columns = True
)
I am using a Colab TPU, but training still takes very long: an estimated 38 hours for 60k cleaned tweets.
Is there any way to optimise the training?
You are currently evaluating every 500 steps and have a training and eval batch size of 8.
Depending on your current memory consumption, you can increase the batch sizes (the eval batch size can grow much more, since evaluation consumes less memory than training), as in the sketch below:
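A minimal sketch of the question's arguments with larger batches; per_device_train_batch_size and per_device_eval_batch_size are standard TrainingArguments parameters, but the values 16 and 128 are illustrative starting points, not tuned numbers:

from transformers import TrainingArguments

training_args = TrainingArguments(
    "deleted_tweets_trainer",
    num_train_epochs = 1,
    evaluation_strategy = 'steps',
    remove_unused_columns = True,
    per_device_train_batch_size = 16,  # was 8; raise until you approach out-of-memory
    per_device_eval_batch_size = 128   # no gradients kept in eval, so much larger batches fit
)

Larger batches mean fewer optimiser steps per epoch, which usually shortens wall-clock time as long as the device memory holds.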
If it fits your use case, you can also increase the number of steps between evaluations, as shown below.
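For example, setting eval_steps (a standard TrainingArguments parameter; with evaluation_strategy='steps' it otherwise falls back to the 500-step default) makes the full evaluation pass interrupt training less often. The value 2000 is illustrative:

from transformers import TrainingArguments

training_args = TrainingArguments(
    "deleted_tweets_trainer",
    num_train_epochs = 1,
    evaluation_strategy = 'steps',
    eval_steps = 2000,             # evaluate every 2000 steps instead of every 500
    remove_unused_columns = True
)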