
Slow training of BERT model with Hugging Face

I am training a binary classifier using a BERT model implemented in the Hugging Face library:

from transformers import TrainingArguments

training_args = TrainingArguments(
    "deleted_tweets_trainer",     # output directory for checkpoints
    num_train_epochs=1,
    #logging_steps=100,
    evaluation_strategy="steps",  # evaluate every eval_steps (default: 500)
    remove_unused_columns=True,
)

I am using a Colab TPU, but training still takes a long time: about 38 hours for 60k cleaned tweets.

Is there any way to optimise the training?

You are currently evaluating every 500 steps and using the default training and eval batch size of 8.

Depending on your current memory consumption, you can increase the batch sizes (the eval batch size can usually be raised much more, since training consumes more memory):

  • per_device_train_batch_size
  • per_device_eval_batch_size

If it fits your use case, you can also increase the number of steps between evaluations, as shown in the sketch below:

  • eval_steps
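
For example, here is a minimal sketch of the adjusted arguments. The batch sizes of 32 and 128 and the eval_steps value of 2000 are illustrative assumptions, not recommendations; raise them only as far as your accelerator's memory allows:

from transformers import TrainingArguments

training_args = TrainingArguments(
    "deleted_tweets_trainer",
    num_train_epochs=1,
    evaluation_strategy="steps",
    eval_steps=2000,                 # evaluate less often than the default 500 steps
    per_device_train_batch_size=32,  # assumed value; limited by training memory (gradients, optimizer state)
    per_device_eval_batch_size=128,  # eval stores no gradients, so it can usually go higher
    remove_unused_columns=True,
)

Larger batches mean fewer optimizer steps per epoch, and a higher eval_steps value reduces how often a full pass over the eval set interrupts training, both of which cut total wall-clock time.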
