简体   繁体   English

在本地 gpu 和 colab gpu 上进行分布式训练

[英]Distributed training over local gpu and colab gpu

I want to fine tune ALBERT.我想微调 ALBERT。

I see one can distribute neural net training over multiple gpus using tensorflow: https://www.tensorflow.org/guide/distributed_training我看到可以使用 tensorflow 在多个 gpu 上分配神经网络训练: https ://www.tensorflow.org/guide/distributed_training

I was wondering if it's possible to distribute fine-tuning across both my laptop's gpu and a colab gpu?我想知道是否可以在我的笔记本电脑的 gpu 和 colab gpu 上分配微调?

I don't think that's possible.我不认为那是可能的。 Because in order to do GPU distributed training, you need NVLinks among your GPUs.因为为了进行 GPU 分布式训练,您的 GPU 之间需要 NVLink。 You don't have such a link between your laptop's GPU and Colab GPUs.您的笔记本电脑的 GPU 和 Colab GPU 之间没有这种联系。 This is a good read https://lambdalabs.com/blog/introduction-multi-gpu-multi-node-distributed-training-nccl-2-0/这是一个很好的阅读https://lambdalabs.com/blog/introduction-multi-gpu-multi-node-distributed-training-nccl-2-0/

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM