
Is there any difference between tensor2tensor and pytorch in view of memory?

I'm trying to train a seq2seq model (Transformer) with both PyTorch and tensor2tensor. When using tensor2tensor, the batch size can be as large as 1024, while the PyTorch model shows a CUDA out-of-memory error even with a batch size of 8.

Is there any technique used in tensor2tensor to make better use of memory?

If anyone knows, please tell me.

Thanks in advance.

In Tensor2Tensor, the batch size is by default specified as the number of tokens (subwords) per single GPU. This allows a batch to contain either many short sequences (sentences) or fewer long ones. Most other toolkits use a fixed batch size specified as a number of sequences. Either way, it is a good idea to cap the maximum sentence length during training at a reasonable value, to prevent out-of-memory errors and excessive padding. Some toolkits instead specify the total batch size across all GPU cards.
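The idea of a token-based batch size can be sketched in a few lines. The following is a minimal illustration, not Tensor2Tensor's actual implementation: sequences are sorted by length so that each batch mixes similar lengths, and a batch is emitted once adding one more sequence would push its padded size (number of sequences × longest sequence) over the token budget. The function name and the budget of 256 tokens are illustrative choices.

```python
def batch_by_tokens(sequences, max_tokens_per_batch=1024):
    """Group tokenized sequences into batches capped by total padded token count.

    A minimal sketch of token-based batching: each batch's cost is
    len(batch) * max_sequence_length, i.e. its size after padding.
    """
    batches = []
    batch, max_len = [], 0
    for seq in sorted(sequences, key=len):
        new_max = max(max_len, len(seq))
        # Padded cost if this sequence were added to the current batch.
        if batch and (len(batch) + 1) * new_max > max_tokens_per_batch:
            batches.append(batch)          # emit the full batch
            batch, max_len = [seq], len(seq)
        else:
            batch.append(seq)
            max_len = new_max
    if batch:
        batches.append(batch)
    return batches

# Short sentences pack densely into one batch; long ones get small batches.
sents = [[0] * n for n in (5, 5, 6, 40, 120, 3, 7, 200)]
for b in batch_by_tokens(sents, max_tokens_per_batch=256):
    print(len(b), "sequences, max length", max(len(s) for s in b))
```

With a fixed per-sequence batch size, a batch of eight 200-token sentences costs 1600 padded tokens while a batch of eight 5-token sentences costs only 40; a token budget keeps the memory footprint of every batch roughly constant.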

Disclaimer: the technical posts on this site follow the CC BY-SA 4.0 license; if you need to republish, please credit this site or the original source. For any questions, contact: yoyou2525@163.com.

 