
Are long sentences not good for deep learning models?

I'm interested to know whether long sentences are good for tensor2tensor model training, and why or why not.

Ideally, the training data should have the same distribution of sentence lengths as the target test data. For example, in machine translation, if the final model is intended to translate long sentences, similarly long sentences should also be used for training. The Transformer model does not seem to generalize to sentences longer than those used for training, but limiting the maximum sentence length in training makes it possible to use larger batch sizes, which is helpful (Popel and Bojar, 2018).
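To make this concrete, here is a minimal, hypothetical sketch (plain Python, not tensor2tensor's actual API) of the kind of length filtering the answer describes: dropping sentence pairs whose source or target exceeds a token cap so that the training length distribution stays within the range the model will see at test time.

```python
# Illustrative sketch only: filter a parallel corpus by a maximum token
# length, as is commonly done before Transformer training. The corpus and
# the cap value are made-up examples, not tensor2tensor defaults.

def filter_by_length(pairs, max_tokens):
    """Keep only pairs whose source AND target fit within max_tokens
    (counted here by naive whitespace splitting)."""
    return [
        (src, tgt)
        for src, tgt in pairs
        if len(src.split()) <= max_tokens and len(tgt.split()) <= max_tokens
    ]

corpus = [
    ("a short sentence", "une phrase courte"),
    ("this is a much longer sentence that exceeds the cap used in this example",
     "ceci est une phrase beaucoup plus longue qui depasse la limite"),
]

kept = filter_by_length(corpus, max_tokens=10)
print(len(kept))  # only the short pair survives the cap
```

The trade-off the answer points at: a lower cap lets you pack more sentences into each batch (fixed memory budget), but the model then never sees, and may fail to generalize to, longer inputs.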

