繁体 English 中英

如何在 Pytorch1.1 和 DistributedDataParallel() 中计算米？

[英]How to calculate meters in Pytorch1.1 & DistributedDataParallel()?

原文 2019-05-19 07:36:07 9 1 python/ python-3.x/ pytorch/ distributed

我想同时使用模型并行和数据并行，并从官网阅读了许多文档和教程。 我面临的一个令人困惑的问题是如何收集每个进程中的各种仪表值？

问题1 ：在官方教程中，他们只是记录每个进程中的仪表值。 但是在我的代码中，我在每个过程中打印损失值，它们是不同的。 所以，我觉得其他仪表的数值也是不一样的。 那个教程错了吗？ 在我看来，我认为正确的方法应该是先同步loss、acc等仪表，然后所有进程保持相同的值，然后我只需要在一个进程中打印仪表信息。

问题 2 ：在官方教程中，他们说“DistributedDataParallel 模块还处理全球梯度的平均，因此我们不必在训练步骤中明确平均梯度”。但是，由于问题 1， API 是否真的像教程所说的那样工作？ 因为每个进程都有不同的损失值，虽然它们从相同的init权重开始，但每个进程中的模型权重会朝着不同的方向优化吗？

1 个解决方案

分布式采样器给每个进程一个不同的训练数据子集，所以每个进程评估的损失都会不同。 如果在没有分布式采样器的情况下只计算每个进程中测试集的损失，您将看到所有进程报告相同的数字。

如何在多个 GPU 的 Pytorch 示例中利用 DistributedDataParallel 的世界大小参数？

[英]How to leverage the world-size parameter for DistributedDataParallel in Pytorch example for multiple GPUs?

具有不同 GPU 速度的 PyTorch DistributedDataParallel 是否同步权重？

[英]Is PyTorch DistributedDataParallel with different GPU speeds syncing weights?

有没有办法替换 Pytorch 中用于 DDP(DistributedDataParallel) 的“allreduce_hook”？

[英]Is there a way to replace the 'allreduce_hook' used for DDP(DistributedDataParallel) in Pytorch?

如何使用PyTorch计算偏导数？

[英]How to use PyTorch to calculate partial derivatives?

如何在 PyTorch 中使用矩阵运算计算前向传递？

[英]How to calculate a Forward Pass with matrix operations in PyTorch?

如何用pytorch计算softmax回归的成本

[英]How to calculate cost for softmax regression with pytorch

计算 LineString 和 Point 之间的距离（以米为单位）

[英]Calculate distance between LineString and Point in meters

使用 PyTorch 计算第二个梯度

[英]Calculate Second Gradient with PyTorch

如何在 Windows x64 上安装 Pytorch 1.1 版？

[英]How can I install Pytorch version 1.1 on Windows x64?

如何在PyTorch中计算迷你批次与一组过滤器之间的距离

[英]How to calculate the distance between a mini batch and a set of filters in PyTorch

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 如何在多个 GPU 的 Pytorch 示例中利用 DistributedDataParallel 的世界大小参数？具有不同 GPU 速度的 PyTorch DistributedDataParallel 是否同步权重？有没有办法替换 Pytorch 中用于 DDP(DistributedDataParallel) 的“allreduce_hook”？如何使用PyTorch计算偏导数？如何在 PyTorch 中使用矩阵运算计算前向传递？如何用pytorch计算softmax回归的成本计算 LineString 和 Point 之间的距离（以米为单位）使用 PyTorch 计算第二个梯度如何在 Windows x64 上安装 Pytorch 1.1 版？如何在PyTorch中计算迷你批次与一组过滤器之间的距离

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM