TensorFlow: Is it possible to restore checkpoint models for multi-gpu training?

Original post: 2017-02-22 09:30:00 8 1 python/ machine-learning/ tensorflow/ deep-learning

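I am currently using a Supervisor and have built only one graph to perform transfer learning with pre-trained weights from TF-slim. Is there a way to restore a checkpoint model into multiple inference models at the start? My main concern is that the name scopes defined in the reference code in the TF repository would probably prevent the pre-trained variables from being restored, because the names would not match. Also, given that I have to use a Supervisor with an init_fn, which uses only one saver to restore variables, how could I have multiple savers restoring the same variables to multiple GPUs (if I need multiple savers at all)?

One idea I have is to restore the variables into just one graph and then let the other GPUs use that same graph for training. However, would training on the next GPU only start after the first GPU is done? And even this way, I would not be able to restore the weights according to the variable names in the original checkpoint unless I edit the names of the checkpoint weights.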

1 Answer

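The tensorflow documentation on saving and restoring variables points you to the saver object. When constructing the saver, you can pass a dictionary that maps saved names to variable objects, which lets you specify which saved variable is restored into which model variable.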
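To make the dictionary idea concrete, here is a minimal sketch of how such a mapping from saved names to model variables might be built when the model's variables carry an extra scope prefix that the checkpoint names lack. The scope name `tower_0/`, the variable names, and the helpers `checkpoint_name` and `build_restore_map` are all hypothetical and not taken from the question. Plain strings stand in for the variable objects so that the sketch runs without TensorFlow installed.

```python
# Sketch: build the {saved_name: model_variable} dictionary that a saver
# accepts. Plain strings stand in for tf.Variable objects here; all names
# below are hypothetical.

def checkpoint_name(variable_name, scope_prefix):
    """Map a model variable name to the name it is assumed to have in the checkpoint.

    Drops the output suffix (":0") and, if present, a leading scope
    prefix such as "tower_0/".
    """
    name = variable_name.split(":")[0]
    if scope_prefix and name.startswith(scope_prefix):
        name = name[len(scope_prefix):]
    return name


def build_restore_map(variables, scope_prefix):
    """Return {saved_name: variable} for every (name, variable) pair given."""
    return {checkpoint_name(name, scope_prefix): var for name, var in variables}


# Hypothetical (graph name, variable object) pairs.
model_variables = [
    ("tower_0/model/conv1/weights:0", "<var weights>"),
    ("tower_0/model/conv1/biases:0", "<var biases>"),
]

restore_map = build_restore_map(model_variables, "tower_0/")
for saved_name, var in sorted(restore_map.items()):
    print(saved_name, "->", var)

# With TensorFlow 1.x, the pairs would be (v.name, v) for each tf.Variable
# to restore, and the dictionary would be passed to the saver as
# tf.train.Saver(var_list=restore_map).
```

Whether the prefix needs stripping at all depends on how the graph was built, so it is worth comparing the model's variable names against the names stored in the checkpoint before relying on a mapping like this.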


