[英]TensorFlow-gpu-2.0.0rc2 cannot find cuda-10.1 libraries and skips registered GPU devices
我在我是非管理員用戶的系統上使用NVIDIA Tesla V100-SXM2-32GB
,因此我無法更改 Cuda 版本。 當前安裝在系統上的 Cuda 版本是10.1
,我正在嘗試讓 TensorFlow 與此版本一起運行。 安裝 TensorFlow 版本2.0.0rc2
(使用cudnn-7.6.4
和cudatoolkit-10.1.243
)后,我收到下面報告的錯誤(在默認啟用的急切執行模式下)。 Cuda 庫的路徑已正確導出。
根據官方文檔和這篇文章,TensorFlow 目前支持 Cuda 10.0
。 有人知道可以與 Cuda 10.1 一起運行的版本(甚至是 alpha 版本)嗎?
python -c "import tensorflow as tf; tf.zeros(10)"
返回
2019-11-10 11:55:36.118647: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcuda.so.1
2019-11-10 11:55:39.393230: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1618] Found device 0 with properties:
name: Tesla V100-SXM2-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53
pciBusID: 0000:1a:00.0
2019-11-10 11:55:39.395456: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1618] Found device 1 with properties:
name: Tesla V100-SXM2-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53
pciBusID: 0000:1c:00.0
2019-11-10 11:55:39.397553: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1618] Found device 2 with properties:
name: Tesla V100-SXM2-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53
pciBusID: 0000:1d:00.0
2019-11-10 11:55:39.399647: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1618] Found device 3 with properties:
name: Tesla V100-SXM2-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53
pciBusID: 0000:1e:00.0
2019-11-10 11:55:39.399986: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcudart.so.10.0'; dlerror: libcudart.so.10.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda-8.0/lib64
2019-11-10 11:55:39.400135: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcublas.so.10.0'; dlerror: libcublas.so.10.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda-8.0/lib64
2019-11-10 11:55:39.400274: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcufft.so.10.0'; dlerror: libcufft.so.10.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda-8.0/lib64
2019-11-10 11:55:39.400414: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcurand.so.10.0'; dlerror: libcurand.so.10.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda-8.0/lib64
2019-11-10 11:55:39.400552: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcusolver.so.10.0'; dlerror: libcusolver.so.10.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda-8.0/lib64
2019-11-10 11:55:39.400687: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcusparse.so.10.0'; dlerror: libcusparse.so.10.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda-8.0/lib64
2019-11-10 11:55:39.405250: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcudnn.so.7
2019-11-10 11:55:39.405367: W tensorflow/core/common_runtime/gpu/gpu_device.cc:1641] Cannot dlopen some GPU libraries. Please make sure the missing libraries mentioned above are installed properly if you would like to use GPU. Follow the guide at https://www.tensorflow.org/install/gpu for how to download and setup the required libraries for your platform.
Skipping registering GPU devices...
2019-11-10 11:55:39.405848: I tensorflow/core/platform/cpu_feature_guard.cc:142] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 AVX512F FMA
2019-11-10 11:55:39.412764: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2100000000 Hz
2019-11-10 11:55:39.412951: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x555c0a4adfd0 executing computations on platform Host. Devices:
2019-11-10 11:55:39.413028: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Host, Default Version
2019-11-10 11:55:40.213011: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x555c0a4b0850 executing computations on platform CUDA. Devices:
2019-11-10 11:55:40.213144: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-SXM2-32GB, Compute Capability 7.0
2019-11-10 11:55:40.213208: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-SXM2-32GB, Compute Capability 7.0
2019-11-10 11:55:40.213262: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-SXM2-32GB, Compute Capability 7.0
2019-11-10 11:55:40.213312: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-SXM2-32GB, Compute Capability 7.0
2019-11-10 11:55:40.213562: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1159] Device interconnect StreamExecutor with strength 1 edge matrix:
2019-11-10 11:55:40.213647: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1165]
如果您在 Windows 中使用 Tensorflow,則無需再自行安裝所有 CUDA 和 Cudnn 驅動程序^_^
只需在 conda 上使用以下命令,它就會自行處理相應的包:
創建新環境:
conda create -n [名稱] python=3.6
conda 激活[名稱]
然后,使用:
conda install -c conda-forge tensorflow-gpu==1.14
conda 環境將根據您的系統需要檢查和安裝軟件包。 干杯!
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.