簡體   English   中英

TensorFlow-gpu-2.0.0rc2 找不到 cuda-10.1 庫並跳過已注冊的 GPU 設備

[英]TensorFlow-gpu-2.0.0rc2 cannot find cuda-10.1 libraries and skips registered GPU devices

我在我是非管理員用戶的系統上使用NVIDIA Tesla V100-SXM2-32GB ,因此我無法更改 Cuda 版本。 當前安裝在系統上的 Cuda 版本是10.1 ,我正在嘗試讓 TensorFlow 與此版本一起運行。 安裝 TensorFlow 版本2.0.0rc2 (使用cudnn-7.6.4cudatoolkit-10.1.243 )后,我收到下面報告的錯誤(在默認啟用的急切執行模式下)。 Cuda 庫的路徑已正確導出。

根據官方文檔這篇文章,TensorFlow 目前支持 Cuda 10.0 有人知道可以與 Cuda 10.1 一起運行的版本(甚至是 alpha 版本)嗎?

python -c "import tensorflow as tf; tf.zeros(10)"

返回

2019-11-10 11:55:36.118647: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcuda.so.1
2019-11-10 11:55:39.393230: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1618] Found device 0 with properties: 
name: Tesla V100-SXM2-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53
pciBusID: 0000:1a:00.0
2019-11-10 11:55:39.395456: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1618] Found device 1 with properties: 
name: Tesla V100-SXM2-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53
pciBusID: 0000:1c:00.0
2019-11-10 11:55:39.397553: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1618] Found device 2 with properties: 
name: Tesla V100-SXM2-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53
pciBusID: 0000:1d:00.0
2019-11-10 11:55:39.399647: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1618] Found device 3 with properties: 
name: Tesla V100-SXM2-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53
pciBusID: 0000:1e:00.0
2019-11-10 11:55:39.399986: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcudart.so.10.0'; dlerror: libcudart.so.10.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda-8.0/lib64
2019-11-10 11:55:39.400135: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcublas.so.10.0'; dlerror: libcublas.so.10.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda-8.0/lib64
2019-11-10 11:55:39.400274: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcufft.so.10.0'; dlerror: libcufft.so.10.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda-8.0/lib64
2019-11-10 11:55:39.400414: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcurand.so.10.0'; dlerror: libcurand.so.10.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda-8.0/lib64
2019-11-10 11:55:39.400552: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcusolver.so.10.0'; dlerror: libcusolver.so.10.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda-8.0/lib64
2019-11-10 11:55:39.400687: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcusparse.so.10.0'; dlerror: libcusparse.so.10.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda/lib64:/usr/local/cuda-8.0/lib64
2019-11-10 11:55:39.405250: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcudnn.so.7
2019-11-10 11:55:39.405367: W tensorflow/core/common_runtime/gpu/gpu_device.cc:1641] Cannot dlopen some GPU libraries. Please make sure the missing libraries mentioned above are installed properly if you would like to use GPU. Follow the guide at https://www.tensorflow.org/install/gpu for how to download and setup the required libraries for your platform.
Skipping registering GPU devices...
2019-11-10 11:55:39.405848: I tensorflow/core/platform/cpu_feature_guard.cc:142] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 AVX512F FMA
2019-11-10 11:55:39.412764: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2100000000 Hz
2019-11-10 11:55:39.412951: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x555c0a4adfd0 executing computations on platform Host. Devices:
2019-11-10 11:55:39.413028: I tensorflow/compiler/xla/service/service.cc:175]   StreamExecutor device (0): Host, Default Version
2019-11-10 11:55:40.213011: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x555c0a4b0850 executing computations on platform CUDA. Devices:
2019-11-10 11:55:40.213144: I tensorflow/compiler/xla/service/service.cc:175]   StreamExecutor device (0): Tesla V100-SXM2-32GB, Compute Capability 7.0
2019-11-10 11:55:40.213208: I tensorflow/compiler/xla/service/service.cc:175]   StreamExecutor device (1): Tesla V100-SXM2-32GB, Compute Capability 7.0
2019-11-10 11:55:40.213262: I tensorflow/compiler/xla/service/service.cc:175]   StreamExecutor device (2): Tesla V100-SXM2-32GB, Compute Capability 7.0
2019-11-10 11:55:40.213312: I tensorflow/compiler/xla/service/service.cc:175]   StreamExecutor device (3): Tesla V100-SXM2-32GB, Compute Capability 7.0
2019-11-10 11:55:40.213562: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1159] Device interconnect StreamExecutor with strength 1 edge matrix:
2019-11-10 11:55:40.213647: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1165]      

如果您在 Windows 中使用 Tensorflow,則無需再自行安裝所有 CUDA 和 Cudnn 驅動程序^_^

只需在 conda 上使用以下命令,它就會自行處理相應的包:

創建新環境:

conda create -n [名稱] python=3.6

conda 激活[名稱]

然后,使用:

conda install -c conda-forge tensorflow-gpu==1.14

conda 環境將根據您的系統需要檢查和安裝軟件包。 干杯!

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM