簡體   English   中英

在 docker 容器中運行 tensorflow 時出錯

[英]Error running tensorflow in docker container

我正在嘗試在 Docker 容器中運行的 Python 應用程序中使用 Tensorflow 模塊(實際上我使用的是 Keras,但錯誤來自 Tensorflow)

我有模型( .json.h5文件),我想加載以使用它:

import logging
import os
from keras.models import model_from_json # library for machine learning
from numpy import array
import json

def load_models():
    global loaded_h_model
    global loaded_u_model
    global loaded_r_model
    global loaded_c_model

    modelPath = os.getenv("MODELPATH", "./models/")

    # load models
    json_h_file = open(modelPath+'model_HD.json', 'r')
    loaded_model_h_json = json_h_file.read()
    json_h_file.close()
    loaded_h_model = model_from_json(loaded_model_h_json)
    loaded_h_model.load_weights(modelPath+"model_HD.h5")

    json_u_file = open(modelPath+'model_UD.json', 'r')
    loaded_model_u_json = json_u_file.read()
    json_u_file.close()
    loaded_u_model = model_from_json(loaded_model_u_json)
    loaded_u_model.load_weights(modelPath+"model_UD.h5")

    json_r_file = open(modelPath+'model_RD.json', 'r')
    loaded_model_r_json = json_r_file.read()
    json_r_file.close()
    loaded_r_model = model_from_json(loaded_model_r_json)
    loaded_r_model.load_weights(modelPath+"model_RD.h5")

    json_c_file = open(modelPath+'model_CD.json', 'r')
    loaded_model_c_json = json_c_file.read()
    json_c_file.close()
    loaded_c_model = model_from_json(loaded_model_c_json)
    loaded_c_model.load_weights(modelPath+"model_CD.h5")

這是我使用的 Dockerfile:

FROM python:3.7

# copy source code files
COPY machinelearning.py ./

# copy models files
COPY models/* ./models/

# install dependencies
RUN pip3 install --upgrade pip \
    && pip3 install h5py \
    && pip3 install tensorflow \
    && pip3 install keras

# run script
CMD [ "python", "./machinelearning.py" ]

但是當我運行 Docker 容器時,我有以下警告/錯誤:

2020-01-29 09:40:24.542588: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libnvinfer.so.6'; dlerror: libnvinfer.so.6: cannot open shared object file: No such file or directory
2020-01-29 09:40:24.542727: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libnvinfer_plugin.so.6'; dlerror: libnvinfer_plugin.so.6: cannot open shared object file: No such file or directory
2020-01-29 09:40:24.542743: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:30] Cannot dlopen some TensorRT libraries. If you would like to use Nvidia GPU with TensorRT, please make sure the missing libraries mentioned above are installed properly.
Using TensorFlow backend.
2020-01-29 09:40:25.394254: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcuda.so.1'; dlerror: libcuda.so.1: cannot open shared object file: No such file or directory
2020-01-29 09:40:25.394289: E tensorflow/stream_executor/cuda/cuda_driver.cc:351] failed call to cuInit: UNKNOWN ERROR (303)
2020-01-29 09:40:25.394321: I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:156] kernel driver does not appear to be running on this host (dd231f397f1f): /proc/driver/nvidia/version does not exist
2020-01-29 09:40:25.394539: I tensorflow/core/platform/cpu_feature_guard.cc:142] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
2020-01-29 09:40:25.419513: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 1992000000 Hz
2020-01-29 09:40:25.420250: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x55cab5bf9760 initialized for platform Host (this does not guarantee that XLA will be used). Devices:
2020-01-29 09:40:25.420299: I tensorflow/compiler/xla/service/service.cc:176]   StreamExecutor device (0): Host, Default Version

我相信我需要在我的 Dockerfile 中安裝庫或不同版本的 Tensorflow/Keras。

我該如何解決這個問題? 謝謝

首先,您需要COPY requirements.txt /to/destination 您的 requirements.txt 應包含具有版本號的依賴項。

FROM python:latest
COPY requirements.txt /usr/src/code/

在那之后運行

RUN pip3 install -r requirements.txt

而不是 Dockerfile 中的以下代碼

RUN pip3 install --upgrade pip \
    && pip3 install h5py \
    && pip3 install tensorflow \
    && pip3 install keras 

我希望通過在 requirements.txt 中提及版本號來解決問題,而不僅僅是 --upgrade 標簽。

如果不需要,也不要運行升級。

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM