No module named trainer, Cloud ML Engine for TensorFlow Tutorial, Running Locally

Question

Have been attempting to follow the Google tutorial to use ML Engine for TensorFlow. Have gotten stuck where it says "run a local training job" with the error

/usr/bin/python: No module named trainer

Full command is:

gcloud ml-engine local train \
    --module-name trainer.task \
    --package-path trainer/ \
    --job-dir $MODEL_DIR \
    -- \
    --train-files $TRAIN_DATA \
    --eval-files $EVAL_DATA \
    --train-steps 1000 \
    --eval-steps 100

The three variables are all set up correctly to my knowledge though it doesn't even get to them right now. The tutorial doesn't specify downloading a trainer file or how it is referenced, googling for the past hour hasn't turned up any working solutions. Have found this general explanation:

--module-name specifies the name of your application's main module, using your package's namespace dot notation. This is the Python file that you run to start your application. For example, if your main module is .../my_application/trainer/task.py (see the recommended project structure), then the module name is trainer.task

Any info would be appreciated.

Answer 1

Thanks to the information from Dustin in the comments have found the solution.

Was simply downloading the trainer directory on the Cloud Shell and not my local environment.

Now File Structure Looks Like:

estimator
    |-- data
    |   |-- adult_data.csv
    |   |-- adult_test.csv
    |-- output
    |-- trainer
    |   |-- __init__.py
    |   |-- model.py
    |   |-- task.ipynb
    |   |-- task.py

No module named trainer, Cloud ML Engine for TensorFlow Tutorial, Running Locally

Question

1 answers

solution1
0 ACCPTED 2018-11-19 17:33:07

No module named trainer, Cloud ML Engine for TensorFlow Tutorial, Running Locally

Question

1 answers

solution1 0 ACCPTED 2018-11-19 17:33:07

solution1
0 ACCPTED 2018-11-19 17:33:07