繁体   English   中英

Airflow Docker Operator 无法在本地机器上找到 .sock 文件

[英]Airflow Docker Operator unable to find .sock file on local machine

我想使用 Airflow 按计划运行包含 python 脚本的 docker 容器。 在本地通过 Airflow CLI 运行我的 DockerOperator 任务时遇到问题。

--------------------------------------------------------------------------------
Starting attempt 1 of 4
--------------------------------------------------------------------------------

[2018-10-31 15:20:10,760] {models.py:1569} INFO - Executing <Task(DockerOperator): amplitude_to_s3_docker> on 2018-10-02T00:00:00+00:00
[2018-10-31 15:20:10,761] {base_task_runner.py:124} INFO - Running: ['bash', '-c', 'airflow run get_amplitude_docker_dag amplitude_to_s3_docker 2018-10-02T00:00:00+00:00 --job_id 19 --raw -sd DAGS_FOLDER/amplitude_to_s3_docker_dag.py --cfg_path /var/folders/ys/83xq3b3d1qv3zfx3dtkkp9tc0000gn/T/tmp_lu9mgzz']
[2018-10-31 15:20:12,501] {base_task_runner.py:107} INFO - Job 19: Subtask amplitude_to_s3_docker [2018-10-31 15:20:12,501] {__init__.py:51} INFO - Using executor SequentialExecutor
[2018-10-31 15:20:13,465] {base_task_runner.py:107} INFO - Job 19: Subtask amplitude_to_s3_docker [2018-10-31 15:20:13,464] {models.py:258} INFO - Filling up the DagBag from /Users/thisuser/Projects/GitRepos/DataWarehouse/dags/amplitude_to_s3_docker_dag.py
[2018-10-31 15:20:13,581] {base_task_runner.py:107} INFO - Job 19: Subtask amplitude_to_s3_docker [2018-10-31 15:20:13,581] {example_kubernetes_operator.py:54} WARNING - Could not import KubernetesPodOperator: No module named 'kubernetes'
[2018-10-31 15:20:13,582] {base_task_runner.py:107} INFO - Job 19: Subtask amplitude_to_s3_docker [2018-10-31 15:20:13,582] {example_kubernetes_operator.py:55} WARNING - Install kubernetes dependencies with:     pip install airflow['kubernetes']
[2018-10-31 15:20:13,770] {base_task_runner.py:107} INFO - Job 19: Subtask amplitude_to_s3_docker [2018-10-31 15:20:13,770] {cli.py:492} INFO - Running <TaskInstance: get_amplitude_docker_dag.amplitude_to_s3_docker 2018-10-02T00:00:00+00:00 [running]> on host 254.1.168.192.in-addr.arpa
[2018-10-31 15:20:13,804] {docker_operator.py:169} INFO - Starting docker container from image amplitude
[2018-10-31 15:20:13,974] {models.py:1736} ERROR - create_container() got an unexpected keyword argument 'cpu_shares'
Traceback (most recent call last):
  File "/Users/thisuser/anaconda/lib/python3.5/site-packages/airflow/models.py", line 1633, in _run_raw_task
    result = task_copy.execute(context=context)
  File "/Users/thisuser/anaconda/lib/python3.5/site-packages/airflow/operators/docker_operator.py", line 210, in execute
    working_dir=self.working_dir
TypeError: create_container() got an unexpected keyword argument 'cpu_shares'

我让脚本在 Airflow 之外运行良好,使用以下命令:

docker run amplitude get_amplitude.py 2018-10-02 2018-10-02

这是我的 dag 和任务文件:

from airflow import DAG
from airflow.operators.bash_operator import BashOperator
from airflow.operators.docker_operator import DockerOperator
from datetime import datetime, timedelta


default_args = {
    "owner": "airflow",
    "depends_on_past": False,
    "start_date": datetime(2018, 10, 30),
    "email": ["me@myemail.com"],
    "email_on_failure": True,
    "email_on_retry": False,
    "retries": 3,
    "retry_delay": timedelta(minutes=5),
}

dag = DAG("get_amplitude_docker_dag", default_args=default_args, schedule_interval=timedelta(minutes=10))

templated_command = """
    get_amplitude.py {{ ds }} {{ ds }}
"""

t1 = DockerOperator(
   task_id='amplitude_to_s3_docker',
   command=templated_command,
   image='amplitude',
   dag=dag
)

初始化本地气流数据库并启动网络服务器 + 调度程序后,我使用以下命令运行我的 dag 任务:

airflow run get_amplitude_docker_dag amplitude_to_s3_docker 2018-10-02

此外,如果我将其配置为 bash 操作员,该任务将在气流中正常运行:

templated_command = """
   docker run amplitude get_amplitude.py {{ ds }} {{ ds }} 
"""


t1 = BashOperator(
    task_id="amplitude_to_s3",
    bash_command=templated_command,
    params={},
    dag=dag,
)

我之前读过,安装 docker 守护程序可能会出现问题,但我的 .sock 文件位于默认docker_url参数指向的位置,/ docker_url

谁能帮我配置这个工作?

实际错误是TypeError: create_container() got an unexpected keyword argument 'cpu_shares'这意味着create_container函数不希望cpu_shares作为参数。

我在使用 docker python 库版本 3.5.1 并降级到版本2.7.0 (这似乎是接受create_containercpu_shares参数的最新版本)时遇到了同样的错误,解决了这个问题。

尝试运行它来降级 docker 库:

sudo pip3 install docker==2.7.0

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM