简体繁体中英

Flink Job Cluster vs Session Cluster - deploying and configuration

原文 2020-01-10 14:39:29 7 1 java/ docker/ deployment/ apache-flink/ flink-streaming

I'm researching docker/k8s deployment possibilities for Flink 1.9.1.

I'm after reading/watching [1][2][3][4].

Currently we do think that we will try go with Job Cluster approach although we would like to know what is the community trend with this? We would rather not deploy more than one job per Flink cluster.

Anyways, I was wondering about few things:

How can I change the number of task slots per task manager for Job and Session Cluster? In my case I'm running docker on VirtualBox where I have 4 CPUs assigned to this machine. However each task manager is spawned with only one task slot for Job Cluster. With Session Cluster however, on the same machine, each task manager is spawned with 4 task slots.
In both cases Flink's UI shows that each Task manager has 4 CPUs.
How can I resubmit job if I'm using a Job Cluster. I'm referring this use case [5]. You may say that I have to start the job again but with different arguments. What is the procedure for this? I'm using checkpoints btw.

Should I kill all task manager containers and rerun them with different parameters?

How I can resubmit job using Session Cluster?
How I can provide log config for Job/Session cluster? I have a case, where I changed log level and log format in log4j.properties and this is working fine on local (IDE) environment. However when I build the fat jar, and ran a Job Cluster based on this jar it seams that my log4j properties are not passed to the cluster. I see the original format and original (INFO) level.

Thanks,

[1] https://youtu.be/w721NI-mtAA

[2] https://youtu.be/WeHuTRwicSw

[3] https://ci.apache.org/projects/flink/flink-docs-stable/ops/deployment/docker.html

[4] https://github.com/apache/flink/blob/release-1.9/flink-container/docker/README.md

[5] http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Flink-Job-claster-scalability-td32027.html

1 answers

Currently we do think that we will try go with Job Cluster approach although we would like to know what is the community trend with this? We would rather not deploy more than one job per Flink cluster.

This question is probably better suited on the user mailing list.

How can I change the number of task slots per task manager for Job and Session Cluster?

You can control this via the config option taskmanager.numberOfTaskSlots

How I can resubmit job using Session Cluster?

This is described here . The bottom line is that you create a savepoint and resume your job from it. It is also possible to resume a job from retained checkpoints .

How can I resubmit job if I'm using a Job Cluster.

Conceptually, this is no different from resuming a job from a savepoint in a session cluster. You can specify the path to the savepoint as a command line argument to the cluster entrypoint. The details are described here .

How I can provide log config for Job/Session cluster?

If you are using the scripts in the bin/ directory of the Flink binary distribution to start your cluster (eg, bin/start-cluster.sh , bin/jobmanager.sh , bin/taskmanager.sh , etc.), you can change the log4j configuration by adapting conf/log4j.properties . The logging configuration is passed to the JobManager and TaskManager JVMs as a system variable (see bin/flink-daemon.sh ). See also the Chapter "How to use logging" in the Flink documentation.

flink - cluster not using cluster

Run a Flink job on a remote cluster without providing a .jar

Run already deployed job on Flink Cluster using RestClusterClient

Flink what is the proper way to submit args to job from cluster GUI?

How do I submit a job to a Flink cluster using Java code?

Apache Flink Job cluster rpc.address binding to localhost on kubernetes

Flink cluster on EKS

Cassandra Session vs Cluster What to Share?

Flink: Jar file execution on Flink cluster

Apache Flink (Error in stdout in cluster)

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question flink - cluster not using cluster Run a Flink job on a remote cluster without providing a .jar Run already deployed job on Flink Cluster using RestClusterClient Flink what is the proper way to submit args to job from cluster GUI? How do I submit a job to a Flink cluster using Java code? Apache Flink Job cluster rpc.address binding to localhost on kubernetes Flink cluster on EKS Cassandra Session vs Cluster What to Share? Flink: Jar file execution on Flink cluster Apache Flink (Error in stdout in cluster)

Related Tags

Flink Job Cluster vs Session Cluster - deploying and configuration

Question

1 answers

solution1 2 ACCPTED 2020-01-15 15:48:36

solution1
2 ACCPTED 2020-01-15 15:48:36