Netflix Conductor HTTP tasks stuck in scheduled state for a long time

Original post 2022-06-13 18:08:41 5 1 netflix-conductor

We have Netflix Conductor deployed on GCP, with a strong Postgres instance as the persistence store.

Whenever more than 3k workflows start executing in parallel (each workflow has about 4 HTTP tasks), the time it takes for an HTTP task to start executing grows larger and larger.

The task simply stays in the scheduled state, and under higher loads it can stay there for many minutes.

We checked the workload metrics for the Conductor servers and the Postgres DB, and they are far from reaching their resource limits.

We thought about using isolation groups for these HTTP tasks, but that would not be beneficial, since 80% of all executed tasks are these HTTP tasks, and they are exactly the ones we don't want stuck in the scheduled state.

Which configuration, settings, or setup should I change in order to solve the problem of HTTP tasks getting stuck in the scheduled state?

Thanks

1 answer

Are some of your HTTP tasks long-running? Those tasks might be occupying all of your available workers, leaving the faster tasks waiting in a queue.
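To see why slow tasks can matter even when CPU and DB metrics look idle, here is a rough back-of-envelope sketch. The worker thread count (10) and the task durations are made-up numbers for illustration, not Conductor defaults, and the model ignores polling intervals and the fact that the 4 tasks in a workflow usually run one after another.

```python
import math


def estimated_drain_seconds(queued_tasks, worker_threads, avg_task_seconds):
    """Rough time for a fixed pool of workers to clear a backlog.

    Assumes every task takes avg_task_seconds and workers are never idle.
    """
    if worker_threads <= 0:
        raise ValueError("worker_threads must be positive")
    rounds = math.ceil(queued_tasks / worker_threads)
    return rounds * avg_task_seconds


# Worst case from the question: 3k workflows x 4 HTTP tasks each.
backlog = 3000 * 4

# Hypothetical pool of 10 worker threads (an assumption, not a default).
fast = estimated_drain_seconds(backlog, worker_threads=10, avg_task_seconds=0.5)
slow = estimated_drain_seconds(backlog, worker_threads=10, avg_task_seconds=5.0)

print(f"0.5 s per task: ~{fast / 60:.0f} min to drain")
print(f"5.0 s per task: ~{slow / 60:.0f} min to drain")
```

Under these assumptions the backlog is bounded by the number of workers and the time each call blocks one, not by CPU or database load, which would be consistent with tasks waiting in scheduled while the resource metrics stay low.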

You might consider isolation groups for these longer HTTP tasks, so that the fast tasks can run through the regular HTTP workers:

https://conductor.netflix.com/configuration/isolationgroups.html
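As a minimal sketch of what that could look like, the snippet below builds a task definition payload that tags a slow HTTP task with an isolation group. The `isolationGroupId` field name is my reading of the linked page and should be verified against your Conductor version; the task names, group name, and email are hypothetical. The server also has to have isolated system task workers enabled, as described on that page.

```python
import json


def http_task_def(name, isolation_group_id=None, owner_email="owner@example.com"):
    """Build a task definition dict, optionally tagged with an isolation group.

    Field names are assumptions based on the isolation groups docs;
    check them against the task definition schema of your version.
    """
    task_def = {
        "name": name,
        "ownerEmail": owner_email,  # hypothetical address
        "retryCount": 3,
        "timeoutSeconds": 300,
    }
    if isolation_group_id is not None:
        # Tasks sharing this id are polled by their own worker pool.
        task_def["isolationGroupId"] = isolation_group_id
    return task_def


# Hypothetical names: one slow task isolated, one fast task left on the
# regular HTTP workers.
task_defs = [
    http_task_def("slow_http_call", isolation_group_id="slowHttpGroup"),
    http_task_def("fast_http_call"),
]

# This list is the kind of body you would send when registering task definitions.
print(json.dumps(task_defs, indent=2))
```

Only the slow task carries the group id here, so the fast tasks keep using the shared pool and are no longer queued behind the slow ones.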

solution1 0 2022-08-23 19:44:31
