Hi all. I have a basic question, but my research has not turned up an obvious answer. I have a Camunda instance running in Tomcat on Ubuntu, with a MariaDB database. I submitted approximately 150 instances of a particular workflow. After some time, execution appears to be complete, but Cockpit still shows 11 jobs as running. It is not clear to me what state these process instances are in, and I am looking for tips on how to debug further. I scanned the Tomcat and application logs and did not find any obvious failures.
The workflow contains a multi-instance subprocess that runs child flows in parallel. All child flows must complete before the subprocess can complete. All tasks are configured with asyncAfter = true and exclusive = false. Most of the tasks are service tasks that use the HTTP connector to call external microservices.
I can provide more detail, but wanted to get some input on the debugging process so that I can provide the right data.
I see 11 jobs in the ACT_RU_JOB table, all of which have 3 retries left. In all cases HANDLER_TYPE_ is async-continuation, and HANDLER_CFG_ is either activity-end or transition-notify-listener-take$SequenceFlow_…
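To make the state of those rows easier to report, here is a minimal sketch of how each job could be triaged. It assumes the rows have already been fetched from ACT_RU_JOB as dictionaries keyed by column name, that the columns are named as in the Camunda 7 schema (RETRIES_, DUEDATE_, LOCK_OWNER_, LOCK_EXP_TIME_, SUSPENSION_STATE_), and that suspension state 2 means suspended. The function name `classify_job` and the category labels are hypothetical, and the checks only approximate the job acquisition conditions as I understand them.

```python
from datetime import datetime


def classify_job(row, now):
    """Return a rough reason why a job row is (or is not) eligible to run.

    `row` is a dict keyed by ACT_RU_JOB column names (assumed, see above).
    `now` is the current time as seen by the database.
    """
    # Assumption: SUSPENSION_STATE_ is 1 for active and 2 for suspended.
    if row.get("SUSPENSION_STATE_") == 2:
        return "suspended"

    # A job with no retries left is not acquired again.
    retries = row.get("RETRIES_")
    if retries is not None and retries <= 0:
        return "no-retries"

    # A lock that has not expired means some executor claims to own the job.
    lock_exp = row.get("LOCK_EXP_TIME_")
    if row.get("LOCK_OWNER_") and lock_exp is not None and lock_exp > now:
        return "locked"

    # A due date in the future delays acquisition.
    due = row.get("DUEDATE_")
    if due is not None and due > now:
        return "waiting-for-due-date"

    return "acquirable"
```

Rows that come out as `acquirable` yet never run would suggest looking at the job executor rather than at the jobs themselves, while rows stuck in `locked` would point at whichever node is named in LOCK_OWNER_.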
I am also seeing some optimistic locking exceptions.
I am also seeing occurrences of the following exception. It references the POST used to start the process, but I am fairly sure that all processes were launched. There may also be timeout issues between the service tasks and the external microservices they call.
I simplified the execution by queuing one process instance at a time, but still see the problem (processes stalling). All stalled jobs have HANDLER_CFG_ = activity-end or transition-notify-listener-take$SequenceFlow_…
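To check whether the stalls cluster at particular points in the model, the HANDLER_CFG_ values could be grouped as in the sketch below. It assumes the value is either a bare handler kind or a handler kind followed by `$` and a sequence flow id, which matches the two shapes described above. The helper names and the sample ids (`SequenceFlow_a`, `SequenceFlow_b`) are hypothetical placeholders, not values from my database.

```python
from collections import Counter


def split_handler_cfg(handler_cfg):
    """Split a HANDLER_CFG_ value into (handler kind, sequence flow id or None)."""
    kind, sep, flow_id = handler_cfg.partition("$")
    return kind, (flow_id if sep else None)


def summarize(handler_cfgs):
    """Count stalled jobs per (handler kind, sequence flow id) pair."""
    return Counter(split_handler_cfg(cfg) for cfg in handler_cfgs)


# Hypothetical sample input; the ids after '$' are placeholders.
sample = [
    "activity-end",
    "transition-notify-listener-take$SequenceFlow_a",
    "transition-notify-listener-take$SequenceFlow_a",
    "transition-notify-listener-take$SequenceFlow_b",
]

for (kind, flow_id), count in summarize(sample).most_common():
    print(count, kind, flow_id)
```

If most stalled jobs share one sequence flow id, that flow and the activity it leaves would be the first place to look in the model.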
Any suggestions are much appreciated.