-
Improvement
-
Resolution: Unresolved
-
Major
-
None
Ansible Tower jobs can be running for a long time and it's quite normal that connection can be dropped or timeout. Currently, these situations mark the Jenkins build as failed because code throws Exception. I think the Ansible Tower plugin should have some reconnection mechanism that will handle these situations (e.g. fail after 5 consecutive connection timeouts or drops during the configured period).
[JENKINS-62525] Add some reconnection mechanism to Ansible Tower job monitoring
Description | Original: Ansible Tower jobs can be running for a long time and it's quite normal that connection can be dropped or timeout. Currently, these situations mark the Jenkins build as failed because code throws Exception. I think the Ansible Tower plugin should have some reconnecting mechanism that will handle these situations (e.g. fail after 5 consecutive connection timeouts or drops during the configured period). | New: Ansible Tower jobs can be running for a long time and it's quite normal that connection can be dropped or timeout. Currently, these situations mark the Jenkins build as failed because code throws Exception. I think the Ansible Tower plugin should have some reconnection mechanism that will handle these situations (e.g. fail after 5 consecutive connection timeouts or drops during the configured period). |
For some background on this; are you running Tower or AWX?
Also, how long are your jobs running for? Are we talking days or hours?
Do you know what is making your instance become unresponsive during that time?
Also, are you using a pipeline or a freestyle job in Jenkins? If you are using a pipeline have you tried running with the async option?
In general, I am feeling like a Tower infrastructure should be stable enough to remain up during long running jobs rather than try and have the plugin "tolerate" unstable connections. However, if there are special circumstances to be considered I might be willing to try and do something within the plugin to make it more forgiving of bad Tower connections.