Bug
Resolution: Fixed
Critical
None
Jenkins 1.529
OSX 10.8.4 (running as a VMWare Guest in VMWare Workstation 9.0.2 inside a Windows 7 Host)
also Jenkins 1.645, OSX 10.9, 10.10 (not vm)
also observed with Windows and Linux slaves.
ssh-slaves-1.31.1
I configured an OSX slave to use an SSH connection. I have an identical setup for a Linux slave. The Linux slave never hangs, but the OSX one does randomly every couple of days.
When the slave hangs, I see:
This node is being launched. See log for more details
When I click on more details I see an empty log (literally no characters) with a spinning wheel.
I'd like to disconnect the channel and try again. Unfortunately, there is no "disconnect" button, seemingly because the hang occurs too early in the connection phase.
The only way I found to fix this problem is to restart the Jenkins master. I believe this issue is high priority because:
- This hang occurs at least once a day (for over a week now).
- There is no known workaround.
- There is no way to recover except to restart the master node, which means that all running jobs have to be interrupted.
If you can add extra logging, I can try collecting more information for you. Where do we get started?
- is duplicated by JENKINS-47012 SSH Slaves launcher's afterDisconnect() is synchronous, it gets blocked by reconnect operations (Resolved)
- is related to JENKINS-48613 SSH Slaves 1.23 can create lots of threads waiting for SSHLauncher lock in tearDownConnection (Resolved)
Hi hyei,
I was having this issue today as well. After hours of googling, I found the following:
https://bugs.java.com/view_bug.do?bug_id=4820090
It looks like there is a bug in Java that makes the SSH Slave plugin hang while it is establishing a secure connection to the slave. I implemented the suggested workaround and the issue is gone for me.
In short, Java can hang when it reads random sequences from /dev/random. To work around this, it is suggested to replace /dev/random with /dev/urandom, which works more reliably with Java.
So if your master is running on Linux, you can try this workaround.
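To see which entropy source a given JVM is configured with before changing anything, a small check like the one below may help. It is only a sketch: the class name `EntropySourceCheck` and both helper methods are made up for illustration, and the "may block" test is a rough heuristic on the setting's text, not a guarantee about how a particular JDK version behaves. One commonly cited form of the workaround is starting the JVM with `-Djava.security.egd=file:/dev/./urandom`; whether that is the right fix for your setup is an assumption you should verify.

```java
import java.security.Security;

// Hypothetical helper, not part of Jenkins or the SSH Slaves plugin.
final class EntropySourceCheck {

    // Returns the java.security.egd system property if it is set,
    // otherwise the securerandom.source security property ("" if neither is set).
    public static String configuredSource() {
        String egd = System.getProperty("java.security.egd");
        if (egd != null && !egd.isEmpty()) {
            return egd;
        }
        String prop = Security.getProperty("securerandom.source");
        return prop == null ? "" : prop;
    }

    // Rough heuristic: true when the setting names the /dev/random device.
    // "file:/dev/urandom" and "file:/dev/./urandom" do not match.
    public static boolean pointsAtBlockingDevice(String source) {
        return source.endsWith("/dev/random");
    }

    public static void main(String[] args) {
        String source = configuredSource();
        System.out.println("configured entropy source: " + source);
        System.out.println("names /dev/random: " + pointsAtBlockingDevice(source));
    }
}
```

Run it with the same Java installation and the same JVM options as the Jenkins master, so that it reports the settings the master actually uses.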
Let me know if it helped.