- Bug
- Resolution: Unresolved
- Major
- None
It looks as if there is a scenario where the currentOnline Hashtable in vSphereCloud.java does not get updated as it should.
The nodes work as they should for a while, and then new nodes fail to launch.
From the looks of it, the method vSphereCloud.canMarkVMOnline() returns FALSE at its 2nd or 3rd if statement.
I had a quick look and I think the problem is in the vSphereCloudLauncher.postDisconnectVSphereActions() method.
The call vSphereCloud vsC = findOurVsInstance(); throws an exception just before markVMOffline() is called.
The VM is then never marked as offline, so you can never launch a node on that VM again until you restart Jenkins.
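To illustrate the suspected failure mode, here is a minimal, self-contained sketch. It is not the plugin's actual code: the class OnlineTrackerSketch, the static methods, the boolean lookupFails switch, and disconnectSafe() are all hypothetical stand-ins that only borrow the names mentioned above. It also assumes the bookkeeping can be reached even when the instance lookup fails, which may not hold in the real plugin. The point is only that an exception thrown before markVMOffline() leaves a stale entry in currentOnline, and that a try/finally would be one possible way to avoid that.

```java
import java.util.Hashtable;

// Hypothetical stand-in for the bookkeeping described in this report.
// None of this is the real vSphere Cloud plugin code.
class OnlineTrackerSketch {
    // Assumed shape: VM name -> name of the node currently using it.
    static final Hashtable<String, String> currentOnline = new Hashtable<>();

    // A VM may only be brought online if no entry is recorded for it.
    static boolean canMarkVMOnline(String nodeName, String vmName) {
        return !currentOnline.containsKey(vmName);
    }

    static void markVMOnline(String nodeName, String vmName) {
        currentOnline.put(vmName, nodeName);
    }

    static void markVMOffline(String nodeName, String vmName) {
        currentOnline.remove(vmName);
    }

    // Stand-in for findOurVsInstance(); lookupFails simulates the exception.
    static void findOurVsInstance(boolean lookupFails) {
        if (lookupFails) {
            throw new IllegalStateException("simulated lookup failure");
        }
    }

    // Suspected current behaviour: the exception skips markVMOffline().
    static void disconnectUnsafe(String nodeName, String vmName, boolean lookupFails) {
        findOurVsInstance(lookupFails);
        markVMOffline(nodeName, vmName);
    }

    // One possible guard: always clear the entry, even if the lookup throws.
    static void disconnectSafe(String nodeName, String vmName, boolean lookupFails) {
        try {
            findOurVsInstance(lookupFails);
        } finally {
            markVMOffline(nodeName, vmName);
        }
    }

    public static void main(String[] args) {
        markVMOnline("node-1", "vm-1");
        try {
            disconnectUnsafe("node-1", "vm-1", true);
        } catch (IllegalStateException e) {
            // The stale entry for vm-1 is still in currentOnline here.
        }
        System.out.println(canMarkVMOnline("node-2", "vm-1")); // prints "false"

        try {
            disconnectSafe("node-1", "vm-1", true);
        } catch (IllegalStateException e) {
            // The exception still propagates, but the entry was cleared first.
        }
        System.out.println(canMarkVMOnline("node-2", "vm-1")); // prints "true"
    }
}
```

In this sketch the first check stays false until the entry is removed, which matches the symptom of nodes refusing to launch until Jenkins is restarted. Whether a try/finally is the right fix for the plugin itself would depend on what else postDisconnectVSphereActions() does.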
Did this ever go anywhere? We may have just run into this issue, possibly triggered by a storage problem in vSphere. We got some failed startups due to the storage problem, but after clearing that up, slaves would still not start. The only thing in the slave log was "ERROR: The vSphere cloud will not allow this slave to start at this time." As in this bug report, restarting Jenkins cleared it up.