[JENKINS-18438] Node monitoring should run in parallel - Jenkins Jira

Type: Bug
Resolution: Fixed
Priority: Major
Component/s: core, remoting
Labels:
None

Similar Issues:
Powered by SuggestiMate

Show

As of 1.520, AbstractNodeMonitorDescriptor monitors nodes sequentially. As the # of slaves go up, this will take a long time to complete, and this also makes the monitoring susceptive to a hang.

While a ping thread is there to detect unresponsive nodes, its interval is 10mins and the time out is 4mins, so a few unresonsive nodes can quickly push the total running time of node monitoring beyond the default monitoring cycle of 1 hour.

A better approach is to make asynchronous remoting calls to all the slaves at once, then wait for the results to come back. This way, we can get the result back for ones that are functioning.

is related to

JENKINS-18671 Clock Difference broken on Manage Nodes page

Resolved

JENKINS-18152 Thread interruption in remoting proxy results in UndeclaredThrowableException

Resolved

Assignee:: Kohsuke Kawaguchi

Reporter:: Kohsuke Kawaguchi

Votes:: 2 Vote for this issue

Watchers:: 9 Start watching this issue

Created:: 2013-06-20 21:49

Updated:: 2013-07-29 19:13

Resolved:: 2013-07-29 19:13

Details

Description

Attachments

Issue Links

Activity

People

Dates