[JENKINS-2548] Node does not come back online after disk space cleared

Type: Bug
Resolution: Fixed
Priority: Major
Component/s: remoting
Labels:
None
Environment:
Platform: All, OS: All

Similar Issues:
Powered by SuggestiMate

Show

We are using Hudson as a single master server and had it go offline due to less
than 1GB disk space being enabled.

After we clear some disk space Hudson does not come back online until we restart
the servlet container. Could it not detect that there is enough disk space
available and come back online automatically?

is duplicated by

JENKINS-12021 Slave node does not made online if it gets above a certain threshold

Resolved

JENKINS-12882 Master stays offline if once low disk space occured

Closed

is related to

JENKINS-13441 Slave status information shouldn't be stored in main config file

Resolved

manderson23 created issue - 2008-10-30 07:19

Andrew Bayer made changes - 2011-12-01 22:30

Assignee

New: Andrew Bayer [ abayer ]

SCM/JIRA link daemon added a comment - 2011-12-01 22:51

Code changed in jenkins
User: Andrew Bayer
Path:
changelog.html
core/src/main/java/hudson/node_monitors/AbstractDiskSpaceMonitor.java
core/src/main/java/hudson/node_monitors/AbstractNodeMonitorDescriptor.java
core/src/main/resources/hudson/node_monitors/Messages.properties
http://jenkins-ci.org/commit/jenkins/e38e687d5b66238f406d1e3642a3d5f6a02aaeb2
Log:
[FIXED JENKINS-2548] Slaves taken offline for low disk space will now
come back online when disk space becomes available.

SCM/JIRA link daemon added a comment - 2011-12-01 22:51 Code changed in jenkins User: Andrew Bayer Path: changelog.html core/src/main/java/hudson/node_monitors/AbstractDiskSpaceMonitor.java core/src/main/java/hudson/node_monitors/AbstractNodeMonitorDescriptor.java core/src/main/resources/hudson/node_monitors/Messages.properties http://jenkins-ci.org/commit/jenkins/e38e687d5b66238f406d1e3642a3d5f6a02aaeb2 Log: [FIXED JENKINS-2548] Slaves taken offline for low disk space will now come back online when disk space becomes available.

SCM/JIRA link daemon made changes - 2011-12-01 22:51

Resolution		New: Fixed [ 1 ]
Status	Original: Open [ 1 ]	New: Resolved [ 5 ]

dogfood added a comment - 2011-12-02 00:48

Integrated in jenkins_main_trunk #1334
[FIXED JENKINS-2548] Slaves taken offline for low disk space will now

Andrew Bayer : e38e687d5b66238f406d1e3642a3d5f6a02aaeb2
Files :

core/src/main/java/hudson/node_monitors/AbstractNodeMonitorDescriptor.java
core/src/main/java/hudson/node_monitors/AbstractDiskSpaceMonitor.java
changelog.html
core/src/main/resources/hudson/node_monitors/Messages.properties

dogfood added a comment - 2011-12-02 00:48 Integrated in jenkins_main_trunk #1334 [FIXED JENKINS-2548] Slaves taken offline for low disk space will now Andrew Bayer : e38e687d5b66238f406d1e3642a3d5f6a02aaeb2 Files : core/src/main/java/hudson/node_monitors/AbstractNodeMonitorDescriptor.java core/src/main/java/hudson/node_monitors/AbstractDiskSpaceMonitor.java changelog.html core/src/main/resources/hudson/node_monitors/Messages.properties

SCM/JIRA link daemon added a comment - 2011-12-02 01:43

Code changed in jenkins
User: Kohsuke Kawaguchi
Path:
changelog.html
core/src/main/java/hudson/node_monitors/AbstractDiskSpaceMonitor.java
core/src/main/java/hudson/node_monitors/AbstractNodeMonitorDescriptor.java
core/src/main/resources/hudson/node_monitors/Messages.properties
http://jenkins-ci.org/commit/jenkins/706b2dfd71904224399e52843233c12e219803e4
Log:
Revert "[FIXED JENKINS-2548] Slaves taken offline for low disk space will now"

This reverts commit e38e687d5b66238f406d1e3642a3d5f6a02aaeb2.

Compare: https://github.com/jenkinsci/jenkins/compare/e38e687...706b2df

SCM/JIRA link daemon added a comment - 2011-12-02 01:43 Code changed in jenkins User: Kohsuke Kawaguchi Path: changelog.html core/src/main/java/hudson/node_monitors/AbstractDiskSpaceMonitor.java core/src/main/java/hudson/node_monitors/AbstractNodeMonitorDescriptor.java core/src/main/resources/hudson/node_monitors/Messages.properties http://jenkins-ci.org/commit/jenkins/706b2dfd71904224399e52843233c12e219803e4 Log: Revert " [FIXED JENKINS-2548] Slaves taken offline for low disk space will now" This reverts commit e38e687d5b66238f406d1e3642a3d5f6a02aaeb2. Compare: https://github.com/jenkinsci/jenkins/compare/e38e687...706b2df

Kohsuke Kawaguchi made changes - 2011-12-02 01:44

Resolution	Original: Fixed [ 1 ]
Status	Original: Resolved [ 5 ]	New: Reopened [ 4 ]

dogfood added a comment - 2011-12-02 03:18

Integrated in jenkins_main_trunk #1335
Revert "[FIXED JENKINS-2548] Slaves taken offline for low disk space will now"

Kohsuke Kawaguchi : 706b2dfd71904224399e52843233c12e219803e4
Files :

core/src/main/java/hudson/node_monitors/AbstractNodeMonitorDescriptor.java
changelog.html
core/src/main/resources/hudson/node_monitors/Messages.properties
core/src/main/java/hudson/node_monitors/AbstractDiskSpaceMonitor.java

dogfood added a comment - 2011-12-02 03:18 Integrated in jenkins_main_trunk #1335 Revert " [FIXED JENKINS-2548] Slaves taken offline for low disk space will now" Kohsuke Kawaguchi : 706b2dfd71904224399e52843233c12e219803e4 Files : core/src/main/java/hudson/node_monitors/AbstractNodeMonitorDescriptor.java changelog.html core/src/main/resources/hudson/node_monitors/Messages.properties core/src/main/java/hudson/node_monitors/AbstractDiskSpaceMonitor.java

Andrew Bayer added a comment - 2011-12-02 17:00

kohsuke - what would be the best way to record in the DiskSpace OfflineCause which specific monitor is the reason? Subclassing it further, or adding a flag of some sort?

Andrew Bayer added a comment - 2011-12-02 17:00 kohsuke - what would be the best way to record in the DiskSpace OfflineCause which specific monitor is the reason? Subclassing it further, or adding a flag of some sort?

Kohsuke Kawaguchi added a comment - 2011-12-02 23:09

I think we need Computers to treat NodeMonitors as something special. We can have Computers remember the set of NodeMonitors that raising a red flag, and isOffline() would check if this set is empty. This leaves "temporarily offline" concept for administrator's use alone.

This also means NodeMonitors should have a backdoor to raise/drop this red flag, and existing NodeMonitors should be modified to use this mechanism so that automatic on/off and administrative manual on/off will not collide with each other.

I think such a distinction is the only way to make it work correctly in the presence of multiple node monitors reporting problems.

Kohsuke Kawaguchi added a comment - 2011-12-02 23:09 I think we need Computers to treat NodeMonitors as something special. We can have Computers remember the set of NodeMonitors that raising a red flag, and isOffline() would check if this set is empty. This leaves "temporarily offline" concept for administrator's use alone. This also means NodeMonitors should have a backdoor to raise/drop this red flag, and existing NodeMonitors should be modified to use this mechanism so that automatic on/off and administrative manual on/off will not collide with each other. I think such a distinction is the only way to make it work correctly in the presence of multiple node monitors reporting problems.

Assignee:: Andrew Bayer

Reporter:: manderson23

Votes:: 8 Vote for this issue

Watchers:: 9 Start watching this issue

Created:: 2008-10-30 07:19

Updated:: 2020-10-09 12:31

Resolved:: 2013-02-12 13:52

Jenkins

Details

Description

Attachments

Issue Links

Activity

Collapse comment: SCM/JIRA link daemon added a comment - 2011-12-01 22:51

Expand comment: SCM/JIRA link daemon added a comment - 2011-12-01 22:51

Collapse comment: dogfood added a comment - 2011-12-02 00:48

Expand comment: dogfood added a comment - 2011-12-02 00:48

Collapse comment: SCM/JIRA link daemon added a comment - 2011-12-02 01:43

Expand comment: SCM/JIRA link daemon added a comment - 2011-12-02 01:43

Collapse comment: dogfood added a comment - 2011-12-02 03:18

Expand comment: dogfood added a comment - 2011-12-02 03:18

Collapse comment: Andrew Bayer added a comment - 2011-12-02 17:00

Expand comment: Andrew Bayer added a comment - 2011-12-02 17:00

Collapse comment: Kohsuke Kawaguchi added a comment - 2011-12-02 23:09

Expand comment: Kohsuke Kawaguchi added a comment - 2011-12-02 23:09

People

Dates