[JENKINS-32986] hard killing a pipeline leaves the JVM CPS thread running.

Type: Improvement
Resolution: Unresolved
Priority: Minor
Component/s: workflow-cps-plugin
Labels:
None
Environment:
pipeline 1.13
jenkins 1.642.1

If a pipeline build will not die, you can hard kill it. However, the hard kill leaves the JVM's CPS thread running on the master.

For example, with the script

def spin() {
    // Busy loop that never returns.
    while (true) {}
}

def map = [:]
map["spin_it"] = { spin() }
parallel map

You will need to hard kill the build to stop it (on Windows at least). Inspecting the JVM threads afterwards shows that the CPS thread is still running in a tight loop.
A hard kill should probably also forcibly kill the thread, provided that is safe and does not cause deadlocks elsewhere. Otherwise the leftover threads accumulate, and after a while you may run out of handles or other native resources, at which point you need to restart Jenkins to get it working again.
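
One way to check for a leftover spinning thread from inside the JVM is the standard ThreadMXBean API. The following is only a sketch: the class name, the helper method, and the thread name "demo-spinner" are made up for illustration, and the actual name of the CPS thread in a given Jenkins install is not assumed here.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Jenkins or workflow-cps.
final class BusyThreadFinder {

    /** Returns the names of live RUNNABLE threads whose name contains the given fragment. */
    static List<String> runnableThreadsNamed(String fragment) {
        ThreadMXBean mx = ManagementFactory.getThreadMXBean();
        List<String> hits = new ArrayList<>();
        for (ThreadInfo info : mx.getThreadInfo(mx.getAllThreadIds())) {
            if (info == null) {
                continue; // the thread died between the two calls
            }
            if (info.getThreadState() == Thread.State.RUNNABLE
                    && info.getThreadName().contains(fragment)) {
                hits.add(info.getThreadName());
            }
        }
        return hits;
    }

    public static void main(String[] args) throws InterruptedException {
        // Stand-in for a thread stuck in a tight loop.
        Thread spinner = new Thread(() -> {
            while (!Thread.currentThread().isInterrupted()) {
                // spin
            }
        }, "demo-spinner");
        spinner.setDaemon(true);
        spinner.start();
        Thread.sleep(100);
        System.out.println(runnableThreadsNamed("demo-spinner"));
        spinner.interrupt();
    }
}
```

A thread that stays RUNNABLE across repeated checks long after its build was killed is a candidate for the leak described above; a thread dump from jstack gives the same information with stack traces.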

is blocking

JENKINS-25550 Hard kill

Resolved

is related to

JENKINS-25623 timeout step should be able to kill infinite loop

Resolved

JENKINS-45772 Build cannot be aborted when plugin is in waitUntilContainerIsReady

Open

JENKINS-31484 Endless loop in DefaultInvoker.getProperty when accessing field via getter/setter without @

Resolved

JENKINS-37719 Build cannot be interrupted if `docker stop` hangs

Resolved

JENKINS-30978 URLConnection.content.text hangs

Resolved

JENKINS-32228 Event when timeout is reach should be customizable

Resolved

relates to

JENKINS-43276 CoreWrapperStep should run SimpleBuildWrapper.setUp asynchronously

Resolved

JENKINS-47006 durable-task's BourneShellScript.launchWithCookie trips workflow-cps-plugin's 5-minute timeout

Resolved

JENKINS-44785 Add Built-in Request timeout support in Remoting

Open

JENKINS-42561 Users should be able to custom configure the timeout on pipeline build wrappers/steps

Resolved

links to

CloudBees Internal OSS-1488

workflow-cps PR 102

workflow-support PR 29

workflow-support PR 37


James Nord created issue - 2016-02-16 22:51

James Nord made changes - 2016-02-16 22:52

Link

New: This issue is blocking ~~JENKINS-25550~~ [ ~~JENKINS-25550~~ ]

James Nord added a comment - 2016-02-16 22:52

Not sure it can be blocking something that is fixed but here goes.

James Nord made changes - 2016-02-16 22:53

Description

Original: In the event a workflow won;t die you can hard kill it - however hard killing it will leave the JVMs CPS thread still running on the master.

e.g. with the workflow
{{noformat}}
def spin() {
while (true) {}
}

def map = [:]
map ["spin_it"] = { spin() }
}
parallel map
{{noformat}}

you will need to hard kill it to stop it (on windows at least) - but inspecting the JVM threads you can see the CPS thread is still running in a tight loop.
A hard kill should probably (if it is safe without causing deadlocks elsewhere) brutally kill the thread as well. After a while you may run out of handles or other native resources due to the thread usage, meaning you need to restart Jenkins to get it working again.

New: In the event a workflow won;t die you can hard kill it - however hard killing it will leave the JVMs CPS thread still running on the master.

e.g. with the workflow
{noformat}
def spin() {
while (true) {}
}

def map = [:]
map ["spin_it"] = { spin() }
}
parallel map
{noformat}

you will need to hard kill it to stop it (on windows at least) - but inspecting the JVM threads you can see the CPS thread is still running in a tight loop.
A hard kill should probably (if it is safe without causing deadlocks elsewhere) brutally kill the thread as well. After a while you may run out of handles or other native resources due to the thread usage, meaning you need to restart Jenkins to get it working again.

Jesse Glick made changes - 2016-02-16 22:54

Description

Original: In the event a workflow won;t die you can hard kill it - however hard killing it will leave the JVMs CPS thread still running on the master.

e.g. with the workflow
{noformat}
def spin() {
while (true) {}
}

def map = [:]
map ["spin_it"] = { spin() }
}
parallel map
{noformat}

you will need to hard kill it to stop it (on windows at least) - but inspecting the JVM threads you can see the CPS thread is still running in a tight loop.
A hard kill should probably (if it is safe without causing deadlocks elsewhere) brutally kill the thread as well. After a while you may run out of handles or other native resources due to the thread usage, meaning you need to restart Jenkins to get it working again.

New: In the event a pipeline build will not die you can hard kill it - however hard killing it will leave the JVMs CPS thread still running on the master.

e.g. with the script

{code}
def spin() {
while (true) {}
}

def map = [:]
map ["spin_it"] = { spin() }
}
parallel map
{code}

you will need to hard kill it to stop it (on windows at least) - but inspecting the JVM threads you can see the CPS thread is still running in a tight loop.
A hard kill should probably (if it is safe without causing deadlocks elsewhere) brutally kill the thread as well. After a while you may run out of handles or other native resources due to the thread usage, meaning you need to restart Jenkins to get it working again.

R. Tyler Croy made changes - 2016-07-25 23:52

Workflow

Original: JNJira [ 168715 ]

New: JNJira + In-Review [ 183239 ]

Jesse Glick made changes - 2016-08-18 13:08

Link

New: This issue is related to ~~JENKINS-25623~~ [ ~~JENKINS-25623~~ ]

Jesse Glick added a comment - 2016-08-26 15:00

Picking up some stuff from ~~JENKINS-25623~~:

If the CPS VM is running native code, Thread.interrupt should be called. The thread should be given a limited grace period (say, a few seconds) to terminate; after that, resort to Thread.stop, making sure we are able to provide a fresh Thread for the pool so we can still run finally blocks or whatever.
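
The interrupt-then-wait part of this proposal could look roughly like the sketch below. The class and method names are hypothetical and nothing here is taken from workflow-cps. The Thread.stop fallback is deliberately left out: it is deprecated, and on Java 20 and later it throws UnsupportedOperationException, so the sketch only reports whether the thread honoured the interrupt within the grace period.

```java
import java.util.concurrent.TimeUnit;

// Hypothetical helper, not part of Jenkins or workflow-cps.
final class GracefulInterrupt {

    /**
     * Interrupts the worker and waits up to the grace period for it to exit.
     * Returns true if the thread terminated, false if it is still alive.
     */
    static boolean interruptAndAwait(Thread worker, long grace, TimeUnit unit) {
        worker.interrupt();
        long millis = Math.max(1L, unit.toMillis(grace)); // join(0) would wait forever
        try {
            worker.join(millis);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the caller's interrupt status
        }
        return !worker.isAlive();
    }

    public static void main(String[] args) {
        // Blocks in an interruptible call, so it reacts to interrupt().
        Thread cooperative = new Thread(() -> {
            try {
                Thread.sleep(60_000);
            } catch (InterruptedException e) {
                // exit on interrupt
            }
        }, "cooperative");

        // Tight loop that never checks the interrupt flag.
        Thread stubborn = new Thread(() -> {
            while (true) {
                // spin
            }
        }, "stubborn");
        stubborn.setDaemon(true); // lets the demo JVM exit

        cooperative.start();
        stubborn.start();
        System.out.println(interruptAndAwait(cooperative, 2, TimeUnit.SECONDS));
        System.out.println(interruptAndAwait(stubborn, 2, TimeUnit.SECONDS));
    }
}
```

The demo prints true for the cooperative thread and false for the spinning one, which is the case where a harder fallback (and a fresh thread for the pool) would be needed.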
We may also need some sort of per-build CPS VM CPU quota, distinct from timeout in that we do not care about wall clock time spent running a shell script on an agent; we just care about not overloading the master. Alternatively, if a given build starts taking too much CPU time (measurable via System.nanoTime around runNextChunk), gradually begin delaying its chunk execution (i.e., CpsThreadGroup.scheduleRun may call schedule rather than submit) so that it does not hog the system, and also institute a hard time limit for individual chunks (such as slow native methods).
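
The throttling idea can be sketched with a plain ScheduledExecutorService. Everything in this block is an assumption made for illustration: the class, the budget, the delay policy (one millisecond of delay per millisecond over budget, capped), and the cap itself are hypothetical, and the code does not touch CpsThreadGroup. Note that System.nanoTime measures elapsed time inside the chunk, which only approximates CPU time.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.ScheduledFuture;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical per-build throttle, not part of Jenkins or workflow-cps.
final class ThrottledChunkRunner {
    static final long MAX_DELAY_MILLIS = 5_000; // assumed cap on the back-off

    private final ScheduledExecutorService pool;
    private final long budgetNanos;
    private final AtomicLong usedNanos = new AtomicLong();

    ThrottledChunkRunner(ScheduledExecutorService pool, long budgetNanos) {
        this.pool = pool;
        this.budgetNanos = budgetNanos;
    }

    /** Adds time spent in a chunk to this build's running total. */
    void charge(long nanos) {
        usedNanos.addAndGet(nanos);
    }

    /** Zero while under budget; afterwards grows with the overage, up to the cap. */
    long nextDelayMillis() {
        long over = usedNanos.get() - budgetNanos;
        if (over <= 0) {
            return 0;
        }
        return Math.min(MAX_DELAY_MILLIS, TimeUnit.NANOSECONDS.toMillis(over));
    }

    /** Runs the chunk after the current delay and charges its elapsed time. */
    ScheduledFuture<?> scheduleChunk(Runnable chunk) {
        return pool.schedule(() -> {
            long start = System.nanoTime();
            try {
                chunk.run();
            } finally {
                charge(System.nanoTime() - start);
            }
        }, nextDelayMillis(), TimeUnit.MILLISECONDS);
    }

    private static void busyWork(long millis) {
        long end = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(millis);
        while (System.nanoTime() < end) {
            // spin to simulate a CPU-heavy chunk
        }
    }

    public static void main(String[] args) throws Exception {
        ScheduledExecutorService pool = Executors.newSingleThreadScheduledExecutor();
        ThrottledChunkRunner runner =
                new ThrottledChunkRunner(pool, TimeUnit.MILLISECONDS.toNanos(50));
        for (int i = 0; i < 3; i++) {
            System.out.println("delay before chunk " + i + ": " + runner.nextDelayMillis() + " ms");
            runner.scheduleChunk(() -> busyWork(40)).get();
        }
        pool.shutdown();
    }
}
```

A delay of zero behaves like an immediate submit, so a build that stays under its budget is not slowed down; only builds that keep consuming time get pushed back. A hard limit for a single chunk would still need something like the interrupt handling discussed above.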

Jesse Glick made changes - 2016-08-26 15:19

Link

New: This issue is related to ~~JENKINS-37719~~ [ ~~JENKINS-37719~~ ]

Andrew Bayer made changes - 2016-08-26 21:51

Component/s

New: pipeline-general [ 21692 ]

Assignee: Jesse Glick
Reporter: James Nord
Votes: 4
Watchers: 14
Created: 2016-02-16 22:51
Updated: 2017-12-05 05:43
