[JENKINS-17667] Unable to kill a job which is running

Aswini Rajasekaran added a comment - 2013-04-19 10:11

Any updates on this issue?

Aswini Rajasekaran added a comment - 2013-04-19 10:11 Any updates on this issue?

Aswini Rajasekaran added a comment - 2013-04-23 11:02

Why am I not getting any updates? I know that there will be lot of issues daily and its hard to resolve everything at the earliest. But atleast a small comment saying that we are working on this would do. This is a blocker for me daily. I end up restarting jenkins to resolve this issue.

Aswini Rajasekaran added a comment - 2013-04-23 11:02 Why am I not getting any updates? I know that there will be lot of issues daily and its hard to resolve everything at the earliest. But atleast a small comment saying that we are working on this would do. This is a blocker for me daily. I end up restarting jenkins to resolve this issue.

Mark Waite added a comment - 2013-04-23 11:54

Aswini, I think you may have misunderstood the nature of open source projects. Open source projects are addressed by a community of interested users who implement enhancements and work on problems based on their needs and their interests.

I think the response you're expecting is much closer to commercial support, rather than an open source community. You might consider contacting CloudBees about their commercially supported offering based on Jenkins.

There is also a concept of a "bug bounty" that I've seen offered elsewhere in the Jenkins project, though I'm not sure if that generally has the result you are seeking, since you seem to be seeking response times more typical of commercial products than open source projects.

Mark Waite added a comment - 2013-04-23 11:54 Aswini, I think you may have misunderstood the nature of open source projects. Open source projects are addressed by a community of interested users who implement enhancements and work on problems based on their needs and their interests. I think the response you're expecting is much closer to commercial support, rather than an open source community. You might consider contacting CloudBees about their commercially supported offering based on Jenkins. There is also a concept of a "bug bounty" that I've seen offered elsewhere in the Jenkins project, though I'm not sure if that generally has the result you are seeking, since you seem to be seeking response times more typical of commercial products than open source projects.

Aswini Rajasekaran added a comment - 2013-04-23 12:02

Hi Mark, I thought this forum will be watched by the Jenkins developers and they will post a solution for my question. If you have any answer for my query, can you let me know? Thanks.

Aswini Rajasekaran added a comment - 2013-04-23 12:02 Hi Mark, I thought this forum will be watched by the Jenkins developers and they will post a solution for my question. If you have any answer for my query, can you let me know? Thanks.

Mark Waite added a comment - 2013-04-23 12:11

I don't have an answer to your question. I've observed that sometimes a Jenkins job is harder to interrupt than others. My usual technique has been to click the "x" to stop the job, then if the job has not stopped shortly, I'll click somewhere else in the UI (causing the page to refresh), then click the "x" to stop the job a second time.

Mark Waite added a comment - 2013-04-23 12:11 I don't have an answer to your question. I've observed that sometimes a Jenkins job is harder to interrupt than others. My usual technique has been to click the "x" to stop the job, then if the job has not stopped shortly, I'll click somewhere else in the UI (causing the page to refresh), then click the "x" to stop the job a second time.

Aswini Rajasekaran added a comment - 2013-04-23 12:15

My issue is that there is no log itself for that job and it keeps running for null time in the machine. The only way for me to kick off the job is to do after restarting jenkins.
This happens when I kill the job as soon as it starts.

Aswini Rajasekaran added a comment - 2013-04-23 12:15 My issue is that there is no log itself for that job and it keeps running for null time in the machine. The only way for me to kick off the job is to do after restarting jenkins. This happens when I kill the job as soon as it starts.

ikedam added a comment - 2014-02-26 05:18

This seems a issue of Jenkins-core rather than build-timeout plugin.

ikedam added a comment - 2014-02-26 05:18 This seems a issue of Jenkins-core rather than build-timeout plugin.

Axel Berndt added a comment - 2014-03-12 10:44 - edited

I think I have the same problem. What I see is this:

Job timed out (using the build timeout plugin)
There is no system process on the machine any more for the job (ps ax)

There is a thread running inside Jenkins having this stack (taken from http://jenkins:8080/threadDump):

Executor #12 for master : executing MyJob #1381

"Executor #12 for master : executing MyJob #1381" Id=4432 Group=main RUNNABLE
	at java.util.WeakHashMap.get(WeakHashMap.java:471)
	at hudson.tools.InstallerTranslator.getToolHome(InstallerTranslator.java:55)
	at hudson.tools.ToolLocationNodeProperty.getToolHome(ToolLocationNodeProperty.java:107)
	at hudson.tools.ToolInstallation.translateFor(ToolInstallation.java:204)
	at hudson.tasks.Maven$MavenInstallation.forNode(Maven.java:610)
	at hudson.maven.MavenModuleSetBuild.getEnvironment(MavenModuleSetBuild.java:182)
	at hudson.scm.SubversionSCM.getModuleRoot(SubversionSCM.java:1554)
	at hudson.model.AbstractBuild.getModuleRoot(AbstractBuild.java:372)
	at hudson.maven.MavenModuleSetBuild$MavenModuleSetBuildExecution.doRun(MavenModuleSetBuild.java:698)
	at hudson.model.AbstractBuild$AbstractBuildExecution.run(AbstractBuild.java:585)
	at hudson.model.Run.execute(Run.java:1676)
	at hudson.maven.MavenModuleSetBuild.run(MavenModuleSetBuild.java:519)
	at hudson.model.ResourceController.execute(ResourceController.java:88)
	at hudson.model.Executor.run(Executor.java:231)

	Number of locked synchronizers = 1
	- java.util.concurrent.locks.ReentrantLock$NonfairSync@67ae5a69

Jenkins UI shows the job still running and denies another execution

Axel Berndt added a comment - 2014-03-12 10:44 - edited I think I have the same problem. What I see is this: Job timed out (using the build timeout plugin) There is no system process on the machine any more for the job ( ps ax ) There is a thread running inside Jenkins having this stack (taken from http://jenkins:8080/threadDump): Executor #12 for master : executing MyJob #1381 "Executor #12 for master : executing MyJob #1381" Id=4432 Group=main RUNNABLE at java.util.WeakHashMap.get(WeakHashMap.java:471) at hudson.tools.InstallerTranslator.getToolHome(InstallerTranslator.java:55) at hudson.tools.ToolLocationNodeProperty.getToolHome(ToolLocationNodeProperty.java:107) at hudson.tools.ToolInstallation.translateFor(ToolInstallation.java:204) at hudson.tasks.Maven$MavenInstallation.forNode(Maven.java:610) at hudson.maven.MavenModuleSetBuild.getEnvironment(MavenModuleSetBuild.java:182) at hudson.scm.SubversionSCM.getModuleRoot(SubversionSCM.java:1554) at hudson.model.AbstractBuild.getModuleRoot(AbstractBuild.java:372) at hudson.maven.MavenModuleSetBuild$MavenModuleSetBuildExecution.doRun(MavenModuleSetBuild.java:698) at hudson.model.AbstractBuild$AbstractBuildExecution.run(AbstractBuild.java:585) at hudson.model.Run.execute(Run.java:1676) at hudson.maven.MavenModuleSetBuild.run(MavenModuleSetBuild.java:519) at hudson.model.ResourceController.execute(ResourceController.java:88) at hudson.model.Executor.run(Executor.java:231) Number of locked synchronizers = 1 - java.util.concurrent.locks.ReentrantLock$NonfairSync@67ae5a69 Jenkins UI shows the job still running and denies another execution

Steven Deal added a comment - 2014-03-14 18:49 - edited

I've seen this from time to time. Any attempt at killing the job is useless (from the gui, from a curl command). I have seen this more frequently in the past and now it's back. I'm suspecting it may be related to using locks and latches... I wonder if there's a timeout that may have been exceeded and it's left in a state of limbo.
One thing I can suggest... it's rather drastic. But as I'm a pretty heavy user of the jenkins cli, if you pull down the job configuration, you can delete the job (removing this ghost running build) and recreate the job. You loose build history etc. but if you really need to get rid of it and a restart of Jenkins is simply out of the question.
jenkins get-job abc > config.xml
jenkins delete-job abc
jenkins create-job abc < config.xml
hth,steven
and of course 'jenkins' in the above example is a shell script of
#!/bin/bash
java -jar ~/bin/jenkins-cli.jar -s https://jenkins_url -i ~/.ssh/id_rsa $@

And by the way... having used commercial software for years.. I've never seen the level of response that's being suggested.

Steven Deal added a comment - 2014-03-14 18:49 - edited I've seen this from time to time. Any attempt at killing the job is useless (from the gui, from a curl command). I have seen this more frequently in the past and now it's back. I'm suspecting it may be related to using locks and latches... I wonder if there's a timeout that may have been exceeded and it's left in a state of limbo. One thing I can suggest... it's rather drastic. But as I'm a pretty heavy user of the jenkins cli, if you pull down the job configuration, you can delete the job (removing this ghost running build) and recreate the job. You loose build history etc. but if you really need to get rid of it and a restart of Jenkins is simply out of the question. jenkins get-job abc > config.xml jenkins delete-job abc jenkins create-job abc < config.xml hth,steven and of course 'jenkins' in the above example is a shell script of #!/bin/bash java -jar ~/bin/jenkins-cli.jar -s https://jenkins_url -i ~/.ssh/id_rsa $@ And by the way... having used commercial software for years.. I've never seen the level of response that's being suggested.

Joshua Kolash added a comment - 2014-04-07 18:52

I am encountering this issue as well. There are many threads stuck on the line

	at java.util.WeakHashMap.get(WeakHashMap.java:380)
	at hudson.tools.InstallerTranslator.getToolHome(InstallerTranslator.java:55)

The version of java I'm using:

java -version
java version "1.6.0_30"
OpenJDK Runtime Environment (IcedTea6 1.13.1) (6b30-1.13.1-1ubuntu2~0.12.04.1)
OpenJDK 64-Bit Server VM (build 23.25-b01, mixed mode)

Jenkins version:
Jenkins ver. 1.557

It seems as if the same WeakHashMap instance is being used by multiple threads, and since in the documentation for WeakHashMap it says

http://docs.oracle.com/javase/7/docs/api/java/util/WeakHashMap.html

Like most collection classes, this class is not synchronized. A synchronized WeakHashMap may be constructed using the Collections.synchronizedMap method.

It seems like you should be using Collections.synchronizedMap on this or you should prboably use
http://docs.guava-libraries.googlecode.com/git/javadoc/com/google/common/cache/CacheBuilder.html
which allows for weak keys and is thread safe.

Joshua Kolash added a comment - 2014-04-07 18:52 I am encountering this issue as well. There are many threads stuck on the line at java.util.WeakHashMap.get(WeakHashMap.java:380) at hudson.tools.InstallerTranslator.getToolHome(InstallerTranslator.java:55) The version of java I'm using: java -version java version "1.6.0_30" OpenJDK Runtime Environment (IcedTea6 1.13.1) (6b30-1.13.1-1ubuntu2~0.12.04.1) OpenJDK 64-Bit Server VM (build 23.25-b01, mixed mode) Jenkins version: Jenkins ver. 1.557 It seems as if the same WeakHashMap instance is being used by multiple threads, and since in the documentation for WeakHashMap it says http://docs.oracle.com/javase/7/docs/api/java/util/WeakHashMap.html Like most collection classes, this class is not synchronized . A synchronized WeakHashMap may be constructed using the Collections.synchronizedMap method. It seems like you should be using Collections.synchronizedMap on this or you should prboably use http://docs.guava-libraries.googlecode.com/git/javadoc/com/google/common/cache/CacheBuilder.html which allows for weak keys and is thread safe.

Joshua Kolash added a comment - 2014-04-07 19:25 - edited

I took a look at
https://github.com/jenkinsci/jenkins/blob/master/core/src/main/java/hudson/tools/InstallerTranslator.java
and see a possible issue.

none of the Map.put or Map.get calls are wrapped in a synchronized block.

According to the java memory model Each Thread can have its own local view of memory that is inconsistent with another Thread.
This is to enable multiple CPUS to have their own caching and not force all reads/writes to be consistent with each other, which would slow things down.

So that if you have

static class A { static int val= 0 };
Thread1: A.a=1;
Thread2: System.out.println(A.a); //Can print out either 0 or 1.

Inorder to have consistency you can use volatiles, so

static class A { static volatile int val= 0 };
Thread1: A.a=1;
Thread2: System.out.println(A.a); //Will print out 1.

A volatile in effect forces a read on main memory instead a per thread cache.

Another way to achieve this effect is to use a synchronized block.

Thread1: synchrnoized(lock) { A.a=1; }
Thread2: synchronized(lock) {System.out.println(A.a); } //Will print out 1.

This is from my understanding of
http://www.cs.umd.edu/~pugh/java/memoryModel/DoubleCheckedLocking.html

Now in your case, your gets and puts are not synchronized, so you can end up with strange behavior. I suggest wrapping your new WeakHashMap() invocations with Collections.synchronizedMap(new WeakHashMap())

Joshua Kolash added a comment - 2014-04-07 19:25 - edited I took a look at https://github.com/jenkinsci/jenkins/blob/master/core/src/main/java/hudson/tools/InstallerTranslator.java and see a possible issue. none of the Map.put or Map.get calls are wrapped in a synchronized block. According to the java memory model Each Thread can have its own local view of memory that is inconsistent with another Thread. This is to enable multiple CPUS to have their own caching and not force all reads/writes to be consistent with each other, which would slow things down. So that if you have static class A { static int val= 0 }; Thread1: A.a=1; Thread2: System .out.println(A.a); //Can print out either 0 or 1. Inorder to have consistency you can use volatiles, so static class A { static volatile int val= 0 }; Thread1: A.a=1; Thread2: System .out.println(A.a); //Will print out 1. A volatile in effect forces a read on main memory instead a per thread cache. Another way to achieve this effect is to use a synchronized block. Thread1: synchrnoized(lock) { A.a=1; } Thread2: synchronized (lock) { System .out.println(A.a); } //Will print out 1. This is from my understanding of http://www.cs.umd.edu/~pugh/java/memoryModel/DoubleCheckedLocking.html Now in your case, your gets and puts are not synchronized, so you can end up with strange behavior. I suggest wrapping your new WeakHashMap() invocations with Collections.synchronizedMap(new WeakHashMap())

SCM/JIRA link daemon added a comment - 2014-07-11 23:19

Code changed in jenkins
User: Joshua Kolash
Path:
core/src/main/java/hudson/tools/InstallerTranslator.java
http://jenkins-ci.org/commit/jenkins/e3952f41c5649c326ace3cc263420a8c287e1e7c
Log:
[FIXED JENKINS-17667] - Syncronization of InstallerTranslator::getToolHome()

There appears to be some unthreadsafe initialization going on here.
Initialize/get inside a synchronized block for threadsaftey.

SCM/JIRA link daemon added a comment - 2014-07-11 23:19 Code changed in jenkins User: Joshua Kolash Path: core/src/main/java/hudson/tools/InstallerTranslator.java http://jenkins-ci.org/commit/jenkins/e3952f41c5649c326ace3cc263420a8c287e1e7c Log: [FIXED JENKINS-17667] - Syncronization of InstallerTranslator::getToolHome() There appears to be some unthreadsafe initialization going on here. Initialize/get inside a synchronized block for threadsaftey.

SCM/JIRA link daemon added a comment - 2014-07-11 23:19

Code changed in jenkins
User: Daniel Beck
Path:
core/src/main/java/hudson/tools/InstallerTranslator.java
http://jenkins-ci.org/commit/jenkins/5ef293d7a0ac0a2f7a443a8460abda196f8056e0
Log:
Merge pull request #1176 from jkolash/master

[FIXED JENKINS-17667] - Syncronization of InstallerTranslator::getToolHome()

Compare: https://github.com/jenkinsci/jenkins/compare/7b70bd96d7ad...5ef293d7a0ac

SCM/JIRA link daemon added a comment - 2014-07-11 23:19 Code changed in jenkins User: Daniel Beck Path: core/src/main/java/hudson/tools/InstallerTranslator.java http://jenkins-ci.org/commit/jenkins/5ef293d7a0ac0a2f7a443a8460abda196f8056e0 Log: Merge pull request #1176 from jkolash/master [FIXED JENKINS-17667] - Syncronization of InstallerTranslator::getToolHome() Compare: https://github.com/jenkinsci/jenkins/compare/7b70bd96d7ad...5ef293d7a0ac

SCM/JIRA link daemon added a comment - 2014-07-12 00:24

Code changed in jenkins
User: Daniel Beck
Path:
changelog.html
http://jenkins-ci.org/commit/jenkins/fdc0b5c5650b3ee849f02e3c5d94d23c12886adc
Log:
Noting #1314, #1316, #1308, ~~JENKINS-17667~~, ~~JENKINS-22395~~, ~~JENKINS-18065~~

SCM/JIRA link daemon added a comment - 2014-07-12 00:24 Code changed in jenkins User: Daniel Beck Path: changelog.html http://jenkins-ci.org/commit/jenkins/fdc0b5c5650b3ee849f02e3c5d94d23c12886adc Log: Noting #1314, #1316, #1308, JENKINS-17667 , JENKINS-22395 , JENKINS-18065

dogfood added a comment - 2014-07-12 00:52

Integrated in jenkins_main_trunk #3515
[FIXED JENKINS-17667] - Syncronization of InstallerTranslator::getToolHome() (Revision e3952f41c5649c326ace3cc263420a8c287e1e7c)

Result = SUCCESS
joshua.kolash : e3952f41c5649c326ace3cc263420a8c287e1e7c
Files :

core/src/main/java/hudson/tools/InstallerTranslator.java

dogfood added a comment - 2014-07-12 00:52 Integrated in jenkins_main_trunk #3515 [FIXED JENKINS-17667] - Syncronization of InstallerTranslator::getToolHome() (Revision e3952f41c5649c326ace3cc263420a8c287e1e7c) Result = SUCCESS joshua.kolash : e3952f41c5649c326ace3cc263420a8c287e1e7c Files : core/src/main/java/hudson/tools/InstallerTranslator.java

Jesse Glick added a comment - 2014-07-15 16:53

Reverting as that change seems to have caused a major regression.

Jesse Glick added a comment - 2014-07-15 16:53 Reverting as that change seems to have caused a major regression.

SCM/JIRA link daemon added a comment - 2014-07-15 16:57

Code changed in jenkins
User: Jesse Glick
Path:
changelog.html
core/src/main/java/hudson/tools/InstallerTranslator.java
http://jenkins-ci.org/commit/jenkins/7c253c1cef6a40bf504e313e68a85e4fc065aa0f
Log:
~~JENKINS-17667~~ Reverting commit e3952f41c5649c326ace3cc263420a8c287e1e7c.

SCM/JIRA link daemon added a comment - 2014-07-15 16:57 Code changed in jenkins User: Jesse Glick Path: changelog.html core/src/main/java/hudson/tools/InstallerTranslator.java http://jenkins-ci.org/commit/jenkins/7c253c1cef6a40bf504e313e68a85e4fc065aa0f Log: JENKINS-17667 Reverting commit e3952f41c5649c326ace3cc263420a8c287e1e7c.

Raj vasikarla added a comment - 2014-07-15 16:57

Any updates on this issue....

Raj vasikarla added a comment - 2014-07-15 16:57 Any updates on this issue....

SCM/JIRA link daemon added a comment - 2014-07-15 18:17

Code changed in jenkins
User: Jesse Glick
Path:
changelog.html
core/src/main/java/hudson/tools/InstallerTranslator.java
test/src/test/java/hudson/tools/InstallerTranslatorTest.java
http://jenkins-ci.org/commit/jenkins/17d90931655e6c67651ec371344552d7c23bdcda
Log:
[FIXED JENKINS-17667] Fixed race condition when running tool installers on many slaves at once.
Correcting change made in #1176, which introduced an NPE, to restore original logic merely wrapped in a synchronized block.
Reproduced NPE in new functional test (original bug probably very hard to reproduce).

Compare: https://github.com/jenkinsci/jenkins/compare/84d49ceef2d6...17d90931655e

SCM/JIRA link daemon added a comment - 2014-07-15 18:17 Code changed in jenkins User: Jesse Glick Path: changelog.html core/src/main/java/hudson/tools/InstallerTranslator.java test/src/test/java/hudson/tools/InstallerTranslatorTest.java http://jenkins-ci.org/commit/jenkins/17d90931655e6c67651ec371344552d7c23bdcda Log: [FIXED JENKINS-17667] Fixed race condition when running tool installers on many slaves at once. Correcting change made in #1176, which introduced an NPE, to restore original logic merely wrapped in a synchronized block. Reproduced NPE in new functional test (original bug probably very hard to reproduce). Compare: https://github.com/jenkinsci/jenkins/compare/84d49ceef2d6...17d90931655e

dogfood added a comment - 2014-07-15 19:11

Integrated in jenkins_main_trunk #3525
[FIXED JENKINS-17667] Fixed race condition when running tool installers on many slaves at once. (Revision 17d90931655e6c67651ec371344552d7c23bdcda)

Result = SUCCESS
Jesse Glick : 17d90931655e6c67651ec371344552d7c23bdcda
Files :

test/src/test/java/hudson/tools/InstallerTranslatorTest.java
changelog.html
core/src/main/java/hudson/tools/InstallerTranslator.java

dogfood added a comment - 2014-07-15 19:11 Integrated in jenkins_main_trunk #3525 [FIXED JENKINS-17667] Fixed race condition when running tool installers on many slaves at once. (Revision 17d90931655e6c67651ec371344552d7c23bdcda) Result = SUCCESS Jesse Glick : 17d90931655e6c67651ec371344552d7c23bdcda Files : test/src/test/java/hudson/tools/InstallerTranslatorTest.java changelog.html core/src/main/java/hudson/tools/InstallerTranslator.java

dogfood added a comment - 2014-07-21 07:32

Integrated in jenkins_main_trunk #3532
~~JENKINS-17667~~ Reverting commit e3952f41c5649c326ace3cc263420a8c287e1e7c. (Revision 7c253c1cef6a40bf504e313e68a85e4fc065aa0f)

Result = SUCCESS
Jesse Glick : 7c253c1cef6a40bf504e313e68a85e4fc065aa0f
Files :

changelog.html
core/src/main/java/hudson/tools/InstallerTranslator.java

dogfood added a comment - 2014-07-21 07:32 Integrated in jenkins_main_trunk #3532 JENKINS-17667 Reverting commit e3952f41c5649c326ace3cc263420a8c287e1e7c. (Revision 7c253c1cef6a40bf504e313e68a85e4fc065aa0f) Result = SUCCESS Jesse Glick : 7c253c1cef6a40bf504e313e68a85e4fc065aa0f Files : changelog.html core/src/main/java/hudson/tools/InstallerTranslator.java

SCM/JIRA link daemon added a comment - 2014-09-07 16:33

Code changed in jenkins
User: Jesse Glick
Path:
core/src/main/java/hudson/tools/InstallerTranslator.java
test/src/test/java/hudson/tools/InstallerTranslatorTest.java
http://jenkins-ci.org/commit/jenkins/65d34a5076d8c4ec15601cecba1257d0cbfe867a
Log:
[FIXED JENKINS-17667] Fixed race condition when running tool installers on many slaves at once.
Correcting change made in #1176, which introduced an NPE, to restore original logic merely wrapped in a synchronized block.
Reproduced NPE in new functional test (original bug probably very hard to reproduce).
(cherry picked from commit 17d90931655e6c67651ec371344552d7c23bdcda)

Conflicts:
changelog.html
core/src/main/java/hudson/tools/InstallerTranslator.java

SCM/JIRA link daemon added a comment - 2014-09-07 16:33 Code changed in jenkins User: Jesse Glick Path: core/src/main/java/hudson/tools/InstallerTranslator.java test/src/test/java/hudson/tools/InstallerTranslatorTest.java http://jenkins-ci.org/commit/jenkins/65d34a5076d8c4ec15601cecba1257d0cbfe867a Log: [FIXED JENKINS-17667] Fixed race condition when running tool installers on many slaves at once. Correcting change made in #1176, which introduced an NPE, to restore original logic merely wrapped in a synchronized block. Reproduced NPE in new functional test (original bug probably very hard to reproduce). (cherry picked from commit 17d90931655e6c67651ec371344552d7c23bdcda) Conflicts: changelog.html core/src/main/java/hudson/tools/InstallerTranslator.java

Baptiste Mathus added a comment - 2015-03-05 09:37

For future reference, since I found that bug googling, seems like we're currently having some form of reminiscence of that issue. Running 1.593.
Currently crawling the thread dump, I don't see anything obvious, yet.

Baptiste Mathus added a comment - 2015-03-05 09:37 For future reference, since I found that bug googling, seems like we're currently having some form of reminiscence of that issue. Running 1.593. Currently crawling the thread dump, I don't see anything obvious, yet.

Daniel Beck added a comment - 2015-03-05 09:58

Likely a different issue. If in doubt, file a new bug.

Daniel Beck added a comment - 2015-03-05 09:58 Likely a different issue. If in doubt, file a new bug.

Baptiste Mathus added a comment - 2015-03-05 10:08 - edited

Reopening, as this is exactly the same behaviour described above:
"null on master" and so on.

Btw, this is very weird because this job is "restricted" to slaves who have a label which is not set on the master.

I've got a thread dump

Baptiste Mathus added a comment - 2015-03-05 10:08 - edited Reopening, as this is exactly the same behaviour described above: "null on master" and so on. Btw, this is very weird because this job is "restricted" to slaves who have a label which is not set on the master. I've got a thread dump

Baptiste Mathus added a comment - 2015-03-05 10:12

Thread dump for the issue with Jenkins 1.593

Baptiste Mathus added a comment - 2015-03-05 10:12 Thread dump for the issue with Jenkins 1.593

Baptiste Mathus added a comment - 2015-03-05 10:17

After Jenkins restart, the timing has been adjusted and the node on which it seems Jenkins actually wanted to send the build has been fixed:
"took 0 ms on rhel6-3" (instead now of "master" as it was displayed while it was stuck).

So, that also matches the issue described, and the guy who did this confirmed: the build was tried to be killed very early during its launch.

Baptiste Mathus added a comment - 2015-03-05 10:17 After Jenkins restart, the timing has been adjusted and the node on which it seems Jenkins actually wanted to send the build has been fixed: "took 0 ms on rhel6-3" (instead now of "master" as it was displayed while it was stuck). So, that also matches the issue described, and the guy who did this confirmed: the build was tried to be killed very early during its launch.

Baptiste Mathus added a comment - 2015-03-05 10:23

@Daniel I reopened before seing your comment, because the symptoms are exactly the same at first sight, and I didn't want to disseminate data onto different JIRA issues when it seemed to be the same one.
But if needed I can still file a new one, and link to here.

Baptiste Mathus added a comment - 2015-03-05 10:23 @Daniel I reopened before seing your comment, because the symptoms are exactly the same at first sight, and I didn't want to disseminate data onto different JIRA issues when it seemed to be the same one. But if needed I can still file a new one, and link to here.

Ozgur Kaya added a comment - 2015-05-12 10:07

Disable the job and then enable the job.. You will see that all jobs has killed and not rerunning.

Ozgur Kaya added a comment - 2015-05-12 10:07 Disable the job and then enable the job.. You will see that all jobs has killed and not rerunning.

Daniel Beck added a comment - 2015-05-12 11:28

I didn't want to disseminate data onto different JIRA issues

A good idea as long as it never misleads or contradicts. As soon as that happens it's a mess and you need to figure out what's going on. Since you can mark jobs as being related, this shouldn't be an issue.

Daniel Beck added a comment - 2015-05-12 11:28 I didn't want to disseminate data onto different JIRA issues A good idea as long as it never misleads or contradicts. As soon as that happens it's a mess and you need to figure out what's going on. Since you can mark jobs as being related, this shouldn't be an issue.

Jesse Glick added a comment - 2015-05-12 14:59

Not sure why this got reassigned away from me. I committed the fix to the known issue. (If there are other issues with similar symptoms, they should be filed separately and linked.)

Jesse Glick added a comment - 2015-05-12 14:59 Not sure why this got reassigned away from me. I committed the fix to the known issue. (If there are other issues with similar symptoms, they should be filed separately and linked.)

Santosh Phalke added a comment - 2016-02-08 22:42

I observed the same issue in the Jenkins version. 1.625.3. The issue occurred in the Multi-configuration project job. In my case the issue appeared after changing the Job weight of the Multi-configuration project job to 2 from 1. The next build of the Multi-configuration project job were non responsive. The non-responsive build under the build history displayed the on hover message "Started Null ago, Estimated remaining time: null." Could trigger the next build after reverting the job weight to 1 and after enabling the job configuration option "Execute concurrent builds if necessary"

Santosh Phalke added a comment - 2016-02-08 22:42 I observed the same issue in the Jenkins version. 1.625.3. The issue occurred in the Multi-configuration project job. In my case the issue appeared after changing the Job weight of the Multi-configuration project job to 2 from 1. The next build of the Multi-configuration project job were non responsive. The non-responsive build under the build history displayed the on hover message "Started Null ago, Estimated remaining time: null." Could trigger the next build after reverting the job weight to 1 and after enabling the job configuration option "Execute concurrent builds if necessary"

Jenkins

Details

Description

Attachments

Attachments

Activity

Collapse comment: Aswini Rajasekaran added a comment - 2013-04-19 10:11

Expand comment: Aswini Rajasekaran added a comment - 2013-04-19 10:11

Collapse comment: Aswini Rajasekaran added a comment - 2013-04-23 11:02

Expand comment: Aswini Rajasekaran added a comment - 2013-04-23 11:02

Collapse comment: Mark Waite added a comment - 2013-04-23 11:54

Expand comment: Mark Waite added a comment - 2013-04-23 11:54

Collapse comment: Aswini Rajasekaran added a comment - 2013-04-23 12:02

Expand comment: Aswini Rajasekaran added a comment - 2013-04-23 12:02

Collapse comment: Mark Waite added a comment - 2013-04-23 12:11

Expand comment: Mark Waite added a comment - 2013-04-23 12:11

Collapse comment: Aswini Rajasekaran added a comment - 2013-04-23 12:15

Expand comment: Aswini Rajasekaran added a comment - 2013-04-23 12:15

Collapse comment: ikedam added a comment - 2014-02-26 05:18

Expand comment: ikedam added a comment - 2014-02-26 05:18

Collapse comment: Axel Berndt added a comment - 2014-03-12 10:44, Edited by Axel Berndt - 2014-03-12 10:46

Expand comment: Axel Berndt added a comment - 2014-03-12 10:44, Edited by Axel Berndt - 2014-03-12 10:46

Collapse comment: Steven Deal added a comment - 2014-03-14 18:49, Edited by Steven Deal - 2014-03-14 18:58

Expand comment: Steven Deal added a comment - 2014-03-14 18:49, Edited by Steven Deal - 2014-03-14 18:58

Collapse comment: Joshua Kolash added a comment - 2014-04-07 18:52

Expand comment: Joshua Kolash added a comment - 2014-04-07 18:52

Collapse comment: Joshua Kolash added a comment - 2014-04-07 19:25, Edited by Joshua Kolash - 2014-04-07 19:26

Expand comment: Joshua Kolash added a comment - 2014-04-07 19:25, Edited by Joshua Kolash - 2014-04-07 19:26

Collapse comment: SCM/JIRA link daemon added a comment - 2014-07-11 23:19

Expand comment: SCM/JIRA link daemon added a comment - 2014-07-11 23:19

Collapse comment: SCM/JIRA link daemon added a comment - 2014-07-11 23:19

Expand comment: SCM/JIRA link daemon added a comment - 2014-07-11 23:19

Collapse comment: SCM/JIRA link daemon added a comment - 2014-07-12 00:24

Expand comment: SCM/JIRA link daemon added a comment - 2014-07-12 00:24

Collapse comment: dogfood added a comment - 2014-07-12 00:52

Expand comment: dogfood added a comment - 2014-07-12 00:52

Collapse comment: Jesse Glick added a comment - 2014-07-15 16:53

Expand comment: Jesse Glick added a comment - 2014-07-15 16:53

Collapse comment: SCM/JIRA link daemon added a comment - 2014-07-15 16:57

Expand comment: SCM/JIRA link daemon added a comment - 2014-07-15 16:57

Collapse comment: Raj vasikarla added a comment - 2014-07-15 16:57

Expand comment: Raj vasikarla added a comment - 2014-07-15 16:57

Collapse comment: SCM/JIRA link daemon added a comment - 2014-07-15 18:17

Expand comment: SCM/JIRA link daemon added a comment - 2014-07-15 18:17

Collapse comment: dogfood added a comment - 2014-07-15 19:11

Expand comment: dogfood added a comment - 2014-07-15 19:11

Collapse comment: dogfood added a comment - 2014-07-21 07:32

Expand comment: dogfood added a comment - 2014-07-21 07:32

Collapse comment: SCM/JIRA link daemon added a comment - 2014-09-07 16:33

Expand comment: SCM/JIRA link daemon added a comment - 2014-09-07 16:33

Collapse comment: Baptiste Mathus added a comment - 2015-03-05 09:37

Expand comment: Baptiste Mathus added a comment - 2015-03-05 09:37

Collapse comment: Daniel Beck added a comment - 2015-03-05 09:58

Expand comment: Daniel Beck added a comment - 2015-03-05 09:58

Collapse comment: Baptiste Mathus added a comment - 2015-03-05 10:08, Edited by Baptiste Mathus - 2015-03-05 10:12

Expand comment: Baptiste Mathus added a comment - 2015-03-05 10:08, Edited by Baptiste Mathus - 2015-03-05 10:12

Collapse comment: Baptiste Mathus added a comment - 2015-03-05 10:12

Expand comment: Baptiste Mathus added a comment - 2015-03-05 10:12

Collapse comment: Baptiste Mathus added a comment - 2015-03-05 10:17

Expand comment: Baptiste Mathus added a comment - 2015-03-05 10:17

Collapse comment: Baptiste Mathus added a comment - 2015-03-05 10:23

Expand comment: Baptiste Mathus added a comment - 2015-03-05 10:23

Collapse comment: Ozgur Kaya added a comment - 2015-05-12 10:07

Expand comment: Ozgur Kaya added a comment - 2015-05-12 10:07

Collapse comment: Daniel Beck added a comment - 2015-05-12 11:28

Expand comment: Daniel Beck added a comment - 2015-05-12 11:28

Collapse comment: Jesse Glick added a comment - 2015-05-12 14:59

Expand comment: Jesse Glick added a comment - 2015-05-12 14:59

Collapse comment: Santosh Phalke added a comment - 2016-02-08 22:42

Expand comment: Santosh Phalke added a comment - 2016-02-08 22:42

People

Dates