-
Bug
-
Resolution: Unresolved
-
Minor
-
## Master
Jenkins Version : 2.65
OS: Alpine OS 3.5
Swarm Plugin 3.4
## Slave
Jenkins Swarm Client: 3.3
OS: CentOS 7
Java: 1.8
Jenkins Swarm successfully connect to master and take jobs but keep on going offline and then reconnecting after few seconds
This is how I connect to Jenkins Master
java -jar /opt/swarm-client.jar \ -master http://${master_elb} \ -username ${user} \ -password ${password} \ -labels slave \ -executors ${executors} \ -description 'Jenkins Slave' \ -retryInterval 5 \ -fsroot ${app_dir} \ -name ${LOCAL_IP}
These are the logs
INFO: Agent discovery successful Agent address: my-jenkins-master.com Agent port: 50000 Identity: 05:d2:dc:14:94:0e:3a:e9:18:7b:2b:dc:2b:e0:06:4c Jul 19, 2017 6:40:36 PM hudson.remoting.jnlp.Main$CuiListener status INFO: Handshaking Jul 19, 2017 6:40:36 PM hudson.remoting.jnlp.Main$CuiListener status INFO: Connecting to my-jenkins-master.com:50000 Jul 19, 2017 6:40:36 PM hudson.remoting.jnlp.Main$CuiListener status INFO: Trying protocol: JNLP4-connect Jul 19, 2017 6:40:36 PM hudson.remoting.jnlp.Main$CuiListener status INFO: Remote identity confirmed: 05:d2:dc:14:94:0e:3a:e9:18:7b:2b:dc:2b:e0:06:4c Jul 19, 2017 6:40:36 PM hudson.remoting.jnlp.Main$CuiListener status INFO: Connected Jul 19, 2017 6:41:41 PM hudson.remoting.jnlp.Main$CuiListener status INFO: Terminated Jul 19, 2017 6:41:41 PM hudson.plugins.swarm.Client run INFO: Retrying in 5 seconds Jul 19, 2017 6:41:46 PM hudson.plugins.swarm.Client run INFO: Attempting to connect to http://my-jenkins-master.com/ ae3d1dbd-4a93-4c89-b98c-f90ed62f1f71 with ID 7aef328a Jul 19, 2017 6:41:46 PM hudson.plugins.swarm.SwarmClient getCsrfCrumb SEVERE: Could not obtain CSRF crumb. Response code: 404 Jul 19, 2017 6:41:47 PM hudson.remoting.jnlp.Main createEngine INFO: Setting up slave: 10-61-67-169-7aef328a Jul 19, 2017 6:41:47 PM hudson.remoting.jnlp.Main$CuiListener <init> INFO: Jenkins agent is running in headless mode. Jul 19, 2017 6:41:47 PM hudson.remoting.jnlp.Main$CuiListener status INFO: Locating server among [http://my-jenkins-master.com/] Jul 19, 2017 6:41:47 PM hudson.remoting.jnlp.Main$CuiListener status INFO: Agent discovery successful Agent address: my-jenkins-master.com Agent port: 50000 Identity: 05:d2:dc:14:94:0e:3a:e9:18:7b:2b:dc:2b:e0:06:4c
Is there something else I can do to make it stable?
I'm also seeing this error on 3.4 Traces are below:
Jul 20, 2017 3:01:14 AM hudson.plugins.swarm.Client run
INFO: Discovering Jenkins master
SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder".
SLF4J: Defaulting to no-operation (NOP) logger implementation
SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further details.
Jul 20, 2017 3:01:14 AM hudson.plugins.swarm.Client run
INFO: Attempting to connect to http://jenkins.us-west-2.utils.aws.geniussports.com/ b50070f6-196f-4ea9-b881-18817e73e750 with ID
Jul 20, 2017 3:01:14 AM hudson.remoting.jnlp.Main createEngine
INFO: Setting up slave: i-0d8cc16d1963fc8f9
Jul 20, 2017 3:01:14 AM hudson.remoting.jnlp.Main$CuiListener <init>
INFO: Jenkins agent is running in headless mode.
Jul 20, 2017 3:01:14 AM hudson.remoting.jnlp.Main$CuiListener status
INFO: Locating server among http://jenkins.us-west-2.utils.aws.geniussports.com/
Jul 20, 2017 3:01:14 AM hudson.remoting.jnlp.Main$CuiListener status
INFO: Agent discovery successful
Agent address: 172.16.0.206
Agent port: 5000
Identity: e8:9d:53:db:54:0d:59:9a:d2:8e:e4:32:5d:10:c1:f8
Jul 20, 2017 3:01:14 AM hudson.remoting.jnlp.Main$CuiListener status
INFO: Handshaking
Jul 20, 2017 3:01:14 AM hudson.remoting.jnlp.Main$CuiListener status
INFO: Connecting to 172.16.0.206:5000
Jul 20, 2017 3:01:14 AM hudson.remoting.jnlp.Main$CuiListener status
INFO: Trying protocol: JNLP4-connect
Jul 20, 2017 3:01:15 AM hudson.remoting.jnlp.Main$CuiListener status
INFO: Remote identity confirmed: e8:9d:53:db:54:0d:59:9a:d2:8e:e4:32:5d:10:c1:f8
Jul 20, 2017 3:01:15 AM hudson.remoting.jnlp.Main$CuiListener status
INFO: Connected
Jul 20, 2017 3:01:15 AM hudson.remoting.jnlp.Main$CuiListener status
INFO: Terminated
Jul 20, 2017 3:01:15 AM hudson.plugins.swarm.Client run
WARNING: Connection closed, exiting...
And on the master i'm seeing:
Jul 20, 2017 3:01:15 AM INFO hudson.TcpSlaveAgentListener$ConnectionHandler run
Accepted JNLP4-connect connection #4 from 172.16.2.207/172.16.2.207:49842
Jul 20, 2017 3:01:15 AM WARNING jenkins.slaves.DefaultJnlpSlaveReceiver channelClosed
Computer.threadPoolForRemoting 8 for i-0d8cc16d1963fc8f9 terminated java.nio.channels.ClosedChannelException at org.jenkinsci.remoting.protocol.impl.ChannelApplicationLayer.onReadClosed(ChannelApplicationLayer.java:208) at org.jenkinsci.remoting.protocol.ApplicationLayer.onRecvClosed(ApplicationLayer.java:222) at org.jenkinsci.remoting.protocol.ProtocolStack$Ptr.onRecvClosed(ProtocolStack.java:832) at org.jenkinsci.remoting.protocol.FilterLayer.onRecvClosed(FilterLayer.java:287) at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.onRecvClosed(SSLEngineFilterLayer.java:181) at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.switchToNoSecure(SSLEngineFilterLayer.java:283) at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.processWrite(SSLEngineFilterLayer.java:503) at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.processQueuedWrites(SSLEngineFilterLayer.java:248) at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.doSend(SSLEngineFilterLayer.java:200) at org.jenkinsci.remoting.protocol.impl.SSLEngineFilterLayer.doCloseSend(SSLEngineFilterLayer.java:213) at org.jenkinsci.remoting.protocol.ProtocolStack$Ptr.doCloseSend(ProtocolStack.java:800) at org.jenkinsci.remoting.protocol.ApplicationLayer.doCloseWrite(ApplicationLayer.java:173) at org.jenkinsci.remoting.protocol.impl.ChannelApplicationLayer$ByteBufferCommandTransport.closeWrite(ChannelApplicationLayer.java:311) at hudson.remoting.Channel.close(Channel.java:1295) at hudson.remoting.Channel.close(Channel.java:1263) at jenkins.slaves.DefaultJnlpSlaveReceiver.afterChannel(DefaultJnlpSlaveReceiver.java:173) at org.jenkinsci.remoting.engine.JnlpConnectionState$4.invoke(JnlpConnectionState.java:421) at org.jenkinsci.remoting.engine.JnlpConnectionState.fire(JnlpConnectionState.java:312) at org.jenkinsci.remoting.engine.JnlpConnectionState.fireAfterChannel(JnlpConnectionState.java:418) at org.jenkinsci.remoting.engine.JnlpProtocol4Handler$Handler$1.run(JnlpProtocol4Handler.java:334) at jenkins.util.ContextResettingExecutorService$1.run(ContextResettingExecutorService.java:28) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) at java.lang.Thread.run(Thread.java:748)