[JENKINS-46893] Huge amount of TcpSlaveAgentListener EOFException on Kubernetes

Type: Bug
Resolution: Not A Defect
Priority: Major
Component/s: core, remoting
Labels:
None
Environment:
jenkins-2.60.2

I set up a Jenkins master on Azure Container Service (Kubernetes). Ever since the master came up, it has been logging a continuous stream of warnings like these:

Sep 15, 2017 8:33:23 AM hudson.TcpSlaveAgentListener$ConnectionHandler run
WARNING: Connection #600 failed
java.io.EOFException
        at java.io.DataInputStream.readFully(DataInputStream.java:197)
        at java.io.DataInputStream.readFully(DataInputStream.java:169)
        at hudson.TcpSlaveAgentListener$ConnectionHandler.run(TcpSlaveAgentListener.java:213)
Sep 15, 2017 8:33:23 AM hudson.TcpSlaveAgentListener$ConnectionHandler run
WARNING: Connection #602 failed
java.io.EOFException
        at java.io.DataInputStream.readFully(DataInputStream.java:197)
        at java.io.DataInputStream.readFully(DataInputStream.java:169)
        at hudson.TcpSlaveAgentListener$ConnectionHandler.run(TcpSlaveAgentListener.java:213)
Sep 15, 2017 8:33:27 AM hudson.TcpSlaveAgentListener$ConnectionHandler run
WARNING: Connection #603 failed
java.io.EOFException
        at java.io.DataInputStream.readFully(DataInputStream.java:197)
        at java.io.DataInputStream.readFully(DataInputStream.java:169)
        at hudson.TcpSlaveAgentListener$ConnectionHandler.run(TcpSlaveAgentListener.java:213)

Despite the many warnings, everything works. I can even use the Azure VM plugin to set up JNLP slaves.

I think this issue is related to port 50000: if I expose port 50000 only as a cluster port, there are no errors. However, if I expose port 50000 through a LoadBalancer service (which we have to do), the errors shown above appear.

Here are my Kubernetes files:

kind: PersistentVolumeClaim
apiVersion: v1
metadata:
  name: azdisk
spec:
  accessModes:
    - ReadWriteOnce
  resources:
    requests:
      storage: 5Gi
  storageClassName: azuredisk
---
kind: Deployment
apiVersion: apps/v1beta1
metadata:
  name: jenkins-1
spec:
  replicas: 1
  template:
    metadata:
      name: jenkins-1
      labels:
        app: jenkins-1
    spec:
      containers:
      - name: jenkins-container
        image: zackliu1995/jenkins
        volumeMounts:
        - name: azure
          mountPath: /var/jenkins_home
        securityContext:
          privileged: true
        ports:
          - name: port8080
            containerPort: 8080
            protocol: TCP
          - name: port50000
            containerPort: 50000
            protocol: TCP
          - name: port22
            containerPort: 22
      volumes:
        - name: azure
          persistentVolumeClaim:
            claimName: azdisk
---
apiVersion: v1
kind: Service
metadata:
  name: jenkins-srv
spec:
  selector:
    app: jenkins-1
  ports:
    - name: http
      port: 80
      protocol: TCP
      targetPort: 8080
    - name: slave
      port: 50000
      protocol: TCP
      targetPort: 50000
    - name: ssh
      port: 22
      targetPort: 22
  type: LoadBalancer

I also tried the official Jenkins image; it caused the same issue.

Attachments:
jenkins-master.jpg (93 kB, 2019-02-20 06:28)
jenkins-slave.txt (4 kB, 2019-02-20 06:29)

Chenyang Liu added a comment - 2017-09-15 13:57

Oh, you probably misunderstood the issue. There is no agent, only a Jenkins master. The warnings have appeared since the very beginning, before I had even finished the setup wizard (I had not unlocked Jenkins or installed the suggested plugins), and of course before I had installed the VM agent plugin.

So I think it may be caused by some self-check in Jenkins... I don't know, it's very strange.

Oleg Nenashev added a comment - 2017-09-15 16:32

I can add more diagnostics, but there is no such self-check in Jenkins for sure.

The errors may also be coming from the Remoting-based CLI or from non-terminated Jenkins Maven project runs on instances. There are also other components from vendors that may be trying to connect to the master via Remoting.

Daniel Beck added a comment - 2017-09-17 07:03

Does a load balancer connect to the port in question as a health check of sorts?

Perhaps rtyler or olblak can take a look at this as our resident Kubernetes on Azure experts.

Chenyang Liu added a comment - 2017-09-17 08:36

That's probably the case. Azure Load Balancer does run health probes, every 5 seconds by default.

I will check it on Monday.

R. Tyler Croy added a comment - 2017-09-17 16:33

danielbeck, from my understanding this JIRA is not a support forum.

Daniel Beck added a comment - 2017-09-17 22:37 - edited

rtyler, it is not, but so far it's unclear to me whether this is a bug or not. If you can confirm that my guess is right (or that some other unreasonable load balancer behavior causes this), that would make this Not A Defect.

Oleg Nenashev added a comment - 2017-12-25 23:13

zackliu ping.

Chenyang Liu added a comment - 2017-12-26 01:15

It's caused by the load balancer's health probes; this issue can be closed.
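
For anyone hitting the same warning: the sketch below is one way to reproduce the symptom outside Jenkins, assuming the probe simply opens a TCP connection and closes it without sending any data. It is not the Jenkins source. The class name `ProbeEofDemo`, the helper names, and the 10-byte header size are made up for illustration. The only detail taken from the stack trace above is that the listener is inside `DataInputStream.readFully`, which throws `EOFException` when the peer closes before the buffer is filled.

```java
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.Socket;

class ProbeEofDemo {

    /** Mimics a TCP health probe: connect, send nothing, close. */
    static void probe(int port) throws IOException {
        try (Socket s = new Socket(InetAddress.getLoopbackAddress(), port)) {
            // Intentionally empty: the probe only checks that the port accepts connections.
        }
    }

    /** Accepts one connection and tries to read a fixed-size header from it. */
    static String handleOne(ServerSocket server) throws IOException {
        try (Socket s = server.accept();
             DataInputStream in = new DataInputStream(s.getInputStream())) {
            byte[] head = new byte[10]; // hypothetical header length, chosen for the demo
            in.readFully(head);         // hits end of stream if the peer already closed
            return "header read";
        } catch (EOFException e) {
            return "EOFException";
        }
    }

    /** Starts a listener on an ephemeral loopback port, probes it once, and reports the outcome. */
    static String run() throws IOException {
        try (ServerSocket server = new ServerSocket(0, 50, InetAddress.getLoopbackAddress())) {
            probe(server.getLocalPort());
            return handleOne(server);
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println(run());
    }
}
```

If the assumption about the probe holds, every probe interval would produce one such failed connection on the master, which would match the steady stream of warnings in the description even with no agents connected.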

Oleg Nenashev added a comment - 2018-03-27 21:54

Closing as Not A Defect, per the reporter's response.

gp guan added a comment - 2019-02-20 06:30

I hit the same issue in my local k8s cluster, where the service type is ClusterIP. When the problem occurs, the logs of the Jenkins master and Jenkins slave are as follows:
jenkins-slave.txt

Assignee: Unassigned
Reporter: Chenyang Liu
Votes: 0
Watchers: 5
Created: 2017-09-15 09:12
Updated: 2019-02-20 06:30
Resolved: 2018-03-27 21:54
