
High CPU consumption because of SSH communication

      Our Jenkins instance has been running 1.553 for two weeks (it ran a three-year-old version before) on Linux with 10 slaves: 6 Linux slaves connected through SSH and 4 Windows slaves running the WebStart agent as a service.

      When no job is running, CPU usage is almost 0, as expected.

      When only two or three jobs are running, Jenkins CPU usage rises to between 70% and 160%. atop reports 96% of CPU time spent in IRQ handling (even with virtually no disk access), and most of the CPU consumption is accounted as system time. On average since boot, the master node consumes 100% of one CPU. Even though it has 4 CPUs, job execution takes between 2x and 3x longer than with the older version.

      I configured JMX and did a quick CPU profiling. The top-consuming threads are unnamed and are all related to SSH communication:

      "Thread-13" - Thread t@90
         java.lang.Thread.State: RUNNABLE
      	at java.net.SocketInputStream.socketRead0(Native Method)
      	at java.net.SocketInputStream.read(SocketInputStream.java:152)
      	at java.net.SocketInputStream.read(SocketInputStream.java:122)
      	at com.trilead.ssh2.crypto.cipher.CipherInputStream.fill_buffer(CipherInputStream.java:41)
      	at com.trilead.ssh2.crypto.cipher.CipherInputStream.internal_read(CipherInputStream.java:52)
      	at com.trilead.ssh2.crypto.cipher.CipherInputStream.getBlock(CipherInputStream.java:79)
      	at com.trilead.ssh2.crypto.cipher.CipherInputStream.read(CipherInputStream.java:108)
      	at com.trilead.ssh2.transport.TransportConnection.receiveMessage(TransportConnection.java:232)
      	at com.trilead.ssh2.transport.TransportManager.receiveLoop(TransportManager.java:682)
      	at com.trilead.ssh2.transport.TransportManager$1.run(TransportManager.java:480)
      	at java.lang.Thread.run(Thread.java:744)
      

      So there is a chance the "SSH agent plugin" is involved.

      I am ready to do deeper analysis on my system if required, and of course to test patches.
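
      For reference, the same per-thread CPU figures can also be gathered programmatically with the standard ThreadMXBean; a minimal, illustrative sketch (not part of Jenkins itself):

      import java.lang.management.ManagementFactory;
      import java.lang.management.ThreadInfo;
      import java.lang.management.ThreadMXBean;

      // Illustrative only: print per-thread CPU time for the current JVM so the
      // hottest (here the unnamed "Thread-N" SSH transport) threads can be identified.
      public class TopCpuThreads {
          public static void main(String[] args) {
              ThreadMXBean threads = ManagementFactory.getThreadMXBean();
              if (!threads.isThreadCpuTimeSupported()) {
                  System.out.println("Thread CPU time is not supported on this JVM");
                  return;
              }
              for (long id : threads.getAllThreadIds()) {
                  ThreadInfo info = threads.getThreadInfo(id);
                  long cpuNanos = threads.getThreadCpuTime(id);
                  if (info != null && cpuNanos > 0) {
                      System.out.printf("%-40s %8d ms%n", info.getThreadName(), cpuNanos / 1000000L);
                  }
              }
          }
      }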

          [JENKINS-22320] High CPU consumption because of SSH communication

          Yves Martin added a comment -

          I had to switch my Linux slaves to the WebStart (JNLP) agent.
          My master's CPU consumption dropped to only 10% on average (and often less).


          Yves Martin added a comment -

          My guess is that the SSH library is configured with buffers that are too small (as when used as an interactive shell with urgent packets for echo), which may explain why I measure such a high system share of CPU consumption. This hypothesis could be confirmed with network traces showing many small packets, and with TCP statistics compared against an older version of Jenkins that does not suffer from this bug.


          Thomas Herrlin added a comment - edited

          I am currently experimenting with increasing the fill_buffer() buffer from 2 KB to 64 KB, or even replacing it with a BufferedReader. My first naive preliminary test looks good, but it is still too early to say for certain. See trilead-ssh2-64K-buffer.patch.
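
          For illustration only, a minimal sketch of the kind of buffering change being tested here, assuming the raw socket stream can be wrapped before the cipher layer (the class and wiring are hypothetical, not the actual trilead-ssh2-64K-buffer.patch):

          import java.io.BufferedInputStream;
          import java.io.IOException;
          import java.io.InputStream;
          import java.net.Socket;

          // Hypothetical helper, not the real trilead-ssh2 patch: wraps the raw socket
          // stream in a 64 KB BufferedInputStream so the cipher layer issues fewer,
          // larger reads instead of many small socketRead0 calls.
          public class BufferedTransportStream {
              private static final int BUFFER_SIZE = 64 * 1024; // 64 KB instead of 2 KB

              public static InputStream wrap(Socket socket) throws IOException {
                  return new BufferedInputStream(socket.getInputStream(), BUFFER_SIZE);
              }
          }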


          Thomas Herrlin added a comment - edited

          Preliminary tests were actually not so good. I forgot to remove the Java exclusions when using the jvisualvm CPU sampler. The majority of CPU time is still spent within socketRead0 [native], even after my buffer hack.

          I also tried forcing setTcpNoDelay(false), since src/main/java/hudson/plugins/sshslaves/SSHLauncher.java sets connection.setTCPNoDelay(true).
          Setting it to false should reduce the number of packets and thus the number of interrupts.
          TCP_NODELAY should mostly be used for interactive connections, to get fast non-piggybacked ACKs; it is not so good for bulk transfers, as far as I understand. I find commit c2fcc257 in sshslaves to be contradictory to my understanding of how the TCP protocol works, but perhaps there are real-world cases where ACK piggybacking causes problems.
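
          As a plain-socket illustration of what this flag does (using java.net.Socket directly, not the trilead-ssh2 Connection; the endpoint below is a placeholder):

          import java.io.IOException;
          import java.net.Socket;

          // Illustration only: TCP_NODELAY disables Nagle's algorithm. With it enabled,
          // small writes go out immediately as small packets (good for interactive
          // latency); with it disabled, the kernel coalesces small writes into fewer,
          // larger packets, which generally suits bulk transfers better.
          public class TcpNoDelayExample {
              public static void main(String[] args) throws IOException {
                  Socket socket = new Socket("build-slave.example.com", 22); // placeholder endpoint
                  try {
                      socket.setTcpNoDelay(false); // let Nagle coalesce small writes
                      System.out.println("TCP_NODELAY = " + socket.getTcpNoDelay());
                  } finally {
                      socket.close();
                  }
              }
          }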

          I did a simple "ifconfig ; sleep 3 ; ifconfig" while Jenkins was transferring a big, non-compressible mocked artifact; the average RX packet size was about 1100 bytes.

          The SSH channel buffer size seems to be set to 4 MB in the sshslaves plugin, in src/main/java/hudson/plugins/sshslaves/SSHLauncher.java:

              private void expandChannelBufferSize(Session session, TaskListener listener) {
                  // see hudson.remoting.Channel.PIPE_WINDOW_SIZE for the discussion of why 1MB is in the right ball park
                  // but this particular session is where all the master/slave communication will happen, so
                  // it's worth using a bigger buffer to really better utilize bandwidth even when the latency is even larger
                  // (and since we are draining this pipe very rapidly, it's unlikely that we'll actually accumulate this much data)
                  int sz = 4;
                  m.invoke(session, sz*1024*1024);

          I am using trilead-ssh2-build217-jenkins-3, sshslaves 1.6, Jenkins 1.554.1 with the bundled jetty container.


          Looks like I may be chasing a red herring.

          The slowdowns I see may not actually be related to this ticket, even if I get high CPU usage in fill_buffer -> socketRead0.


          Joshua K added a comment -

          I strace'd and it looks like a lot of the traffic is due to repeatedly sending exceptions over the remoting channel:

          write(8, "q\0~\0\5\0\2\16`\0\0\0\0sr\0 java.lang.ClassNotFoundException\177Z\315f>\324 \216\2\0\1L\0\2exq\0~\0\1xq\0~\0\10pt\0\23java.nio.file.Filesuq\0~\0\r\0\0\0\21sq\0~\0\17\0\0\5_t\0\33jenkins.util.AntClassLoadert\0\23AntClassLoader.javat\0\25findClassInComponentssq\0~\0\17\0\0\5-q\0~\0004q\0~\0005t\0\tfindClasssq\0~\0\17", 233) = 233

          This is plausible, as our slaves/master run on Java 1.6 and java.nio.file.Files is new in 1.7.

          I will try to bump our slaves to use JDK 1.7 and see if that helps.
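
          A minimal sketch of the failure mode described above, assuming a Java 6 agent JVM (the class below is only illustrative):

          // Illustrative only: on a Java 6 JVM this lookup fails, and per the strace
          // output above it is the resulting ClassNotFoundException that gets
          // serialized over the remoting channel again and again.
          public class Java7ClassCheck {
              public static void main(String[] args) {
                  System.out.println("java.version = " + System.getProperty("java.version"));
                  try {
                      Class.forName("java.nio.file.Files"); // present only on Java 7+
                      System.out.println("java.nio.file.Files is available on this JVM");
                  } catch (ClassNotFoundException e) {
                      System.out.println("java.nio.file.Files is missing: " + e);
                  }
              }
          }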


          Yves Martin added a comment -

          After upgrading to 1.625.1, SSH slave communication is as efficient as the WebStart agent.
          As a result, I consider this issue fixed.
          Thank you for the work. Yves


          Roy Arnon added a comment -

          Hi,
          Using Jenkins 1.625.3 and the ssh-slaves plugin 1.10, this issue seems to occur intermittently for us. Lately, it happens at least once a day.
          Most of the threads are wasting CPU time in exactly the same way as in the example above.

          I've attached the call tree from YourKit.

          I can provide more data if required.


          Roy Arnon added a comment -

          Sorry, I attached the image incorrectly:


            Assignee: Kohsuke Kawaguchi (kohsuke)
            Reporter: Yves Martin (ymartin1040)
            Votes: 3
            Watchers: 10
