Uploaded image for project: 'Jenkins'
  1. Jenkins
  2. JENKINS-75135

Permament machine crashes/disconnects during pipeline execution

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Major Major

      Hi,

      we observed Jenkins agents disconnecting during pipeline execution.
      Unfortunately it happens rarely and it's hard to pinpoint exact root cause.

      Quite often it occurs during finishing of TestComplete tests.

      Error I got is 
      [2025-01-14T12:29:01.547Z] [TestComplete] [ERROR] An exception occurred:
      [2025-01-14T12:29:01.547Z] java.nio.channels.ClosedChannelException.

      Followed by another one during post actions:

      Error when executing always post condition:
      Also:   org.jenkinsci.plugins.workflow.actions.ErrorAction$ErrorId: 54bbc34a-e9b0-407c-9866-a0b807f89a82
      org.jenkinsci.plugins.workflow.support.steps.AgentOfflineException: Unable to create live FilePath for TSLDESOT118

      Same issue appeared few times for different machines.

      I'm not sure if it's related but we also saw such INFO logs on Jenkins instance during this time:
      2025-01-14 12:29:00.512+0000 [id=424462]        INFO    j.s.DefaultJnlpSlaveReceiver#channelClosed: IOHub#1: Worker[channel:java.nio.channels.SocketChannel[connected local=/10.100.29.20:56429 remote=TLSDVSOT118.company.com/10.128.80.30:62895]] / Computer.threadPoolForRemoting 53461 for TLSDVSOT118 terminated: java.nio.channels.ClosedChannelException

      Those logs appeared almost for all machines, but we only saw similar issue only on 3 of them.

      In attachment I'm adding full log of execution

          [JENKINS-75135] Permament machine crashes/disconnects during pipeline execution

          Grzegorz created issue -
          Grzegorz made changes -
          Description Original: Hi,

          we observed Jenkins agents disconnecting during pipeline execution.
          Unfortunately it happens rarely and it's hard to pinpoint exact root cause.

          Quite often it occurs during finishing of TestComplete tests.

          Error I got is 
          [2025-01-14T12:29:01.547Z] [TestComplete] [ERROR] An exception occurred:
          [2025-01-14T12:29:01.547Z] java.nio.channels.ClosedChannelException.

          Followed by another one during post actions:

          Error when executing always post condition:
          Also:   org.jenkinsci.plugins.workflow.actions.ErrorAction$ErrorId: 54bbc34a-e9b0-407c-9866-a0b807f89a82
          org.jenkinsci.plugins.workflow.support.steps.AgentOfflineException: Unable to create live FilePath for TSLDESOT118


          Same issue appeared few times for different machines.

          In attachment I'm adding full log of execution
          New: Hi,

          we observed Jenkins agents disconnecting during pipeline execution.
          Unfortunately it happens rarely and it's hard to pinpoint exact root cause.

          Quite often it occurs during finishing of TestComplete tests.

          Error I got is 
          [2025-01-14T12:29:01.547Z] [TestComplete] [ERROR] An exception occurred:
          [2025-01-14T12:29:01.547Z] java.nio.channels.ClosedChannelException.

          Followed by another one during post actions:

          Error when executing always post condition:
          Also:   org.jenkinsci.plugins.workflow.actions.ErrorAction$ErrorId: 54bbc34a-e9b0-407c-9866-a0b807f89a82
          org.jenkinsci.plugins.workflow.support.steps.AgentOfflineException: Unable to create live FilePath for TSLDESOT118



          Same issue appeared few times for different machines.

          I'm not sure if it's related but we also saw such INFO logs on Jenkins instance during this time:
          2025-01-14 12:29:00.512+0000 [id=424462]        INFO    j.s.DefaultJnlpSlaveReceiver#channelClosed: IOHub#1: Worker[channel:java.nio.channels.SocketChannel[connected local=/10.100.29.20:56429 remote=TLSDVSOT118.company.com/10.128.80.30:62895]] / Computer.threadPoolForRemoting [#53461] for TLSDVSOT118 terminated: java.nio.channels.ClosedChannelException

          Those logs appeared almost for all machines, but we only saw similar issue only on 3 of them.




          In attachment I'm adding full log of execution
          Grzegorz made changes -
          Attachment Original: agents_disconnects_log-1.txt [ 63800 ]

            Unassigned Unassigned
            novinxy Grzegorz
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Created:
              Updated: