• Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Major Major
    • ec2-plugin
    • None
    • Jenkins ver. 2.204.2
      ec2 plugin 1.49.1

      When trying to provision a new agent the plugin would start and then terminate an EC2 instance several times before succeeding in the end.
      I have no explanation for this behavior. Might be related to JENKINS-61343
      Might be because the plugin somehow allocates "2 computer(s)" even though the instance cap is 1.

      Mar 06, 2020 3:05:22 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      bootstrap()
      
      Mar 06, 2020 3:05:22 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      Getting keypair...
      
      Mar 06, 2020 3:05:22 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      Using private key j4a-ec2-ssh-key (SHA-1 fingerprint a7:b4:70:08:35:11:e3:cf:4b:f5:92:57:b8:02:7f:c6:8e:54:52:02)
      
      Mar 06, 2020 3:05:22 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      Authenticating as admin
      
      Mar 06, 2020 3:05:22 PM INFO hudson.slaves.NodeProvisioner lambda$update$6
      
      EC2 (ec2) - ec2 (ami-028d96c69234f9d1a) provisioning successfully completed. We have now 2 computer(s)
      
      Mar 06, 2020 3:05:22 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      Connecting to 10.20.4.41 on port 22, with timeout 10000.
      
      Mar 06, 2020 3:05:29 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      Connected via SSH.
      
      Mar 06, 2020 3:05:29 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      connect fresh as root
      
      Mar 06, 2020 3:05:29 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      Connecting to 10.20.4.41 on port 22, with timeout 10000.
      
      Mar 06, 2020 3:05:30 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      Connected via SSH.
      
      Mar 06, 2020 3:05:30 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      Creating tmp directory (/tmp) if it does not exist
      
      Mar 06, 2020 3:05:30 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      Verifying: java -fullversion
      
      Mar 06, 2020 3:05:30 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      Verifying: which scp
      
      Mar 06, 2020 3:05:30 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      Copying remoting.jar to: /tmp
      
      Mar 06, 2020 3:05:30 PM INFO hudson.plugins.ec2.EC2Cloud log
      
      Launching remoting agent (via Trilead SSH2 Connection):  java  -jar /tmp/remoting.jar -workDir /opt/jenkins
      
      Mar 06, 2020 3:05:31 PM INFO hudson.plugins.ec2.EC2OndemandSlave terminate
      
      Terminated EC2 instance (terminated): i-021d76d0ffff3375f
      
      Mar 06, 2020 3:05:31 PM INFO hudson.plugins.ec2.EC2OndemandSlave terminate
      
      Removed EC2 instance from jenkins master: i-021d76d0ffff3375f
      
      Mar 06, 2020 3:05:32 PM INFO hudson.plugins.ec2.EC2Cloud provision
      
      SlaveTemplate{ami='ami-028d96c69234f9d1a', labels='docker docker-bakery'}. Attempting to provision slave needed by excess workload of 1 units
      
      Mar 06, 2020 3:05:32 PM INFO hudson.plugins.ec2.SlaveTemplate logProvisionInfo
      
      SlaveTemplate{ami='ami-028d96c69234f9d1a', labels='docker docker-bakery'}. Considering launching
      

          [JENKINS-61370] EC2 instances are terminated during launch

          After changing to "SSH process" connection method I was able to see additional errors, which are swallowed by the Trilead java connector.

          Exception in thread "main" java.nio.file.AccessDeniedException: /opt/jenkins
          	at sun.nio.fs.UnixException.translateToIOException(UnixException.java:84)
          	at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:102)
          	at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:107)
          	at sun.nio.fs.UnixFileSystemProvider.createDirectory(UnixFileSystemProvider.java:384)
          	at java.nio.file.Files.createDirectory(Files.java:674)
          	at java.nio.file.Files.createAndCheckIsDirectory(Files.java:781)
          	at java.nio.file.Files.createDirectories(Files.java:767)
          	at org.jenkinsci.remoting.engine.WorkDirManager.initializeWorkDir(WorkDirManager.java:211
          

          Turns out that if you mount a block device in user-data script it can sometimes not be available when the master SSH connection comes in.
          The solution seems to be to use the init-script instead.

          Jakub Bochenski added a comment - After changing to "SSH process" connection method I was able to see additional errors, which are swallowed by the Trilead java connector. Exception in thread "main" java.nio.file.AccessDeniedException: /opt/jenkins at sun.nio.fs.UnixException.translateToIOException(UnixException.java:84) at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:102) at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:107) at sun.nio.fs.UnixFileSystemProvider.createDirectory(UnixFileSystemProvider.java:384) at java.nio.file.Files.createDirectory(Files.java:674) at java.nio.file.Files.createAndCheckIsDirectory(Files.java:781) at java.nio.file.Files.createDirectories(Files.java:767) at org.jenkinsci.remoting.engine.WorkDirManager.initializeWorkDir(WorkDirManager.java:211 Turns out that if you mount a block device in user-data script it can sometimes not be available when the master SSH connection comes in. The solution seems to be to use the init-script instead.

            thoulen FABRIZIO MANFREDI
            jbochenski Jakub Bochenski
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Created:
              Updated: