Upon launching a vSphere Node Driver cluster in Rancher v2.x, nodes within the cluster are stuck in provisioning, with the message
Waiting for SSH to be available. Logging into the nodes via SSH and checking the auth log directly reveals failed SSH connection attempts for a missing
- A Rancher v2.x provisioned vSphere cluster, using the vSphere Node Driver.
When provisioning a vSphere Node Driver cluster Rancher v2.x uses cloud-init to generate an ssh-keypair for the user
docker and copy this into the Virtual Machine on initial boot.
In some Linux distributions, including Ubuntu Server 18.04, the standard OS installation process generates a cloud-init configuration. Installation of the OS is performed during the intitial setup of the VM Templates, prior to cluster provisioning via Rancher, and this existing cloud-init configuration within the Template can intefere with Rancher's ability to insert its own cloud-init.
Convert the Template back to a VM and run:
sudo cloud-init clean
This command will clean the Template of any existing cloud-inits, once complete you can convert the VM back to a template to try again.