Node cannot join existing cluster #14716
-
| Describe the bugI had a 3 node cluster. Had to reinstall one of the VMs, I ran  but I'm getting: Somehow, it says that node3 is unresponsive, even though 1 and 3 are clustered and working. If I stop node 3, and try to join 2, it works. 1 and 2 can form the cluster, but the second I try to bring back node 3, I get the same error as above, but it's node2 who timesout. Reproduction steps
 I was able to reproduce this three times already. As additional info, I'm running RabbitMQ 4.1.4-1 Expected behaviorThe node should join the cluster, as any new node would. Additional contextNo response | 
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 6 replies
-
| I converted this to a discussion because there is no evidence this is a bug in RabbitMQ. This is a decent start to a report, but what is missing are the exact commands, and transcripts of you running those commands, when you remove and reinstall RabbitMQ. There is a lot of information missing that you're asking us to guess. In steps 4 and 5, what happens if you remove the node from the cluster, and uninstall RabbitMQ from that node (remove data directory, configuration). Then, re-install RabbitMQ and attempt to re-join the cluster. If that succeeds, then there must be something in how you're re-building the node that isn't quite identical to before. | 
Beta Was this translation helpful? Give feedback.
@hac3ru sorry but
is very specific: the Erlang cookie does not match. This message is logged in only a few very closely related scenarios.
One known scenario where this can happen is RabbitMQ CLI tools running before the cookie file is created by the node. Then the CLI tools will generate a random cookie that won't match the one used by the existing cluster.