Details
-
Bug
-
Resolution: Duplicate
-
Critical
-
EAP 5.1.0 Post Release
-
None
-
Windows, Integrated with jboss5.1
Description
Cluster messages not redistributed after node hard kill.
So we have a cluster going with 2 nodes using hornetQ integrated with Jboss5. We bring the two nodes up and they discover each other fine. I bring a client up and start sending messages to a queue that is clustered across the nodes. Everything will work great for a while the messages get load balanced and all is happy. Then I can get in a strange state in a couple of different ways, but the easiest way is if I go to one of the server nodes and just kill the server process. The one server appears to detect the other node went down and adjusts its cluster view but after this happens every other message I send to the server doesn't get processed. It looks like the message is placed on the bridge queue that was connecting the two nodes. The consumer count on the bridge queue is one but the other server is dead and so it never gets picked up by anything. I would like it if the other node dies that the node that is still running would grab the messages and start running them. In this case I can restart the other node and it starts picking the messages up when it comes back up but there are other situations where this same sort of thing will happen but even after a restart of the one server it never starts grabbing messages off the brigde any more. In that case I have to restart the entire cluster