Setup:
-
2 nodes in cluster using Hazelcast+AWS connector (OpenJDK 8u101, OpenFire 4.0.2, HazelCast 2.2.0)
-
2 desktop clients connected to different nodes
Steps to reproduce:
- Send a message from client A (connected to node A) to client B (connected to node B)
- Client B receives the message
- Send a SIGTERM to the OpenFire process running on node A
- Restart OpenFire on node A
- Reconnect client A
- Send a message from client A (connected to node A) to client B (connected to node B)
- Client B receives the message
- Send a SIGTERM to the OpenFire process running on node B
- Restart OpenFire on node B
- Reconnect client B
- Send a message from client A (connected to node A) to client B (connected to node B)
Results:
Message never arrives to client B
Logs from node B: [Java] openfire bug - Pastebin.com
Notes:
This HazelCast issue (Null value on compute remove · Issue #7020 · hazelcast/hazelcast · GitHub ) seems similar.
I tried the same test using openJDK 8-b132, but the bug is still reproduced.
I tried the same test using openJDK 9-b132, but OpenFire doesn’t boot.