I am experiencing an issue with Latency with my BOSH setup and looking for advise / ideas on where the bottle neck might be.
My Setup.
- Windows,
- Apache 2.23
- Openfire 3.7.1
- Candy 1.0.9
When I first had users test the service, it was great until about 50 connections, then I had major reports of latency and some dropped connections. Quickly it became unusable
This test was running on a virtual server so I replicated it on one of our internal servers to rule out a hardware issue and a similar thing happened.
During both tests, CPU and RAM were fine so it did not point to a hardware issue and two different networks were involved (1 internal, 1 external) and so I think that is also in the clear.
To determine where the bottleneck might be, I setup 2 identical rooms. One in Openfire’s spank directory and the other in Apache’s htdocs directory.
I then had 50-100 users join the room: **http://example.com:7070/testroom **which had no issues. It was fast and performed very well with the data being served up by openfire’s Jetty server. (Good news for Openfire)
I then had 50-100 users join the room: http://example.com/testroom which began to have problems again at about 50 users. The issues were with latency and disconnects. This room was served up via apache and a .htaccess file proxying the connection over to port 7070 on the same server (to Openfire).
So based on this test, it would appear the issue is an apache bottleneck.
I am just wondering if anyone has any thoughts on this.
Also, if you have a large bosh deployment, whether or not they have seen that with Apache and if there are any apache tweaks/settings that may fix this issue.
Ultimately I am looking to connect about 300 users with them each being in 1-2 rooms. I would use port 7070 / 7443 directly (jetty) but the will be locked in our live setup so I am limited to port 443 (7443 is blocked externally because of PCI compliance, Openfire’s port 7443 uses a weak cipher)
Any help is appreciated. Thanks.