Associate
- Joined
- 29 Dec 2007
- Posts
- 1,414
- Location
- London
Hey guys,
I couldn't decide whether to stick this in the enterprise or network section, so I went for here.
I've a strange issue which I can't seem to get my head around.
We have 2 virtual web servers which host client websites. They're in a load balanced cluster using the Windows Server 2003 default nlb program.
I've changed pulic IP for privacy.
WEB1:
Public NIC: 10.10.10.139
Private NIC: 192.168.220.30
WEB2:
Public NIC: 10.10.10.140
Private NIC: 192.168.220.40
CLUSTER IP: 10.10.10.150
NLB settings:
Both servers unicast
Both servers have a different host priority (1, 2)
WEB1 "dedicated IP address": 10.10.10.139
WEB2 "dedicated IP address": 10.10.10.140
This has been in operation for about 2 years, but I've found a pretty major flaw today.
Now from the surface all looks well, the settings look totally correct in the nlb manager.
WEB2 was drainstopped and restarted (unrelated issue). WEB1 should obviously notice and start serving the connections, but we had instant complaints that the websites were down.
Order of events:
WEB1/2 both servicing web requests, equal load.
WEB2 drainstopped
WEB2 restarted
WEB1 stops servicing web requests
WEB2 comes back up
WEB1 still not servicing web requests
WEB2 added back to the cluster
WEB2 servicing connections
WEB1 still not servicing web requests
WEB1 drainstopped
WEB1 added back to the cluster
WEB1/2 both servicing connections
What the monkeys!?
Something that may well be to do with it is:
Opening "Network Load Balancing Manager" on either server and connect to the cluster with ip 10.10.10.150
The configuration from the whichever machine your on is loaded but we get "Host unreachable, error connecting to "othermachinename.domain.biz" when reading the configuration from the other web server.
Googling this seems to point to icmp being blocked. There is no FW between the servers. We can ping each NIC from each machine fine.
I've been reading everything I can find to do with load balancing but I'm yet to find anything which could be wrong. This was set up before I joined the company.
Any ideas/pointers?
I couldn't decide whether to stick this in the enterprise or network section, so I went for here.
I've a strange issue which I can't seem to get my head around.
We have 2 virtual web servers which host client websites. They're in a load balanced cluster using the Windows Server 2003 default nlb program.
I've changed pulic IP for privacy.
WEB1:
Public NIC: 10.10.10.139
Private NIC: 192.168.220.30
WEB2:
Public NIC: 10.10.10.140
Private NIC: 192.168.220.40
CLUSTER IP: 10.10.10.150
NLB settings:
Both servers unicast
Both servers have a different host priority (1, 2)
WEB1 "dedicated IP address": 10.10.10.139
WEB2 "dedicated IP address": 10.10.10.140
This has been in operation for about 2 years, but I've found a pretty major flaw today.
Now from the surface all looks well, the settings look totally correct in the nlb manager.
WEB2 was drainstopped and restarted (unrelated issue). WEB1 should obviously notice and start serving the connections, but we had instant complaints that the websites were down.
Order of events:
WEB1/2 both servicing web requests, equal load.
WEB2 drainstopped
WEB2 restarted
WEB1 stops servicing web requests
WEB2 comes back up
WEB1 still not servicing web requests
WEB2 added back to the cluster
WEB2 servicing connections
WEB1 still not servicing web requests
WEB1 drainstopped
WEB1 added back to the cluster
WEB1/2 both servicing connections
What the monkeys!?
Something that may well be to do with it is:
Opening "Network Load Balancing Manager" on either server and connect to the cluster with ip 10.10.10.150
The configuration from the whichever machine your on is loaded but we get "Host unreachable, error connecting to "othermachinename.domain.biz" when reading the configuration from the other web server.
Googling this seems to point to icmp being blocked. There is no FW between the servers. We can ping each NIC from each machine fine.
I've been reading everything I can find to do with load balancing but I'm yet to find anything which could be wrong. This was set up before I joined the company.
Any ideas/pointers?
). Is there anything special I need to setup in nlb manager to specify comms between the servers on the private (192.168.220.x) network?