Hey Mike,
My previous suspisions that the problem was fixed were incorrect. However in doing the full gamut of testing I have been able to determine that the issue is being caused by the worker.properties/tomcat configuration.
Just to review, we have two apache front end servers that pass traffic to the two cms servers. If I set up the worker.properties file so that there is only one worker in a non load balanced configuration I’m not able to reproduce the error.
On Web01 on the worker.properties is the following
worker.list=boe1
worker.boe1.port=8009
worker.boe1.host=CMS01
worker.boe1.type=ajp13
worker.boe1.socket_keepalive=true
On Web02
worker.list=boe1
worker.boe1.port=8009
worker.boe1.host=CMS02
worker.boe1.type=ajp13
worker.boe1.socket_keepalive=true
***Now here is the configuration that is producing the kickout.
On WEB01
#The Advanced router LB
worker.list=router
#Define a worker using ajp13
worker.boe1.port=8009
worker.boe1.host=CMS01
worker.boe1.type=ajp13
worker.boe1.lbfactor=1
#prefered Failover node for boe1
worker.boe1.redirect=boe2
#Define another worker using ajp13
worker.boe2.port=8009
worker.boe2.host=CMS02
worker.boe2.type=ajp13
worker.boe2.lbfactor=1
worker.boe2.socket_keepalive=1
worker.boe2.socket_timeout=60
#disable boe2 except for failover
worker.boe2.activation=disabled
#Define th LB worker
worker.router.type=lb
worker.router.balance_workers=boe1,boe2
#Define keepalive
#worker.router.socket_keepalive=1
On WEB02
#The Advanced router LB
worker.list=router
#Define a worker using ajp13
worker.boe1.port=8009
worker.boe1.host=CMS02
worker.boe1.type=ajp13
worker.boe1.lbfactor=1
#prefered Failover node for boe1
worker.boe1.redirect=boe2
#Define another worker using ajp13
worker.boe2.port=8009
worker.boe2.host=CMS01
worker.boe2.type=ajp13
worker.boe2.lbfactor=1
worker.boe2.socket_keepalive=1
worker.boe2.socket_timeout=60
#disable boe2 except for failover
worker.boe2.activation=disabled
#Define th LB worker
worker.router.type=lb
worker.router.balance_workers=boe1,boe2
#Define keepalive
#worker.router.socket_keepalive=1
When this configuration is in place we are being kicked out of the application at random times as described previously in the post. Any ideas as to why this is happening?
aheeter (BOB member since 2008-04-30)