Striim 3.9.7 documentation

Continuing operation after server failure

When an application is deployed in a deployment group with more than one server, a failover copy of each event is stored on a different server from the one processing it. If one server goes down, Striim will automatically continue running the application using the failover copies of events. When the server comes back up, the application will automatically be reallocated as per its original deployment. Such failover is automatic and does not require any special options when deploying.

If a Forwarding Agent's output is partitioned (see Using the Striim Forwarding Agent) and one of several servers receiving its output goes down, the output it was receiving will be redistributed to the remaining servers while it is offline. 

The web UI runs on all servers, so if the server where users normally access the UI goes down, they can manually change the URL to use another server. Alternatively, you may implement standard Linux load-balancing across multiple servers so that in the event one fails its web UI users will automatically be switched to another.

Note

If the server hosting the Derby metadata repository goes down, the cluster will crash. See Recovering applications for further discussion.