4.8 Failover
If a component is corrupted in the main server cluster, or if it is time to install the latest build, failover is the best method to switch to another Master Controller. Failover is the process of switching from your primary server cluster to your backup system.
Failover happens when the primary system needs brought down or fails, and the backup system takes over all properly scheduled tasks. The secondary system is NOT a copy of the primary. Common reasons for failover include:
- rebuilding FSS on MC00
- installing software on MC00 (best to do on back-up servers first, then run off of those while installing on MC00)
- rebuilding the configuration on MC00
- crash of any major component on MC00
Reminder: Failover only works if you have scheduled all of your required tasks and synching correctly and follow failover procedures. Go through your list of tasks and make sure to check the failover box on required tasks.
The basic steps in failover include using the Administration Interface to set the MC status to fail and then restoring the MC status the same way.
For specific instructions, please see the job sheets below.
Job Sheets: MC Failover | MC Restore