Disaster Recovery
This page is designed to be printed
Last updated
Was this helpful?
This page is designed to be printed
Last updated
Was this helpful?
As much as we don't want it to happen, somethings things die. Sometimes power runs out.
This documentation can be followed for the below scenarios
Power outage
Server hardware failure
Check all infrastructure is powered on (look for power lights)
Refer to Physical Hardware section on the left
Remove faceplate from NTD and confirm powered on
Confirm is powered on
Network Test
Ping 8.8.8.8 to confirm internet is working
Ping google.com to confirm external DNS is working
Ping setup.ui.com to confirm internal DNS is working
Internal link loads login page (use Linux credentials)
All storage pools are online
VM's show and are booting (there is a delay between boots so some may be on, others off)
Internal link loads login page (use Linux credentials)
All storage pools are online
Open Storage Manager and confirm 'system is healthy'
An excessive, but very thorough way, to check all services are online is to go through each page in this doco and trying to access any "link to app" links
Unfortunately I'm unable to write specific doco here as there is to much to capture. Please refer to the troubleshooting section on the left panel and/or the hints below
Compare the down services against the Cloudflare tunnels - are they all on the 1 server?
Confirm is accessible
Confirm is accessible
Confirm the is accessible (creds in vault)
Log into and confirm that all services are green. It may take 15 minutes for them all to report as online
Confirm servers are reporting data back to and check for any alerts Alerts related to disk backlog, IO delay or disk usage can be ignored for now. Backlog and IO delay can be caused by multiple VM's starting up ay once