IBM Cloud Docs
Recovering your location

The following steps outline the general flow of recovering from a disaster event within a Satellite location.

  1. Replace any unhealthy infrastructure in the location control plane. Remove the unhealthy hosts, then attach new hosts and assign them to the control plane.

  2. After your control plane is healthy and has sufficient capacity for the services running in the Satellite location, the Satellite platform runs the automated restoration process.

    Typically at this stage, the location shows the following warning: R0025: The Satellite location has OpenShift clusters in critical health.

  3. Open a support case to track the status of the automated recovery. In the case details, provide the following information.

    Satellite Location: LOCATION-ID had a disaster event across the infrastructure associated with the Satellite location. We have proceeded to recover/replace the unhealthy infrastructure within the location control plane and have sufficient capacity to run all cluster control planes. These are the following OpenShift clusters within the location:
       CLUSTER-ID
       CLUSTER-ID
    
  4. After the automated restoration process completes, the R0025 message is removed and the location is ready for deployments.

  5. Recover or replace any unhealthy infrastructure in the data plane of each OpenShift cluster within the location. Remove unhealthy worker nodes. Attach new hosts and assign them as worker nodes. Repeat this process until all worker nodes in the cluster are healthy and running.

  6. Check the status of the cluster components.

  7. Begin disaster recovery (DR) for your applications and persistent storage. Consult the appropriate application-specific or storage solution documentation for more details.
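
As a minimal sketch of step 3, the support case details can be filled in from a location ID and a list of cluster IDs. The function name `build_support_case_details` and the sample IDs are hypothetical and used only for illustration; the message wording is taken from the case details shown in step 3. The sketch only builds the text. It does not open a case or call any IBM Cloud API.

```python
def build_support_case_details(location_id: str, cluster_ids: list[str]) -> str:
    """Fill in the support case text from step 3 for one Satellite location.

    Hypothetical helper: it only formats text and contacts no service.
    """
    if not location_id:
        raise ValueError("location_id is required")

    # Wording copied from the case details in step 3, with LOCATION-ID filled in.
    header = (
        f"Satellite Location: {location_id} had a disaster event across the "
        "infrastructure associated with the Satellite location. We have "
        "proceeded to recover/replace the unhealthy infrastructure within the "
        "location control plane and have sufficient capacity to run all "
        "cluster control planes. These are the following OpenShift clusters "
        "within the location:"
    )

    # One indented line per cluster, replacing each CLUSTER-ID placeholder.
    cluster_lines = [f"   {cluster_id}" for cluster_id in cluster_ids]
    return "\n".join([header, *cluster_lines])


if __name__ == "__main__":
    # Sample IDs are placeholders, not real resources.
    print(build_support_case_details("my-location-id", ["cluster-id-1", "cluster-id-2"]))
```

Paste the printed text into the case details, replacing the sample IDs with the ID of your location and the IDs of every OpenShift cluster in it.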