{
  "id": "cluster-recovery",
  "title": "Recover a failed cluster",
  "url": "https://redis.io/docs/latest/operate/rs/7.8/clusters/cluster-recovery/",
  "summary": "How to use the cluster configuration file and database data to recover a failed cluster.",
  "content": "When a Redis Enterprise Software cluster fails,\nyou must use the cluster configuration file and database data to recover the cluster.\n\n\nFor cluster recovery in a Kubernetes deployment, see [Recover a Redis Enterprise cluster on Kubernetes]().\n\n\nCluster failure can be caused by:\n\n- A hardware or software failure that causes the cluster to be unresponsive to client requests or administrative actions.\n- More than half of the cluster nodes lose connection with the cluster, resulting in quorum loss.\n\nTo recover a cluster and re-create it as it was before the failure,\nyou must restore the cluster configuration `ccs-redis.rdb` to the cluster nodes.\nTo recover databases in the new cluster, you must restore the databases from persistence files such as backup files, append-only files (AOF), or RDB snapshots.\nThese files are stored in the [persistent storage location]().\n\nThe cluster recovery process includes:\n\n1. Install Redis Enterprise Software on the nodes of the new cluster.\n1. Mount the persistent storage with the recovery files from the original cluster to the nodes of the new cluster.\n1. Recover the cluster configuration on the first node in the new cluster.\n1. Join the remaining nodes to the new cluster.\n1. [Recover the databases]().\n\n## Prerequisites\n\n- We recommend that you recover the cluster to clean nodes.\n    If you use the original nodes,\n    make sure there are no Redis processes running on any nodes in the new cluster.\n- We recommend that you use clean persistent storage drives for the new cluster.\n    If you use the original storage drives,\n    make sure you back up the files on the original storage drives to a safe location.\n- Identify the cluster configuration file that you want to use as the configuration for the recovered cluster.\n    The cluster configuration file is `/ccs/ccs-redis.rdb` on the persistent storage for each node.\n\n## Recover the cluster\n\n1. 
(Optional) If you want to recover the cluster to the original cluster nodes, uninstall Redis Enterprise Software from the nodes.\n\n1. [Install Redis Enterprise Software]() on the new cluster nodes.\n\n    The new servers must have the same basic hardware and software configuration as the original servers, including:\n\n    - The same number of nodes\n    - At least the same amount of memory\n    - The same Redis Enterprise Software version\n    - The same installation user and paths\n\n    The cluster recovery can fail if these requirements are not met.\n\n1. Mount the persistent storage drives with the recovery files to the new nodes.\n    These drives must contain the cluster configuration backup files and database persistence files.\n\n    Make sure that the user `redislabs` has permissions to access the storage location\n    of the configuration and persistence files on each of the nodes.\n\n    If you use local persistent storage, place all of the recovery files on each of the cluster nodes.\n\n1. To recover the original cluster configuration, run [`rladmin cluster recover`]() on the first node in the new cluster:\n\n    ```sh\n    rladmin cluster recover filename [ \u003cpersistent_path\u003e | \u003cephemeral_path\u003e ]\u003cfilename\u003e node_uid \u003cnode_uid\u003e rack_id \u003crack_id\u003e\n    ```\n\n    For example:\n\n    ```sh\n    rladmin cluster recover filename /tmp/persist/ccs/ccs-redis.rdb node_uid 1 rack_id 5\n    ```\n\n    When the recovery command succeeds,\n    this node is configured as the node from the old cluster that has ID 1.\n\n1. 
To join the remaining nodes to the new cluster, run [`rladmin cluster join`]() from each new node:\n\n    ```sh\n    rladmin cluster join nodes \u003ccluster_member_ip_address\u003e username \u003cusername\u003e password \u003cpassword\u003e replace_node \u003cnode_id\u003e\n    ```\n\n    For example:\n\n    ```sh\n    rladmin cluster join nodes 10.142.0.4 username admin@example.com password mysecret replace_node 2\n    ```\n\n1. Run [`rladmin status`]() to verify that the recovered nodes are now active and that the databases are pending recovery:\n\n    ```sh\n    rladmin status\n    ```\n\n    Make sure that you update your [DNS records]()\n    with the IP addresses of the new nodes.\n\nAfter the cluster is recovered, you must [recover the databases]().\n",
  "tags": ["docs","operate","rs"],
  "last_updated": "2026-04-01T08:10:08-05:00"
}

