Create Backup of a Sharded Cluster with Filesystem Snapshots
Overview
This document describes a procedure for taking a backup of all components of a sharded cluster. The procedure uses file system snapshots to capture a copy of each mongod instance. An alternate procedure uses mongodump to create binary database dumps when file system snapshots are not available; see Create Backup of a Sharded Cluster with Database Dumps.
See Sharded Cluster Backup Considerations for a higher-level overview of backing up a sharded cluster, as well as links to other tutorials that provide alternate procedures.
Important
To capture a point-in-time backup from a sharded cluster, you must stop all writes to the cluster. On a running production system, you can only capture an approximation of a point-in-time snapshot.
Procedure
In this procedure, you will stop the cluster balancer, take a backup of the config database, and then take backups of each shard in the cluster using a file system snapshot tool. If you need an exact moment-in-time snapshot of the system, you must stop all application writes before taking the file system snapshots; otherwise, the snapshot will only approximate a moment in time.
For approximate point-in-time snapshots, you can improve the quality of the backup while minimizing impact on the cluster by taking the backup from a secondary member of the replica set that provides each shard.
Disable the balancer process that equalizes the distribution of data among the shards. To disable the balancer, use the sh.stopBalancer() method in the mongo shell. For more information, see the Disable the Balancer procedure.
Warning
It is essential that you stop the balancer before creating backups. If the balancer remains active, your resulting backups could have duplicate data or miss some data, as chunks may migrate while recording backups.
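As a minimal sketch of the balancer step, the function below disables the balancer and then confirms that it is off before any backup work starts. It assumes a mongo shell connected to a mongos, where the `sh` helper is predefined. The name `pauseBalancer` is hypothetical and is not part of MongoDB.

```javascript
// Sketch for the mongo shell, connected to a mongos.
// pauseBalancer is a hypothetical helper name; `sh` is the shell's
// built-in sharding helper, passed in as an argument.
function pauseBalancer(sh) {
  // Disable the balancer before any backup work starts.
  sh.stopBalancer();
  // getBalancerState() reports whether the balancer is currently enabled.
  var enabled = sh.getBalancerState();
  if (enabled) {
    throw new Error("Balancer is still enabled; do not start the backup.");
  }
  return enabled;
}
```

In the mongo shell you would call `pauseBalancer(sh)`. Checking the state afterward is a defensive choice for this sketch, so that a backup never begins while chunks can still migrate.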
Lock one member of each replica set in each shard so that your backups reflect the state of your database at the nearest possible approximation of a single moment in time. Lock these mongod instances in as short an interval as possible.
To lock or freeze a sharded cluster, you must:
- Use the db.fsyncLock() method in the mongo shell connected to a single secondary member of the replica set that provides each shard.
- Shut down one of the config servers to prevent all metadata changes during the backup process.
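The locking step for a single shard might look like the sketch below. It assumes a mongo shell connected directly to the chosen member of that shard's replica set, where `db` is the shell's database handle. The name `lockSecondaryForSnapshot` is hypothetical, and the secondary check via `db.isMaster()` is an added safeguard rather than part of the procedure above.

```javascript
// Sketch for the mongo shell, connected directly to one shard member.
// lockSecondaryForSnapshot is a hypothetical helper name.
function lockSecondaryForSnapshot(db) {
  // Added safeguard: refuse to lock a member that is not a secondary,
  // so the primary keeps accepting writes.
  if (!db.isMaster().secondary) {
    throw new Error("Connect to a secondary member before locking.");
  }
  // fsyncLock() flushes pending writes to disk and blocks further
  // writes on this member until fsyncUnlock() is called.
  var result = db.fsyncLock();
  if (!result.ok) {
    throw new Error("fsyncLock failed: " + JSON.stringify(result));
  }
  return result;
}
```

Repeat the call once per shard, each time from a shell connected to that shard's chosen secondary, and keep the interval between the calls as short as you can.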
Use mongodump to back up one of the config servers. This backs up the cluster’s metadata. You only need to back up one config server, as they all hold the same data. Run mongodump against one of the config mongod instances or via the mongos.
Back up the replica set members of the shards that you locked. You may back up the shards in parallel. For each shard, create a snapshot using the procedures in Use Filesystem Snapshots to Backup and Restore MongoDB Databases.
Unlock all locked replica set members of each shard using the db.fsyncUnlock() method in the mongo shell.
Re-enable the balancer with the sh.setBalancerState() method. Run it from a mongo shell connected to the mongos, passing true to enable the balancer.
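The final two steps can be sketched as a pair of small functions. This sketch assumes that `db` comes from a mongo shell connected to a previously locked shard member, and that `sh` comes from a mongo shell connected to the mongos. The names `unlockAfterSnapshot` and `resumeBalancer` are hypothetical.

```javascript
// Sketch for the mongo shell. Both function names are hypothetical.

// Run against each shard member that was locked with db.fsyncLock().
function unlockAfterSnapshot(db) {
  var result = db.fsyncUnlock();
  if (!result.ok) {
    throw new Error("fsyncUnlock failed: " + JSON.stringify(result));
  }
  return result;
}

// Run once, from a shell connected to the mongos.
function resumeBalancer(sh) {
  // Passing true enables the balancer; passing false would disable it.
  sh.setBalancerState(true);
  // Return the reported state so the caller can confirm it is enabled.
  return sh.getBalancerState();
}
```

Unlock every locked member before calling `resumeBalancer(sh)`, so that chunk migrations do not resume against members that still block writes.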