How to Set Up a MongoDB Cluster

In order to create a cluster you will need a MongoDB Atlas account.

As a modern database, MongoDB was initially built with the cloud in mind and now has built-in features to help maintain a high availability and easy scalability through distributed workloads. While it’s true that it can run as a single instance, most of the time it runs as a cluster. In this article, you will learn about the different types of clusters in MongoDB and how you can set them up in MongoDB Atlas.

What is a MongoDB Cluster?

In MongoDB, clusters can refer to two different architectures. They can either mean a replica set or a sharded cluster. Let’s take a closer look at both.

Replica Sets

A MongoDB replica set is a group of one or more servers containing the exact copy of the data. While it’s technically possible to have one or two nodes, the recommended minimum is three. A primary node is responsible for providing your application’s read and write operations, while two secondary nodes contain a replica of the data.

A diagram showing a client application with read and write access to a primary node. Arrows are showing that the data from the primary node is asynchronously replicated in the secondary nodes.

A typical replica set in MongoDB.

Should the primary node become unavailable for some reason, a new primary node would be picked by an election process. This new primary node is now responsible for the read and write operations.

A diagram showing a client application connecting to a new primary node. The primary node is labelled unavailable. An arrow indicates that replication is still occurring with the secondary node.

If a primary node is unavailable, the traffic from the client application is redirected to a new primary node.

Once the faulty server comes back online, it will sync up with the primary node and become a new secondary node in the cluster.

A diagram showing a client application connecting to a primary node. Arrows indicate replication with the secondary nodes. The primary node from the previous diagram is now labelled “secondary node”.

When the previous primary node comes back online, it comes back as a secondary node.

The goal is to provide your application with high availability over your data. Even in a server failure, your client application can still connect to the cluster and access the data, reducing the overall potential downtime.

Sharded Clusters

A sharded cluster is a way to scale horizontally by distributing your data across multiple replica sets. When a read or write operation is performed on a collection, the client sends the request to a router (mongos). The router will then validate which shard the data is stored in via the configuration server and send the requests to the specific cluster.

A diagram showing a client application connecting to a router. An arrow shows a relation between the configuration server and the router. A line is split in three, indicating that the request is sent from the router to a specific shard.

A typical sharded cluster in MongoDB.

Each of the shards would contain its own replica set. You should also have more than one router or configuration server to ensure high availability. With this type of architecture, you can scale your database as much as you want without compromising availability or worrying about storage capacity.

Creating a MongoDB Cluster

Depending on your needs, there are multiple ways to create a MongoDB cluster. The easiest way is using Atlas, the Database-as-a-Service platform by MongoDB. You can find detailed instructions in the documentation. If you need to run MongoDB on your infrastructure, the instructions are provided later in this article.

To create a MongoDB cluster in Atlas, follow these steps.

  1. Log in to your MongoDB Atlas account at
  2. Click on the “Create” button.
  3. Choose your cluster type (dedicated, serverless, shared).
  4. Choose your cloud provider and region.
  5. Click on “Create cluster.”

Your MongoDB cluster will start its provisioning and will be available to you in a few minutes. As you create your cluster, you will see many options to accommodate your specific needs. Each of those setup options is covered in the next section.

Setting Up a MongoDB Cluster on Atlas

Every application is different, and MongoDB Atlas provides you with numerous ways to set up your cluster to suit your specific needs. Some particular configurations need to be thought of ahead of time, while you can change others on the fly. Using these settings, you will put in place all the best practices for Atlas in production. In this section, you will learn more about the various configurations that you can adjust on your initial cluster creation.

Deployment Type

The deployment type is the first option you will need to pick. Based on what you decide for the type of instance, the other configuration options will vary.

  • Serverless: This type of cluster is the most flexible one from a pricing point of view. It’s meant for applications that have infrequent or variable traffic. The possible configurations are kept at the bare minimum.
  • Dedicated: A dedicated cluster is meant for production loads. It can support a wide range of server sizes as well as advanced configurations. You should choose this for your production environment.
  • Shared: These clusters are meant to be a way to explore MongoDB. They can provide you with a sandbox where you can try out MongoDB for free. The server configurations available are somewhat limited.

You can find more information about the different database deployment types in the documentation.

Global Cluster Configuration

If you need multiple sharded clusters with read and write operations in specific locations, you will need to enable Global Cluster Configuration. From here, you can choose exactly where you want each of your clusters and configure the mappings between the user country and the server they will use to access the data.

Cloud Provider and Region

No matter which deployment type you picked, you will need to choose the cloud provider, along with the specific region in which you want to deploy your cluster. You can instantiate MongoDB clusters on any of the three major cloud providers. If you want to ensure even better availability, you can deploy each node of your cluster on different regions or even different clouds. To do so, you will need to enable the Multi-Cloud, Multi-Region & Workload Isolation option. From here, you will be able to configure the number and types of nodes (electable, read-only, or analytical) that will be part of your replica set.

Cluster Tier

Now that you’ve picked a region and cloud provider, you will need to choose which tier you want to use for the nodes in your cluster. This configuration will have the most significant impact on the pricing of your cluster. There is a wide range of options available, and you can further tweak each of them. Take into consideration the amount of CPU and RAM you will need. Your resource needs will help you find the right tier for your cluster.

You can then further tweak the cluster configuration by adjusting the storage size, toggling the auto-scaling options and the IOPS that you will need. On AWS higher tiers (M40+), you will also be able to choose the class of servers (low-CPU, general, or local NVMe SSD), which will also impact the number of CPUs, RAM, and storage capacity.

Additional Settings

In this last tab, you will find many additional services that you can add to your cluster. The first option is the MongoDB version that you want to use for MongoDB. You will then also have the option to enable or disable the automatic backups. You can also expand the additional settings, which will provide you with more advanced options such as sharding your cluster, adding the BI connector, and managing your encryption keys.

Most of the settings you set can be changed on demand in the future, which is a powerful ability as your application evolves.

Creating a MongoDB Cluster in Different Environments

If a cloud-based instance of MongoDB is not an option for you, or if you need to run a cluster on your infrastructure, you can install MongoDB on supported operating systems. In addition to MongoDB itself, installing Ops Manager or Cloud Manager is recommended to manage your clusters.

Create a MongoDB Cluster in Linux (Ubuntu, CoreOS)

To create a MongoDB cluster in Linux, you will need three instances of MongoDB running. These instances need to be able to communicate with each other on a local network.

You can find the detailed instructions to set up your cluster in Linux in the documentation.

Create a MongoDB Cluster with Docker

If you want to install a local instance of MongoDB in your environment or set up an ephemeral development environment that you can share with your teammates, you might want to use Docker. You can do so by starting three local instances of MongoDB in Docker, then following the instructions in the documentation.

Next Steps

Now that you know how to create and set up your cluster, you might want to learn more about finding the right cluster topology for your needs. Why not continue your learning with the cluster setup topology video on MongoDB University? Once you are comfortable with all the possible configurations for your MongoDB cluster, you can try MongoDB Atlas for free. With your cluster set up as you see fit, you can now ensure that your data will be made available to your users and that you will be able to scale once you need to.

Ready to try for yourself?

Get practical and hands-on experience by creating and setting up a cluster today. All you need is a free Atlas account.