MongoDB
Atlas Data Lake

Query and analyze data across AWS S3 and MongoDB Atlas in-place and in its native format using the MongoDB Query Language (MQL).

Why MongoDB Atlas Data Lake?

Unlock the value of your data with a serverless, scalable data lake. Combine and analyze live and historical data without data movement or operational overhead and pay only for queries run.
MongoDB Atlas Data Tiering

Tier your data,
query it where it lives

Automatically tier your data across fully managed databases and cloud object storage with Atlas Online Archive. Combine and analyze data in-place with federated queries and easily persist the results of your aggregation pipelines to your preferred storage tier.

MongoDB Atlas Data Lake Federated Query

Analyze rich data
easily and intuitively

Natively query your richly structured data across your database and AWS S3 store in-place using a single connection string. Run powerful, easy-to-understand aggregations using the MongoDB Query Language (MQL) for a consistent experience across data types.

Work with data on-demand,
at any scale

Atlas Data Lake is serverless, so there is no infrastructure to set up or manage and no need to predict capacity. You only pay for the queries run when actively working with your data. Scale your data lake to deliver performance by parallelizing workloads and enable global data lake analytics.

Fully integrated with the
MongoDB Cloud Platform

Spin up your data lake right alongside your operational Atlas database clusters with a few clicks from a common UI and start querying data instantly.

Learn More about MongoDB Cloud

See Atlas Data Lake In Action

Ready to get started?

Try MongoDB Atlas Data Lake today

The Right Features For Your Data Lake

Multiple Formats

Analyze data stored in JSON, BSON, CSV, TSV, Avro, ORC and Parquet in place without the complexity, cost, and time-sink of data ingestion and transformation.

Powerful Aggregations

Run powerful, modular and easy-to-understand aggregations using the MongoDB Query Language (MQL) and persist the results to your preferred storage tier.

Federated Query

Run a single query to analyze your live MongoDB Atlas data and historical data on Amazon S3 together and in-place for faster insights.

Serverless

There's no infrastructure to set up and manage - simply provide access to your existing AWS S3 buckets and start running queries immediately.

On Demand

Pay only for the queries run and only when actively working with your data. Eliminate the need to predict demand or capacity.

Fully Integrated

Fully integrated with the MongoDB Cloud Platform for provisioning, access, billing and support.

Atlas Data Lake was key to maintaining our company’s growth in a healthy way. It made it easier for us to access data in any storage layer because the query that we type in for applications to access hot data in Atlas is going to be the same query that we’re going to use to access the cold data in S3. It’s like we snap our fingers and it’s done.

Igor Agenor Piovezan, Software Specialist Developer, Segware



Atlas Data Lake was key to maintaining our company’s growth in a healthy way. It made it easier for us to access data in any storage layer because the query that we type in for applications to access hot data in Atlas is going to be the same query that we’re going to use to access the cold data in S3. It’s like we snap our fingers and it’s done.

Igor Agenor Piovezan, Software Specialist Developer, Segware

MongoDB Atlas Data Lake Architecture

MongoDB Atlas Data Lake Architecture Diagram

Pricing

MongoDB Atlas Data Lake is a fully managed data lake as a service with pricing based on data processed and data returned.
Data processed
Data returned
$5 per TB with a 10MB minimum per query
Standard AWS data transfer rates

Frequently Asked Questions

What is MongoDB Atlas Data Lake?

MongoDB Atlas Data Lake is a fully managed data lake as a service that allows you to natively query and analyze data across AWS S3 and MongoDB Atlas in-place. You can seamlessly combine and analyze your richly structured data stored in JSON, BSON, CSV, TSV, Avro, ORC and Parquet formats without the cost and complexity of data movement and transformation.

What is MongoDB Atlas Online Archive?

With MongoDB Atlas Online Archive you can automatically tier your data based on performance requirements for a more efficient system. Move infrequently accessed data from your MongoDB Atlas databases to queryable archival storage to reduce costs while preserving easy access to your archives.

Is MongoDB Atlas Data Lake available on AWS?

MongoDB Atlas Data Lake allows you to query your AWS S3 data in-place and in its native format. Simply spin up a data lake with a few clicks from the MongoDB Atlas UI and connect to your own AWS S3 buckets to begin querying and analyzing your data.

How do I use MongoDB Atlas Data Lake?

MongoDB Atlas Data Lake is a self-serve application that can be accessed and set up through the MongoDB Atlas control plane. Create and connect to a data lake, configure databases and collections from files stored in AWS S3, and run powerful aggregations using the MongoDB Query Language (MQL) and tools.

You can connect your own AWS S3 buckets or leverage Atlas Online Archive to automatically tier your MongoDB Atlas data to fully managed cloud object storage and query it in-place. Expose all of your historical data to your real-time application for new insights and an improved user experience.

Where can I learn more about MongoDB Atlas Data Lake?

Check out our documentation to learn more about getting started with Atlas Data Lake.