The MongoDB Connector for Spark provides integration between MongoDB and Apache Spark.
Version 10.x of the MongoDB Connector for Spark is an all-new connector based on the latest Spark API. Install and migrate to version 10.x to take advantage of new capabilities, such as tighter integration with Spark Structured Streaming.
Version 10.x uses the new namespace
This allows you to use old versions of the connector
(versions 3.x and earlier) in parallel with version 10.x.
To learn more about the new connector and its advantages, see the MongoDB announcement blog post.
With the connector, you have access to all Spark libraries for use with MongoDB datasets: Datasets for analysis with SQL (benefiting from automatic schema inference), streaming, machine learning, and graph APIs. You can also use the connector with the Spark Shell.
The MongoDB Connector for Spark is compatible with the following versions of Apache Spark and MongoDB:
MongoDB Connector for Spark
3.1 or later
4.0 or later