MongoDB has just released an all-new version of our Spark Connector. This article discusses the background behind the MongoDB Spark Connector and some of the key features of the new release.
Why a new version?
The current version of the MongoDB Spark Connector was written in 2016 and is based on Version 1 of the Spark Data Source API. This API is still supported, but Databricks has released an updated version, Version 2 of the API, making it easier for data sources like MongoDB to work within the Spark ecosystem. By using the new MongoDB Spark Connector, which is built on Version 2 of the Data Source API, you’ll immediately benefit from capabilities such as tighter integration with Spark Structured Streaming.
MongoDB will continue to support Version 1 until Databricks deprecates Version 1 of its Data Source API, but no new features will be implemented, and upgrades to that Connector will include only bug fixes and support for current versions of Spark.
Which version should I use?
The new Spark Connector (Version 10.0) is not intended to be a direct replacement for applications that use the current MongoDB Spark Connector. Note that the new connector uses a different namespace, “com.mongodb.spark.sql.connector.MongoTableProvider”, versus the original Spark Connector, which uses “com.mongodb.spark.DefaultSource”. Having a different namespace makes it possible to use both versions of the Connector within the same Spark application. This is helpful in unit testing your application with the new Connector and making the transition on your timeline.
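Because the namespaces differ, a single Spark session can load the same collection through both connectors and compare the results during migration. The sketch below assumes both connector JARs are on the classpath and a MongoDB deployment is reachable; the URI, database, and collection names are placeholders you would replace with your own.

# Read a collection through the original (V1) connector, addressed by its
# DefaultSource class:
df_v1 = (spark.read
    .format("com.mongodb.spark.DefaultSource")
    .option("spark.mongodb.input.uri", "mongodb://localhost/test.myCollection")
    .load())

# Read the same collection through the new (V10.0) connector, addressed by
# its MongoTableProvider class:
df_v10 = (spark.read
    .format("com.mongodb.spark.sql.connector.MongoTableProvider")
    .option("spark.mongodb.connection.uri", "mongodb://localhost")
    .option("spark.mongodb.database", "test")
    .option("spark.mongodb.collection", "myCollection")
    .load())

# Both DataFrames should see the same data, which makes it easy to
# unit test the new Connector against the old one:
assert df_v1.count() == df_v10.count()

Running both side by side like this lets you validate the new Connector against existing jobs before switching over.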
Also note a change in how the MongoDB Spark Connector is versioned. The current version of the existing MongoDB Spark Connector is 3.0. Until now, each release number was aligned with the version of Spark it supported—i.e., Version 2.4 of the MongoDB Spark Connector works with Spark 2.4. Going forward, this will not be the case: connector version numbers will no longer track Spark versions. MongoDB's documentation will state clearly which versions of Spark each release of the Connector supports.
Structured Streaming to MongoDB
Apache Spark comes with a stream processing engine called Structured Streaming, which is based on Spark's SQL engine and DataFrame APIs. Structured Streaming treats an incoming stream of data as a series of microbatches, continually appending each microbatch to the target dataset. This makes it easy to convert existing Spark batch jobs into streaming jobs. Structured Streaming achieves high throughput through the same distributed execution capabilities that have made Spark such a popular platform. In the following example, we’ll show you how to stream data to MongoDB using Structured Streaming.
Consider a CSV file that contains natural gas prices. The following PySpark code reads the CSV data as a stream, computes a 7-day moving average, and streams the results into MongoDB. The source directory, database, and collection names below are illustrative; replace them, along with the connection URI, with your own values.

from pyspark.sql.types import StructType, TimestampType, StringType, DoubleType
from pyspark.sql import functions as F

# Schema of the incoming CSV data: a timestamp, a price type, and a price
readSchema = (StructType()
    .add('Date', TimestampType())
    .add('Type', StringType())
    .add('Price', DoubleType()))

# Read CSV files from a directory as a stream (illustrative path)
ds = (spark
    .readStream
    .option('header', 'true')
    .schema(readSchema)
    .csv('natural-gas-prices/'))

# Compute the average price per type over a 7-day window
slidingWindows = (ds
    .withWatermark('Date', '1 minute')
    .groupBy(ds.Type, F.window(ds.Date, '7 day'))
    .avg('Price'))

# Stream the aggregated results into MongoDB (illustrative database
# and collection names)
dsw = (slidingWindows
    .writeStream
    .format('mongodb')
    .option('checkpointLocation', '/tmp/pyspark/')
    .option('spark.mongodb.connection.uri', 'MONGODB CONNECTION HERE')
    .option('spark.mongodb.database', 'Pricing')
    .option('spark.mongodb.collection', 'NaturalGas')
    .outputMode('complete'))

query = dsw.start()
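Once the streaming query is running, you can verify the results with a batch read through the same connector. A minimal sketch, assuming the stream wrote to a database and collection named Pricing and NaturalGas (illustrative names, with the connection URI again a placeholder):

# Batch-read the aggregated documents the streaming job wrote to MongoDB
avgPrices = (spark.read
    .format("mongodb")
    .option("spark.mongodb.connection.uri", "MONGODB CONNECTION HERE")
    .option("spark.mongodb.database", "Pricing")
    .option("spark.mongodb.collection", "NaturalGas")
    .load())

# Inspect the moving averages computed by the stream
avgPrices.show()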
For more information and examples on the new MongoDB Spark Connector V10.0, check out our documentation.