Part 1: The Modernization Journey with Exafluence and MongoDB

Prasad Pillalamarri, Paresh Saraf, Ravikiran Dharmavaram, and Richard Robins
December 9, 2020 | Updated: April 24, 2026

>> Announcement: Some features mentioned below will be deprecated on Sep. 30, 2025. Learn more.

Welcome to the first in a series of conversations between Exafluence and MongoDB about how our partnership can use open source tools and the application of data, artificial intelligence/machine learning, and natural language processing to power your business’s digital transformation.

In this installment, MongoDB Senior Partner Solutions Architect Paresh Saraf and Director for WW Partner Presales Prasad Pillalamarri sit down with Exafluence CEO Ravikiran Dharmavaram and exf Insights Co-Founder Richard Robins to discuss how to start the journey to build resilient, agile, and quick-to-market applications.

From Prasad Pillalamarri:

I first met Richard Robins, MD & Co-Founder of exf Insights at Exafluence, back in June 2016 at a MongoDB World event. Their approach to building data-driven applications fascinated me. Since then, Exafluence has grown by leaps and bounds in the system integration space, and MongoDB has outperformed its peers in the database market. So Paresh and I decided to interview Richard for a deep dive into their perspective on modernization with MongoDB.

Prasad & Paresh: We first met the Exafluence team in 2016. Since then, MongoDB has created the Atlas cloud data platform that now supports multi-cloud clusters and Exafluence has executed multiple projects on mainframe and legacy modernization. Could you share your perspective on the growth aspects and synergies of both companies from a modernization point of view?

Richard Robins: Paresh and Prasad, I’m delighted to share our views with you. We’ve always focused on what happens after you successfully offload read traffic from mainframes and legacy RDBMS to the cloud. That’s digital transformation and legacy app modernization. Early on, Exafluence made a bet that if the development community embraces something, we should, too. That’s how we locked in on MongoDB when we formed our company. Having earned our stripes in the legacy data world, we knew that getting clients to MongoDB would mean mining the often poorly documented IP contained in the legacy code.

That code is often where long-retired subject matter expert (SME) knowledge resides. To capture it, we built tools to scan COBOL/DB2 and stored procedures to reverse engineer the current state. This helps us move clients to a modern cloud native application, and it's an effective way to merge, migrate, and retire the legacy data stores all of our clients contend with.

Once we’d mined the IP with those tools, we needed to provide forward-engineered transformation rules to reach the new MongoDB Atlas endpoint. Using a metadata-driven approach, we built a rules catalog that included a full audit trail and a REST API to keep data governance programs and catalogs up to date as an additional benefit of our modernization efforts. We’ve curated these tools as exf Insights, and we bring them to each modernization project.

Essentially, we applied NLP, ML, and AI to data transformation to improve modernization analysts’ efficiency, and added a low- to no-code transformation rule builder, complete with version control and rollback capabilities.

Codeless Transform Rules for Quick Transformation

All this has resulted in our clients getting world-class, resilient capabilities at a lower cost in less time. We’re delighted to say that our modernization projects have been successful by following simple tenets — to embrace what the development community embraces and to offer as much help as possible — embodied in the accelerator tools we’ve built. That’s why we are so confident we'll continue our rapid growth.

P&P: How do you think re-architecting legacy applications with MongoDB as the core data layer will add value to your business?

RR: We believe that MongoDB Atlas will continue to be the developers’ go-to document database, and that we’ll see our business grow 200-300% over the next three years. With MongoDB Atlas and Realm we can provide clients with resilient, agile applications that scale, are easily upgraded, and are able to run on any cloud as well as the popular mobile iOS and Android devices. Digital transformation is key to remaining competitive and being agile going forward. With MongoDB Atlas, we can give our clients the same capabilities we all take for granted on our mobile apps: they’re resilient, easy to upgrade, usually real-time, scale via Kubernetes clusters, and can be rolled back quickly if necessary. Most importantly, they save our clients money and can be automatically deployed.

P&P: At a high level, how will Exafluence help customers take this journey?

RR: We’re unusual as a services firm in that we spend 20% of gross revenue on R&D, so our platform and approach are proven. Thus, relatively small teams for our healthcare, financial services, and Industry 4.0 clients can leverage our approach, platform, and tools to deliver advanced analytical systems that combine structured and unstructured data across multiple domains.

We built our exf Insights accelerator platform using MongoDB and designed it for interoperability, too. On projects we often encounter legacy ETL and messaging tools. To show how easy it is, we recently integrated exf Insights with SAP HANA and the SAP Data Intelligence platform. Further, we can publish JSON code blocks and provide Python code for integration into ETL platforms like Informatica and Talend.

Our approach is to reverse engineer by mining IP from legacy data estates and then forward engineer the target data estate, using these steps and tools:

Reverse Engineer

Extract stored procedures, business logic, and technical data from the legacy estate and load it into our platform.
Use our AI/ML/NLP algorithms to analyze business transformation logic and metadata, with outliers identified for cleansing.
Provide database scans that assess legacy data quality so outliers can be cleansed and corrected, along with tools for database-level data reconciliation.

Forward Engineer

To produce a clean set of metadata and business transformation logic, and baseline with version control, we:

Extract, transform, and load metadata to the target state.
Score metadata via NLP and ML to recommend matches to the analyst, who accepts, rejects, or overrides each recommendation.
Let analysts add further transformations, which are then cataloged.
Deploy and load cleansed data to the target state platform so any transformations and gold copies may be built.
Automate data governance via REST API and code block generation (Python/JSON) to provide enterprise catalogs with the latest transforms.
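
To make the scoring step above more concrete, here is a minimal Python sketch of how legacy field names might be scored against target field names so an analyst can accept, reject, or override each recommendation. It is an illustration only, not exf Insights code: it uses plain string similarity from Python's standard `difflib` module as a stand-in for the NLP/ML scoring described in the interview, and the field names, the `recommend` function, and the 0.6 threshold are all assumptions made for the example.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lower-case a field name and unify separators so CUST-ID resembles cust_id."""
    return name.lower().replace("-", "_").replace(" ", "_")


def score(source: str, target: str) -> float:
    """Similarity in [0, 1] between two field names (string-based stand-in for NLP/ML)."""
    return SequenceMatcher(None, normalize(source), normalize(target)).ratio()


def recommend(source_fields, target_fields, threshold=0.6):
    """For each source field, propose the best-scoring target field for analyst review."""
    recommendations = []
    for src in source_fields:
        best = max(target_fields, key=lambda tgt: score(src, tgt))
        best_score = round(score(src, best), 2)
        # Low scores are flagged rather than dropped: the analyst has the final say.
        status = "recommended" if best_score >= threshold else "needs_review"
        recommendations.append(
            {"source": src, "target": best, "score": best_score, "status": status}
        )
    return recommendations


# Hypothetical legacy (COBOL-style) and target (document-style) field names.
legacy = ["CUST-ID", "CUST-NAME", "ACCT-BAL"]
target = ["customer_id", "customer_name", "account_balance"]
matches = recommend(legacy, target)
for rec in matches:
    print(rec)
```

A real matcher would also look at data types, sample values, and previously accepted matches; the point here is only the shape of the workflow: score, recommend, then let a person decide.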

P&P: What are your keys to a successful transformation journey?

RR: Over the past several years we’ve identified these elements and observations:

Subject matter experts and technologists must work together to provide new solutions.
There’s a shortage of skilled technologists able to write, deploy, and securely manage next generation solutions. Using accelerators and transferring skills are vital to mitigating the skills shortage.
Existing IP that’s buried in legacy applications must be understood and mined in order for a modernization program to succeed.
A data-driven approach that combines reverse and forward engineering speeds migration and also provides new data governance and data science catalog capabilities.
The building, caring, and feeding of new, open source-enabled applications is markedly different from the way monolithic legacy applications were built. The document model enables analytics and interoperability.
Cybersecurity and data consumption patterns must be articulated and be part of the process, not afterthoughts.
Even with aggressive transformation plans, new technology must co-exist with legacy applications for some time; progress works best if it’s not a big bang.
Success requires business and technology to learn new ways to provide, acquire, and build agile solutions.

P&P: Can you talk about solutions you have which will accelerate the modernization journey for the customers?

RR: exf Insights helps our clients visualize what’s possible with extensive, pre-built, modular solutions for health care, financial services, and Industry 4.0. They show the power of MongoDB Atlas and also the power of speed layers using Spark and Confluent Kafka. These solutions are readily adaptable to client requirements and reduce the risk and time required to provide secure, production-ready applications.

Source data loading. Analyze and integrate raw structured and unstructured data, including support for reference and transactional data.
Metadata scan. Match data using AI/NLP, scoring results and providing side-by-side comparison.
Source alignment. Use ML to check underlying data and score results for analysts, and leverage that learning to accelerate future changes.
Codeless transformation. Empower data SMEs to build the logic with a multiple-sources-to-target approach and transform rules which support code value lookups and complex Boolean logic. Includes versioned gold copies of any data type (e.g., reference, transaction, client, product, etc.).
Deployment. Deploy for scheduled or event-driven repeatability and dynamically populate Snowflake or other repositories. Generate code blocks that are usable in your estate or via REST API.
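
The codeless transformation step can be pictured as rules stored as data rather than as hand-written code. The following Python sketch shows one hypothetical way to express a code value lookup and a Boolean condition as plain dictionaries and apply them to a source record. The rule format, the field names (`SEX_CD`, `STATUS`, `BALANCE`), and the `apply_rules` function are assumptions made for illustration and do not describe the actual exf Insights rule builder.

```python
import operator

# Comparison operators a rule may reference by name.
OPS = {"eq": operator.eq, "ne": operator.ne, "gt": operator.gt, "lt": operator.lt}

# Hypothetical rule format: each rule produces one target field.
RULES = [
    # Code value lookup: translate a legacy code into a readable value.
    {
        "target": "gender",
        "source": "SEX_CD",
        "lookup": {"M": "male", "F": "female"},
        "default": "unknown",
    },
    # Boolean logic: every (field, operator, value) condition must hold.
    {
        "target": "is_active",
        "all_of": [("STATUS", "eq", "A"), ("BALANCE", "gt", 0)],
    },
]


def apply_rules(record, rules):
    """Build a target record by applying each rule to the source record."""
    out = {}
    for rule in rules:
        if "all_of" in rule:
            out[rule["target"]] = all(
                OPS[op](record[field], value) for field, op, value in rule["all_of"]
            )
        else:
            raw = record[rule["source"]]
            lookup = rule.get("lookup")
            out[rule["target"]] = lookup.get(raw, rule.get("default")) if lookup else raw
    return out


source_record = {"SEX_CD": "F", "STATUS": "A", "BALANCE": 120.5}
print(apply_rules(source_record, RULES))
```

Because the rules are data, they can be stored, versioned, and reviewed by data SMEs without touching the code that executes them.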

We used the same five-step workflow that data scientists use when we enabled business analysts to accelerate the retirement of internal data stores to build and deploy the COVID-19 self-checking app in three weeks, including Active Directory integration and downloadable apps. We will be offering a Realm COVID-19 screening app on web, Android, and iOS to the entire MongoDB Atlas community in addition to our own clients.

The accelerator integrates key data governance tools, including exf Insights repository management of all sources and targets with versioned lineage; as-built transformation rules for internal and client implementations; and a business glossary integrated into metadata repositories.

P&P: Usually one of the key challenges for businesses is data being locked in silos.

RR: We couldn’t agree more. Our data modernization projects routinely integrate with source transactional systems that were never built to work together. We provide scanning tools to understand disparate data as well as ways to ingest, align, and stitch them together. Using health care as an example, exf Insights provides a comprehensive analytical capability, able to integrate data from hospitals, claims, pharmaceutical companies, patients, and providers. Some of this is non-SQL data, such as radiological images; for pharma companies we provide capabilities to support clinical research organizations (CROs) via a follow-the-molecule approach. Of course, we also have to work with and subscribe to Centers for Medicare & Medicaid Services (CMS) guidelines. Our data migration focuses on collecting the IP behind the data and making the source, logic, and any transformation rules available to our clients.

In financial services, it’s critical to understand source and targets. No matter how data is accessed (federated or direct store), with Spark and Kafka we can talk to just about any data repository.

P&P: Once we discover the data to be migrated, we need to model the data according to MongoDB’s data model paradigm. That requires multiple transformations before data is loaded to MongoDB. Can you explain more about how your accelerators help here?

RR: By understanding data consumption and then looking at existing data structures, we seek to simplify and then apply the capabilities of MongoDB’s document model. It’s not unlike what a data architect would do in the relational world, but with MongoDB Atlas it’s easier. We ourselves use MongoDB for our exf Insights platform to align, transform, and make data ready for consumption in new applications. We’re able to provide full rules lineage and audit trail, and even support rollback. For the real-time speed layer we use Spark and Kafka as well.
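
As a simple illustration of applying the document model to existing relational structures, the Python sketch below folds rows from two hypothetical tables (customers and their orders) into one document per customer, with the orders embedded as an array. The table shapes, field names, and the `build_customer_documents` function are assumptions for the example; whether to embed or reference related data in practice depends on how the application consumes it.

```python
from collections import defaultdict


def build_customer_documents(customers, orders):
    """Embed each customer's order rows inside a single customer document."""
    orders_by_customer = defaultdict(list)
    for order in orders:
        # The foreign key is dropped from the embedded order: nesting replaces the join.
        orders_by_customer[order["customer_id"]].append(
            {"order_id": order["order_id"], "total": order["total"]}
        )
    return [
        {
            "_id": customer["customer_id"],
            "name": customer["name"],
            "orders": orders_by_customer.get(customer["customer_id"], []),
        }
        for customer in customers
    ]


# Hypothetical rows as they might come out of two relational tables.
customers = [{"customer_id": 1, "name": "Ada"}, {"customer_id": 2, "name": "Grace"}]
orders = [
    {"order_id": 10, "customer_id": 1, "total": 25.0},
    {"order_id": 11, "customer_id": 1, "total": 40.0},
]
documents = build_customer_documents(customers, orders)
for doc in documents:
    print(doc)
```

With a driver such as PyMongo, a list of documents like this could then be loaded into a collection with `insert_many`; that step is left out here so the sketch runs without a database.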

This data-driven modernization approach also turns data governance into an active consumer of the rules catalog, so exf Insights works well for regulated industries.
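
One way to picture a rules catalog with lineage, an audit trail, and rollback is a store that never overwrites a rule, only appends new versions. The Python sketch below is a hypothetical in-memory version of that idea; the `RuleCatalog` class and its method names are invented for illustration and are not the exf Insights API.

```python
import copy


class RuleCatalog:
    """In-memory catalog that keeps every version of each transformation rule."""

    def __init__(self):
        self._versions = {}    # rule name -> list of definitions, oldest first
        self.audit_trail = []  # append-only log of who changed what

    def publish(self, name, definition, author, action="publish"):
        """Store a new version of a rule and record the change."""
        versions = self._versions.setdefault(name, [])
        versions.append(copy.deepcopy(definition))
        self.audit_trail.append(
            {"rule": name, "version": len(versions), "author": author, "action": action}
        )
        return len(versions)

    def current(self, name):
        """Return a copy of the latest version of a rule."""
        return copy.deepcopy(self._versions[name][-1])

    def rollback(self, name, to_version, author):
        """Re-publish an earlier version as the newest one, so history is never rewritten."""
        earlier = self._versions[name][to_version - 1]
        return self.publish(name, earlier, author, action=f"rollback to v{to_version}")


catalog = RuleCatalog()
catalog.publish("gender", {"lookup": {"M": "male", "F": "female"}}, author="analyst_a")
catalog.publish("gender", {"lookup": {"M": "m", "F": "f"}}, author="analyst_b")
catalog.rollback("gender", to_version=1, author="analyst_a")
print(catalog.current("gender"))
for entry in catalog.audit_trail:
    print(entry)
```

Treating a rollback as a new version keeps the audit trail complete, which is the property a governance program would consume.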

P&P: It’s great that we have data migrated now. Consider a scenario where it’s a mainframe application and we have lots of COBOL code in there. It has to be moved to a new programming language like Python, with a change in the data access layer to point to MongoDB. Do you have accelerators which can facilitate the application migration? If so, how?

RR: Yes, we do have accelerators that understand the COBOL syntax to create JSON and ultimately Java, which speeds modernization. We also found we had to reverse engineer stored procedures as part of our client engagements for Exadata migration.
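
To give a flavor of what turning COBOL syntax into JSON can look like, the Python sketch below parses a few elementary items from a hypothetical copybook and emits a JSON field list. It is a deliberately tiny illustration, not the accelerator described above: it only recognizes simple `PIC X(n)` and `PIC 9(n)` clauses, and the copybook, the regular expression, and the `copybook_to_fields` function are assumptions for the example.

```python
import json
import re

# Matches elementary items such as "05  CUST-ID  PIC 9(6)."
FIELD = re.compile(r"^\s*(\d{2})\s+([A-Z0-9-]+)\s+PIC\s+([X9])\((\d+)\)\.\s*$")


def copybook_to_fields(copybook: str):
    """Extract simple PIC X(n) / PIC 9(n) items as JSON-friendly field definitions."""
    fields = []
    for line in copybook.splitlines():
        match = FIELD.match(line)
        if not match:
            continue  # group items and unsupported clauses are skipped in this sketch
        _level, name, kind, length = match.groups()
        fields.append(
            {
                "name": name.lower().replace("-", "_"),
                "type": "string" if kind == "X" else "integer",
                "length": int(length),
            }
        )
    return fields


# Hypothetical copybook fragment.
COPYBOOK = """
       01  CUSTOMER-RECORD.
           05  CUST-ID        PIC 9(6).
           05  CUST-NAME      PIC X(30).
           05  BRANCH-CODE    PIC X(4).
"""

fields = copybook_to_fields(COPYBOOK)
print(json.dumps(fields, indent=2))
```

Real copybooks add signed and packed decimals, `OCCURS`, `REDEFINES`, and nested groups, which is where purpose-built tooling earns its keep.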

P&P: Once we migrate the data from legacy databases to MongoDB, validation is the key step. As this is a heterogeneous migration it can be challenging. How can Exafluence add value here?

RR: We’ve built custom accelerators that migrate data from the RDBMS world to MongoDB, and offer data comparisons as clients go from development to testing to production, documenting all data transformations along the way.
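
A common way to compare data across a heterogeneous migration is to fingerprint each record on both sides and diff the results by key. The Python sketch below illustrates that idea with plain dictionaries standing in for source rows and migrated documents; the `fingerprint` and `reconcile` functions and the sample data are assumptions for the example, not the custom accelerators mentioned above.

```python
import hashlib
import json


def fingerprint(record, fields):
    """Hash the chosen fields of a record in a canonical, order-independent form."""
    canonical = json.dumps(
        {field: record.get(field) for field in fields}, sort_keys=True, default=str
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def reconcile(source_rows, target_docs, key, fields):
    """Report keys that are missing, unexpected, or different after migration."""
    source = {row[key]: fingerprint(row, fields) for row in source_rows}
    target = {doc[key]: fingerprint(doc, fields) for doc in target_docs}
    return {
        "missing_in_target": sorted(source.keys() - target.keys()),
        "unexpected_in_target": sorted(target.keys() - source.keys()),
        "mismatched": sorted(
            k for k in source.keys() & target.keys() if source[k] != target[k]
        ),
    }


# Hypothetical source rows and migrated documents.
source_rows = [
    {"id": 1, "name": "Ada", "balance": 10.5},
    {"id": 2, "name": "Grace", "balance": 20.0},
    {"id": 3, "name": "Linus", "balance": 30.0},
]
target_docs = [
    {"id": 1, "name": "Ada", "balance": 10.5},
    {"id": 2, "name": "Grace", "balance": 25.0},
    {"id": 4, "name": "Margaret", "balance": 5.0},
]
report = reconcile(source_rows, target_docs, key="id", fields=["name", "balance"])
print(report)
```

Running the same comparison at each promotion (development, testing, production) gives a repeatable check that documented transformations are the only differences between source and target.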

P&P: Now that we’ve talked about all your tools which can help in the modernization journey, can you tell us about how you already helped your customers to achieve this?

RR: Certainly. We’ve already outlined how we’ve created solution starters for modernization, with sample solutions as accelerators. But that’s not enough; our key tenet for successful modernization projects is pairing SMEs and developers. That’s what enables our joint client and Exafluence teams to understand the business, key regulations, and technical standards.

Our data-driven focus lets us understand the data regardless of industry vertical. We’ve successfully used exf Insights now in financial services, healthcare, and Industry 4.0. Whether it’s understanding the nuances of financial instruments and data sources for reference and transactional data, or medical device IoT sensors in healthcare, or shop floor IoT and PLC data for predictive analytics and digital twin modeling, a data-driven approach reduces modernization risks and costs.

Below are some of the possibilities this data-driven approach has delivered for our healthcare clients using MongoDB Atlas. By aggregating provider, membership, claims, pharma, and EHR clinical data, we offer robust reporting that:

Transforms health care data from its raw form into actionable insights that improve member care quality, health outcomes, and satisfaction
Provides FHIR support
Surfaces trends and patterns in claims, membership, and provider data
Lets users access, visualize, and analyze data from different sources
Tracks provider performance and identifies operational inefficiencies

P&P: Thank you, Richard!

Keep an eye out for upcoming conversations in our series with Exafluence, where we'll be talking about agility in infrastructure and data as well as interoperability.

MongoDB and Modernization

To learn more about MongoDB's overall Modernization strategy, read here.
