Building AI with MongoDB: Retrieval-Augmented Generation (RAG) Puts Power in Developers’ Hands

Mat Keep
November 28, 2023 | Updated: April 21, 2025

As recently as 12 months ago, any mention of retrieval-augmented generation (RAG) would have left most of us confused. However, with the explosion of generative AI, the RAG architectural pattern has now firmly established itself in the enterprise landscape.

RAG presents developers with a potent combination. They can take the reasoning capabilities of pre-trained, general-purpose LLMs and feed them with real-time, company-specific data. As a result, developers can build AI-powered apps that generate outputs grounded in enterprise data and knowledge that is accurate, up-to-date, and relevant. They can do this without having to turn to specialized data science teams to either retrain or fine-tune models — a complex, time-consuming, and expensive process.

Over this series of Building AI with MongoDB blog posts, we’ve featured developers using tools like MongoDB Atlas Vector Search for RAG in a whole range of applications. Take a look at our AI case studies page and you’ll find examples spanning conversational AI with chatbots and voice bots, co-pilots, threat intelligence and cybersecurity, contract management, question-answering, healthcare compliance and treatment assistants, content discovery and monetization, and more.

Further reflecting its growing adoption, Retool’s State of AI survey from a couple of weeks ago shows Atlas Vector Search earning the highest net promoter score (NPS) among developers.

Check out our AI Learning Hub to learn more about building AI-powered apps with MongoDB.

In this blog post, I’ll highlight three more interesting and novel use cases:

Unlocking geological data for better decision-making and accelerating the path to net zero at Eni
Video and audio personalization at Potion
Unlocking insights from enterprise knowledge bases at Kovai

Eni makes terabytes of subsurface unstructured data actionable with MongoDB Atlas

Based in Italy, Eni is a leading integrated energy company with more than 30,000 employees across 69 countries. In 2020, the company launched a strategy to reach net zero emissions by 2050 and develop more environmentally and financially sustainable products.

Sabato Severino, Senior AI Solution Architect for Geoscience at Eni, explains the role of his team: “We’re responsible for finding the best solutions in the market for our cloud infrastructure and adapting them to meet specific business needs.”

Projects include using AI for drilling and exploration, leveraging cloud APIs to accelerate innovation, and building a smart platform to promote knowledge sharing across the company. Eni’s document management platform for geosciences offers an ecosystem of services and applications for creating and sharing content. It leverages embedded AI models to extract information from documents and stores unstructured data in MongoDB.

The challenges for Severino’s team were to maintain the platform as it ingested a growing volume of data — hundreds of thousands of documents and terabytes of data — and to enable different user groups to extract relevant insights from comprehensive records quickly and easily.

With MongoDB Atlas, Eni users can quickly find data spanning multiple years and geographies to identify trends and analyze models that support decision-making within their fields. The platform uses MongoDB Atlas Search to filter out irrelevant documents while also integrating AI and machine learning capabilities, such as vector search, to make it even easier to identify patterns.

“The generative AI we’ve introduced currently creates vector embeddings from documents, so when a user asks a question, it retrieves the most relevant document and uses LLMs to build the answer,” explains Severino.

“We’re looking at migrating vector embeddings into MongoDB Atlas to create a fully integrated, functional system. We’ll then be able to use Atlas Vector Search to build AI-powered experiences without leaving the Atlas platform — a much better experience for developers.”
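The flow Severino describes (embed the documents, retrieve the one closest to the question, then hand it to an LLM) can be sketched in a few lines of Python. This is a minimal illustration, not Eni’s implementation: the `embed` function is a toy bag-of-words stand-in for a real embedding model, the sample documents are invented, and the final LLM call is left out so that the sketch runs with only the standard library.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: a bag-of-words term-count vector (stand-in for a real model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question: str, documents: list[str], k: int = 1) -> list[str]:
    """Return the k documents whose embeddings are most similar to the question."""
    query_vec = embed(question)
    ranked = sorted(documents, key=lambda d: cosine(query_vec, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(question: str, context: list[str]) -> str:
    """Assemble the grounded prompt that would be sent to an LLM."""
    joined = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {question}"


# Hypothetical sample documents, invented for illustration only.
DOCUMENTS = [
    "Seismic survey report for the offshore basin, acquired in 2015.",
    "Well log summary describing porosity measurements in sandstone layers.",
    "Safety procedures for drilling rig maintenance crews.",
]

if __name__ == "__main__":
    question = "What porosity was measured in the sandstone?"
    print(build_prompt(question, retrieve(question, DOCUMENTS)))
```

In a production system, the in-memory ranking would give way to a vector index, and the assembled prompt would be sent to the LLM of your choice.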

Read the full case study to learn more about Eni and how it is making unstructured data actionable.

Video personalization at scale with Potion and MongoDB

Potion enables salespeople to personalize prospecting videos at scale. More than 7,500 sales professionals at companies including SAP, AppsFlyer, CaptivateIQ, and Opensense already use SendPotion to increase response rates, book more meetings, and build customer trust.

All a sales representative needs to do is record a video template, select which words need to be personalized, and let Potion’s audio and vision AI models do the rest. Kanad Bahalkar, co-founder and CEO at Potion, explains:

“The sales rep tells us what elements need to be personalized in the video — that is typically provided as a list of contacts with their name, company, desired call-to-action, and so on. Our vision and audio models then inspect each frame and reanimate the video and audio with personalized messages lip-synced into the stream. Reanimation is done in bulk in minutes. For example, one video template can be transformed into over 1,000 unique video messages, personalized to each contact.”

Potion’s custom generative AI models are built with PyTorch and TensorFlow, and run on Amazon SageMaker. Describing their models, Kanad says, “Our vision model is trained on thousands of different faces, so we can synthesize the video without individualized AI training. The audio models are tuned on-demand for each voice.”

And where does the data for the AI lifecycle live? “This is where we use MongoDB Atlas,” says Kanad.

“We use the MongoDB database to store metadata for all the videos, including the source content for personalization, such as the contact list and calls to action. For every new contact entry created in MongoDB, a video is generated for it using our AI models, and a link to that video is stored back in the database. MongoDB also powers all of our application analytics and intelligence. With the insights we generate from MongoDB, we can see how users interact with the service, capturing feedback loops, response rates, video watch times, and more. This data is used to continuously train and tune our models in SageMaker.”
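The loop Kanad describes (a new contact entry appears, a video is generated for it, and the link is written back to the same record) can be sketched as follows. This is an illustration under stated assumptions rather than Potion’s code: the `Contact` fields, the `fake_render` stand-in for the AI models, and the `example.com` links are all hypothetical, and a plain Python list takes the place of a MongoDB collection so the sketch runs on its own.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Contact:
    """Hypothetical contact record; field names are illustrative assumptions."""
    name: str
    company: str
    call_to_action: str
    video_url: Optional[str] = None  # filled in once a video has been generated


def fake_render(template_id: str, contact: Contact) -> str:
    """Stand-in for the AI rendering step; returns a made-up link."""
    slug = f"{contact.name}-{contact.company}".lower().replace(" ", "-")
    return f"https://example.com/videos/{template_id}/{slug}"


def process_new_contacts(
    contacts: list[Contact],
    template_id: str,
    render: Callable[[str, Contact], str] = fake_render,
) -> int:
    """Generate a video for every contact without one and store the link back."""
    generated = 0
    for contact in contacts:
        if contact.video_url is None:
            contact.video_url = render(template_id, contact)
            generated += 1
    return generated
```

Because contacts that already carry a link are skipped, running the function twice over the same list generates nothing the second time, which is the property you would want from any event-driven version of this step.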

On selecting MongoDB, Kanad says, “I had prior experience of MongoDB and knew how easy and fast it was to get started for both modeling and querying the data. Atlas provides the best-managed database experience out there, meaning we can safely offload running the database to MongoDB. This ease-of-use, speed, and efficiency are all critical as we build and scale the business.”

To further enrich the SendPotion service, Kanad is planning to use more of the developer features within MongoDB Atlas. This includes Atlas Vector Search to power AI-driven semantic search and RAG for users who are exploring recommendations across video libraries. The engineering team is also planning on using Atlas Triggers to enable event-driven processing of new video content.

Potion is a member of the MongoDB AI Innovators program. Asked about the value of the program, Kanad responds, “Access to free credits helped support rapid build and experimentation on top of MongoDB, coupled with access to technical guidance and support.”

Bringing the power of Vector Search to enterprise knowledge bases

Founded in 2011, Kovai is an enterprise software company that offers multiple products in both the enterprise and B2B SaaS arenas. Since its founding, the company has grown to nearly 300 employees serving over 2,500 customers.

One of Kovai’s key products is Document360, a knowledge base platform for SaaS companies looking for a self-service software documentation solution. Seeing the rise of GenAI, Kovai began developing its AI assistant, “Eddy.” The assistant answers customers’ questions using LLMs augmented with information retrieved from a Document360 knowledge base.

During the development phase, Kovai’s engineering and data science teams explored multiple vector databases to power the RAG portion of the application. They found that the need to sync data between the system-of-record MongoDB database and a separate vector database introduced inaccuracies in the assistant’s answers.

The release of MongoDB Atlas Vector Search provided a solution with three key advantages for the engineers:

Architectural simplicity: Atlas Vector Search simplifies the technical architecture Kovai needs to implement Eddy.
Operational efficiency: Atlas Vector Search allows Kovai to store both knowledge base articles and their embeddings together in MongoDB collections, eliminating the “data syncing” issues that come with other vendors.
Performance: Kovai gets faster query responses from Atlas Vector Search at scale, ensuring a positive user experience.
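To make the “articles and embeddings together” point concrete, here is a minimal sketch of what such a document and a matching query might look like. The field names (`title`, `body`, `embedding`), the index name `vector_index`, and the candidates-per-result multiplier are assumptions chosen for illustration; they are not Kovai’s schema. The query uses the `$vectorSearch` aggregation stage, and the functions only build plain Python dictionaries, so nothing here needs a running cluster.

```python
from typing import Any


def make_article_doc(title: str, body: str, embedding: list[float]) -> dict[str, Any]:
    """Keep an article and its embedding in one document (hypothetical field names)."""
    return {"title": title, "body": body, "embedding": embedding}


def build_vector_search_pipeline(
    query_vector: list[float],
    limit: int = 5,
    index: str = "vector_index",  # assumed index name
    path: str = "embedding",      # assumed field holding the vector
    candidates_per_result: int = 20,
) -> list[dict[str, Any]]:
    """Build an aggregation pipeline that uses the $vectorSearch stage."""
    if limit < 1:
        raise ValueError("limit must be at least 1")
    return [
        {
            "$vectorSearch": {
                "index": index,
                "path": path,
                "queryVector": query_vector,
                # Consider more candidates than results to improve recall.
                "numCandidates": limit * candidates_per_result,
                "limit": limit,
            }
        },
        {
            # Return the article text alongside its similarity score.
            "$project": {
                "_id": 0,
                "title": 1,
                "body": 1,
                "score": {"$meta": "vectorSearchScore"},
            }
        },
    ]
```

With PyMongo and an Atlas cluster that has a vector search index on the embedding field, the resulting list could be passed to `collection.aggregate(...)`. Since the text and the vector live in the same document, the query returns the article content directly, with no second lookup in another system.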

“Atlas Vector Search is robust, cost-effective, and blazingly fast!” said Saravana Kumar, CEO of Kovai, when speaking about his team’s experience.

Specifically, the team has seen average times of between two and four milliseconds to return three, five, and 10 chunks. If the question is a closed loop, the average time drops to less than two milliseconds.

You can learn more about Kovai’s journey into the world of RAG in the full case study.

Getting started

As the case studies in our Building AI with MongoDB series demonstrate, retrieval-augmented generation is a key design pattern developers can use as they build AI-powered applications for the business. Take a look at our Embedding Generative AI whitepaper to explore RAG in more detail.

Head over to our quick-start guide to get started with Atlas Vector Search today.

