Powering Long-Term Memory for Agents With LangGraph and MongoDB

Prakul Agarwal and Thibaut Gourdel
August 20, 2025

We're excited to introduce the MongoDB Store for LangGraph—a powerful integration that brings flexible and scalable long-term memory to AI agents. This new integration between MongoDB and LangGraph, LangChain’s open-source agent orchestration framework, allows agents to remember and build on previous interactions across multiple sessions instead of only retaining memory for the current session. The result is more intelligent, context-aware agentic systems that learn and improve over time.

This new integration complements MongoDB’s existing checkpointer integration, which handles short-term memory and persistent conversation history. Together, the MongoDB Store for LangGraph and MongoDB’s support for checkpointers provide a complete solution for building production-ready, memory-enabled agents.

The need for agent memory

An AI agent is a system designed to take actions or make decisions based on input, often using tools and reasoning to complete tasks. By default, agents don’t retain memory between conversations, which severely constrains what they can accomplish.

Agent memory (and memory management) is a computational exocortex for AI agents. It is a dynamic, systematic process that integrates an agent’s large language model (LLM) memory (context window and parametric weights) with a persistent memory management system to encode, store, retrieve, and synthesize knowledge and experiences.

Agent memory is typically divided into two main types: short-term memory and long-term memory. In a memory context, LangGraph uses “threads” to represent individual conversations or sessions. Short-term memory, managed through thread-scoped checkpointers that MongoDB supports, maintains context within a given session. While this preserves conversation continuity and manages history, it doesn’t help agents learn continuously from the past across different conversations to adapt and optimize their behavior over time. This is why we introduced the MongoDB Store for LangGraph, enabling your agents to retain memories across conversations through a cross-thread memory store.

Figure 1. Short and long-term memory integration between LangGraph and MongoDB.

The top left of this diagram is the AI agent, which sends and receives data from MongoDB Atlas with two lines titled shared across threads and thread-level context. Atlas contains Stores Optimized Search, Checkpointers, and TTL Indexes. The bottom of the diagram has a line that shows langGraph + langgraph-checkpoint-mongodb + langgraph-store-mongodb.

MongoDB Store: Enabling cross-thread long-term memory

The new langgraph-store-mongodb package introduces a MongoDBStore class. Available now through PyPI, this production-ready integration provides:

Cross-thread persistence: Store and recall information across different conversation sessions and user interactions, allowing agents to build on previous knowledge.
Native JSON structure: LangGraph stores long-term memories as JSON documents, which map directly to MongoDB documents. Each memory is organized using namespaces and a key-value structure. This enables the usage of MongoDB’s native and optimized data formats and search capabilities for efficient retrieval.
Vector Search capabilities: Leverage MongoDB Atlas Vector Search for semantic memory retrieval based on meaning, not just keyword matching.
Asynchronous support: Support for both synchronous and asynchronous operations for high-performance applications.
Automatic connection management: Robust connection pooling and error handling to ensure reliability.
Optimized TTL indexes: MongoDB’s Time-to-Live (TTL) indexes are integrated with LangGraph’s TTL system, allowing automatic removal of stale or outdated data. This improves retrieval performance, reduces storage costs, and ensures the system "forgets" obsolete memories efficiently.

Ready to give your AI agents persistent long-term memory? The langgraph-store-mongodb package is available now:

pip install langgraph-store-mongodb

The MongoDB Store for LangGraph enables developers to build more powerful agents for different use cases, including:

Customer support agents: Build agents that remember customer preferences, past issues, and resolution patterns across multiple support channels.
Personal assistant applications: Build agents that learn user habits and preferences to provide increasingly personalized experiences.
Enterprise knowledge management: Create agents that accumulate organizational knowledge and can retrieve relevant information semantically.
Multi-agent systems: Enable agent teams to share learned experiences and coordinate through persistent memory.

Why MongoDB for agent memory?

Effective agentic memory requires comprehensive mechanisms for storing, retrieving, updating, and deleting memories. MongoDB Atlas provides a unified database that meets all these complex requirements:

Flexible document model: Store complex, nested memories as rich JSON, matching how agents naturally read, organize, and update evolving information.
Semantic search: Native vector search enables retrieval by meaning, not just exact matches.
State-of-the-art models: Voyage AI provides embedding models and rerankers for cutting-edge memory retrieval.
Scalable architecture: Distributed architecture, workload isolation, autoscaling, and automatic sharding capabilities for scaling AI agent memory.
Enterprise security: Fine-grained role-based access control (RBAC) allows precise management of both access scope (specific services or databases) and access type (read-only or read-write).

MongoDB Atlas and LangChain: A complete solution for AI agent memory

Short-term memory provides an agent with immediate context, current conversation state, prior exchanges within that session, or shared memory for coordination in multi-agent systems. The most common form of short-term memory is working memory—an active, temporary context accessible during a session. MongoDB's integration with LangGraph checkpointers supports this by persisting and restoring conversation states.

Other short-term memory implementations include semantic caches, such as using MongoDB's semantic cache integration with LangChain, which stores recent prompts and LLM responses for retrieval when similar queries occur. Shared memory is also used in multi-agent systems to provide a common space for coordination and information sharing.

Long-term memory serves as the agent’s knowledge base, storing diverse kinds of information for future use. It includes several functional types, each requiring specific storage and retrieval strategies:

Episodic memory: captures specific events and interactions, such as conversation history or summaries of key occurrences with metadata (e.g., timestamps, participants). For instance, a customer support agent can use this to recall a user’s past issues and offer personalized responses.
Procedural memory: records instructions or rules for recurring tasks. A typical implementation is a social content generator agent that remembers past feedback on writing style and formatting to improve its process.
Semantic memory: remembers general knowledge, facts, and concepts. This is often implemented through retrieval-augmented generation (RAG), where data is stored as vector embeddings and retrieved based on semantic similarity.
Associative memory: stores key entities and relationships between different pieces of information, enabling an agent to identify patterns and make inferences by navigating these connections. It's often implemented using graph structures that support efficient exploration of relationships. One practical approach is GraphRAG.

The MongoDB Store for LangGraph supports these memory types through flexible filtering and semantic search, making it a versatile approach for building reliable long-term memory in agents.

LangChain also provides LangMem, a toolkit featuring pre-built tools designed specifically for extracting and managing procedural, episodic, and semantic memories. LangMem integrates natively with LangGraph, streamlining the memory engineering process. For developers seeking a straightforward approach to using various memory types with MongoDB, explore this comprehensive tutorial for implementing MongoDB alongside LangGraph and LangMem.

The future of intelligent agents

With the new MongoDB Store for LangGraph, we're enabling developers to build AI agents that can learn and adapt. Agents that remember user preferences, learn from mistakes, and build knowledge over time will transform how we interact with AI systems.

The combination of LangGraph's sophisticated orchestration capabilities with MongoDB's flexible, scalable storage creates unprecedented opportunities for building intelligent, persistent AI agents that feel truly alive and responsive.

Ready to build memory-enabled agents with LangGraph and MongoDB Atlas? Get started with the documentation.

← Previous

Constitutional AI: Ethical Governance with MongoDB Atlas

As AI systems become increasingly powerful and pervasive, the question isn't whether we need ethical guardrails—it's how to implement them effectively at scale. Traditional approaches to AI safety often rely heavily on human oversight and manual review processes that don't scale with the complexity and volume of modern AI applications. Enter Constitutional AI (CAI), a groundbreaking approach developed by Anthropic that enables AI models to self-govern using predefined ethical principles [1]. When combined with MongoDB's robust data governance capabilities, Constitutional AI offers developers a practical framework for building responsible AI systems that maintain both ethical compliance and operational efficiency [7]. In this article, we'll explore how to architect and implement this powerful combination. Understanding constitutional AI: Beyond human oversight Constitutional AI represents a fundamental shift in how we approach AI alignment [1]. Instead of relying exclusively on Reinforcement Learning from Human Feedback (RLHF), Constitutional AI enables models to autonomously evaluate and improve their outputs against a predefined "constitution" of ethical principles [2]. Figure 1. Constitutional AI process flow. The process operates through two complementary phases: Supervised self-critique: The model generates an initial response, evaluates it against constitutional rules (such as UN human rights principles or domain-specific ethical guidelines), and revises accordingly. These self-corrections create high-quality training data for further refinement [1]. Reinforcement learning from AI feedback (RLAIF): A preference model, trained on AI-generated critiques rather than human feedback, guides the system toward constitutional compliance while maintaining performance [1]. What makes this approach particularly powerful is its integration of chain-of-thought reasoning [3]. The model doesn't just follow rules—it explains its ethical decisions in natural language, making the alignment process transparent and auditable. Research demonstrates that Constitutional AI achieves what's known as a Pareto improvement: it increases harmlessness without sacrificing helpfulness, particularly in large-scale models [1]. This represents a significant advancement over traditional safety measures that often create performance trade-offs. The data governance challenge While Constitutional AI provides the framework for ethical decision-making, implementing it at scale requires a robust data governance infrastructure [7]. Consider the challenges involved: Sensitive rule storage: Constitutional principles often contain sensitive information about organizational values, regulatory requirements, and ethical boundaries that must be protected. Audit requirements: Every AI decision needs to be traceable for compliance and debugging purposes, especially under emerging regulations like the EU AI Act and frameworks like NIST AI RMF [15]. Real-time monitoring: Systems must detect and respond to potential violations as they occur. Access control: Different stakeholders need different levels of access to governance data and decision logs. This is where MongoDB's governance-ready features become essential. The platform provides the infrastructure needed to implement Constitutional AI securely and at scale [7]. MongoDB's governance arsenal MongoDB Atlas offers several critical capabilities that align perfectly with Constitutional AI requirements, now enhanced by Voyage AI and its industry-leading embedding models joining MongoDB: Figure 2. MongoDB’s governance infrastructure Role-based access control (RBAC) Constitutional AI systems require careful access management. RBAC ensures that only authorized personnel can modify constitutional rules, while allowing AI systems the necessary read access for decision-making [7]. This principle of least privilege is fundamental to secure governance implementation. Change streams and comprehensive auditing Every modification to constitutional rules, every AI decision, and every system interaction can be captured in real-time through Change Streams [7]. This creates an immutable audit trail essential for compliance and system debugging, particularly important for meeting GDPR, HIPAA, and other regulatory requirements [12]. Enhanced vector search capabilities through Voyage AI integration Using MongoDB Atlas Vector Search together with Voyage AI's advanced embedding models, which are also preferred by Anthropic, could establish a transformative foundation for constitutional rule retrieval systems. By leveraging this integrated approach, organizations would be able to implement semantic matching capabilities that complement traditional text search methods. While exact-match text search remains highly efficient for precise keyword queries and specific rule lookups, vector search excels at understanding context and intent behind queries. Combining both approaches through MongoDB Atlas Hybrid Search, which merges the precision of text search with the semantic understanding of vector search, enables optimal accuracy in identifying potential constitutional violations, ensuring no relevant rules are missed regardless of how they're queried. [22] Advanced retrieval and domain specialization Voyage AI's sector-specific models could deliver measurable improvements in constitutional rule application. The voyage-law-2 model, which exhibits 1.7x enhanced retrieval accuracy on legal benchmarks, would be particularly valuable for constitutional frameworks within regulated industries.[19] For complex constitutional documents exceeding standard context limits, Voyage AI's newly launched voyage-context-3 offers a breakthrough solution. This contextualized chunk embedding model captures full document context automatically without manual metadata augmentation, while serving as a drop-in replacement for existing embeddings. The model demonstrates substantial performance gains across retrieval tasks: 14.24% better chunk-level and 12.56% better document-level retrieval than OpenAI-v3-large. 20.54% improvement over contextual retrieval methods on chunk-level tasks. 99.48% reduction in vector database costs while maintaining retrieval accuracy through advanced quantization [23]. This advancement is particularly critical for constitutional applications where cross-referential legal provisions require both granular detail and broader contextual understanding. The model's reduced sensitivity to chunking strategies ensures consistent interpretation of constitutional principles across varying document structures, eliminating the traditional tradeoff between focused detail and global context that has challenged constitutional document processing. The constitutional rule application process could benefit from Voyage AI's sophisticated two-stage retrieval pipeline: Semantic discovery phase: Embedding-based search would identify potentially relevant constitutional principles through semantic similarity analysis. Contextual prioritization phase: Advanced reranking models (rerank-2.5 and rerank-2.5-lite) could evaluate and prioritize constitutional rules based on situational relevance [18, 24]. Precision application: This dual-phase approach would ensure optimal ethical guideline selection for each decision context. Performance optimization and efficiency Voyage AI's quantization capabilities could deliver substantial operational benefits, potentially achieving up to 90% computational efficiency improvements while preserving constitutional evaluation accuracy. Organizations might process 5x more documents using equivalent computational resources by converting standard 32-bit floating-point embeddings to optimized int8 or binary representations, potentially reducing vector storage costs by up to 83%.[20] Additionally, Matryoshka Representation Learning could enable real-time precision-performance adjustments, allowing systems to modify embedding dimensions based on computational constraints while maintaining semantic integrity essential for accurate constitutional evaluation [21]. Contextual understanding and multi-modal governance The proposed system could demonstrate sophisticated contextual understanding—for instance, when healthcare AI systems need to evaluate patient privacy considerations, Voyage's embeddings could accurately distinguish between different privacy principles based on specific medical contexts. The voyage-3-large model's superior performance across eight evaluated domains and 100 datasets suggests strong potential for this application [17]. The voyage-multimodal-3 model's ability to process interleaved text and image data could provide essential capabilities for constitutional AI systems that must evaluate the full semantic meaning of visual content alongside textual information (e.g., charts with legends or graphics with captions). This would prove particularly valuable for content moderation or medical imaging applications requiring comprehensive ethical oversight. Architectural blueprint: Integrating Constitutional AI with MongoDB Let's examine how to structure a Constitutional AI system using MongoDB as the governance backbone. Schema design for governance Your MongoDB collections should reflect the core components of Constitutional AI: // Constitutional Rules Collection { "_id": ObjectId("..."), "rule_id": "fair_treatment_01", "category": "fairness", "description": "Ensure equitable treatment across all demographic groups", "principle": "No decision should disproportionately impact protected classes", "vector": [...], // Voyage AI embedding of the principle "threshold": { "disparity_limit": 0.05, "confidence_required": 0.95 }, "severity": "critical", "applicable_domains": ["lending", "hiring", "healthcare"], "created_by": "governance_team", "last_modified": ISODate("2024-01-15T10:30:00Z") } // Decision Logs Collection { "_id": ObjectId("..."), "decision_id": "dec_20240115_001", "model_version": "constitutional_ai_v2.1", "input_hash": "sha256_hash_of_input", "initial_response": "Initial AI response", "constitutional_review": { "rules_evaluated": ["fair_treatment_01", "privacy_protection_02"], "violations_detected": [], "reasoning": "Chain-of-thought explanation of review process" }, "final_response": "Revised response after constitutional review", "metadata": { "processing_time_ms": 245, "user_id": "encrypted_user_identifier", "session_id": "session_12345" }, "timestamp": ISODate("2024-01-15T14:22:30Z") } Figure 3. Comprehensive data schema. Real-time governance implementation The integration operates through several key components: Pre-decision screening: Before generating responses, the system queries constitutional rules relevant to the current context using MongoDB's indexing and vector search capabilities. Real-time evaluation: As the AI model generates responses, each output is evaluated against applicable constitutional principles, with results logged to MongoDB in real-time. Violation detection and response: Change Streams monitor for constitutional violations, triggering immediate alerts and remediation workflows. For example, when the system detects potential exposure of sensitive data, such as PII, API keys, or confidential business information, it immediately quarantines the response and routes it to security teams for review. The flagged content is held in a secure collection while reviewers verify whether the information can be safely shared, redacted, or must be blocked entirely, with their decisions feeding back into the constitutional ruleset for continuous improvement [9]. Real-world implementation scenario Consider a financial services company implementing Constitutional AI for loan approval recommendations: Constitutional rule implementation { "rule_id": "lending_fairness_core", "category": "algorithmic_fairness", "description": "Prevent discriminatory lending practices", "principle": "Loan recommendations must show no significant disparate impact across protected demographic groups", "implementation": { "protected_attributes": ["race", "gender", "age", "marital_status"], "disparity_threshold": 0.04, "sample_size_minimum": 1000, "statistical_test": "chi_square_independence" }, "remediation": { "immediate_action": "flag_for_review", "escalation_threshold": 3, "notification_recipients": ["compliance_team", "model_ops"] } } Figure 4. Real-world implementation - financial services. Governance workflow Input processing: Loan application data is encrypted using Queryable Encryption, enabling secure storage while maintaining the ability to query and analyze sensitive fields for governance purposes. Constitutional review: The AI model evaluates its recommendation against fairness principles. Decision logging: All AI-generated recommendations, along with human reviewer approvals or modifications, are captured in MongoDB with complete audit trails. Monitoring: Change Streams detect patterns that might indicate systematic bias, automatically flagging anomalies for human review. When bias indicators exceed thresholds, loan officers are notified to manually review affected decisions and can override AI recommendations. Reporting: Aggregation pipelines generate compliance reports for regulators. Privacy and security Throughout this process, sensitive customer data remains encrypted, constitutional rules are protected through RBAC, and all activities are audited for compliance purposes. Performance and scalability considerations Implementing Constitutional AI with MongoDB requires attention to several performance factors: Figure 5. Performance & scalability architecture. Compute overhead management Constitutional AI involves additional processing for self-critique and reasoning. MongoDB's efficient querying and indexing help minimize the data retrieval overhead [5,6]; meanwhile, you should plan for: Caching strategies: Frequently accessed constitutional rules should be cached at the application level. Batch processing: For high-volume scenarios, consider batching constitutional reviews. Async logging: Use asynchronous writes for decision logs to avoid impacting response times. Scaling considerations As your system grows, MongoDB's horizontal scaling capabilities ensure governance infrastructure can keep pace: Sharding strategy: Partition decision logs by time ranges or user segments. Read replicas: Distribute constitutional rule queries across read replicas. Geographic distribution: Use MongoDB's global clusters for multi-region compliance requirements. Current limitations and future directions Constitutional AI with MongoDB isn't without challenges: Model size dependencies Research indicates that Constitutional AI works best with larger models (70B+ parameters) [4]. Smaller models may experience helpfulness trade-offs or behavioral instabilities [1]. This suggests that organizations need to carefully evaluate their model selection and potentially reserve Constitutional AI for critical decision points. Cultural and contextual considerations Defining universal ethical principles remains complex [10]. Your constitutional rules must account for cultural differences, regulatory variations across jurisdictions, and evolving social norms. MongoDB's flexible document model makes it easier to adapt rules over time, but the governance challenge remains significant. Computational costs The dual-phase training and reasoning overhead of Constitutional AI increases compute costs [1]. However, this investment often pays dividends through reduced human oversight requirements and improved compliance outcomes. Figure 6. Security and regulatory compliance framework. Emerging trends and opportunities The field is rapidly evolving toward more sophisticated governance approaches: Public constitutional AI: Future systems may involve communities in defining constitutional principles [4,10], requiring more complex governance data models and stakeholder management capabilities. Iterative governance (IterAlign): Automated refinement of constitutional principles through red-teaming and data-driven discovery [16] will require more sophisticated versioning and change management in your governance data. Regulatory integration: As frameworks like the EU AI Act and emerging US regulations take shape [15], Constitutional AI systems will need to automatically incorporate regulatory requirements into their governance logic [12,13]. Implementation recommendations Based on our analysis, here are key recommendations for developers implementing Constitutional AI with MongoDB: Start with clear governance requirements Before writing code, establish clear answers to: What ethical principles matter most to your organization? What are your regulatory compliance requirements? How will you measure and monitor constitutional adherence? Design for auditability from day one Every aspect of your system should be designed with audit requirements in mind. Use MongoDB's Change Streams and comprehensive logging to ensure you can answer compliance questions months or years later. Implement gradual rollout Constitutional AI systems should be deployed gradually: Start with non-critical applications Monitor performance and ethical outcomes closely Gradually expand to more sensitive use cases Continuously refine constitutional rules based on real-world performance Plan for scale Design your MongoDB schema and infrastructure to handle growth: Use appropriate indexing strategies for your query patterns. Plan your sharding strategy early. Consider data retention policies for decision logs. Implement efficient archival strategies for historical governance data. Conclusion: The path forward Constitutional AI represents a significant advancement in building responsible AI systems, and MongoDB provides the robust data governance infrastructure needed to implement it effectively at scale. The key to success lies in treating governance not as an afterthought, but as a core architectural requirement from day one. Voyage AI and its industry-leading embedding models joining MongoDB marks a pivotal evolution in this space. This combination transforms ethical governance from static rule storage to dynamic semantic understanding, enabling AI systems to apply ethical principles with contextual precision across diverse operational scenarios. With Constitutional AI handling the ethical reasoning, MongoDB managing the governance data, and Voyage AI providing semantic intelligence, organizations can build AI applications that maintain both exceptional performance and unwavering ethical alignment. By managing the governance infrastructure seamlessly, this integrated approach allows developers to focus on building AI applications that truly serve human values while meeting the performance and scale requirements of modern business. The goal is AI systems that make ethical decisions with the same nuance and contextual awareness that humans bring to moral reasoning. As regulations evolve and ethical standards become more sophisticated, this architectural approach provides the flexibility and robustness needed to adapt while maintaining compliance and trust. Organizations that invest in this integrated governance infrastructure today will be best positioned to meet tomorrow's requirements and set new standards for responsible AI deployment. Ready to build ethical AI at scale? Explore how MongoDB can power your AI governance strategy—from secure data infrastructure to real-time compliance. Start building with MongoDB Atlas today . [1] Bai, Y., et al. (2022). Constitutional AI: Harmlessness from AI Feedback. arXiv preprint arXiv:2212.08073. https://arxiv.org/abs/2212.08073 [2] Anthropic. (2023). Claude's Constitution. https://www.anthropic.com/news/claudes-constitution [3] Anthropic. (2024). Chain of Thought Prompting. https://docs.anthropic.com/en/docs/build-with-claude/prompt-engineering/chain-of-thought [4] Bakker, M., et al. (2024). Public Constitutional AI. arXiv preprint arXiv:2406.16696. https://arxiv.org/abs/2406.16696 [5] MongoDB Atlas Performance Scale On-Demand. https://www.mongodb.com/cloud/atlas/performance [6] MongoDB 8.0: Improving Performance, Avoiding Regressions. https://www.mongodb.com/company/blog/mongodb-8-0-improving-performance-avoiding-regressions [7] MongoDB. (2024). Data Governance for Building Generative AI Applications with MongoDB. https://www.mongodb.com/blog/post/data-governance-building-generative-ai-applications-mongodb [8] MongoDB. (2024). Building a Unified Data Platform for Gen AI. https://www.mongodb.com/blog/post/building-unified-data-platform-for-gen-ai [9] MongoDB. (2024). Automate Regulatory Compliance with Advanced AI and MongoDB. https://www.mongodb.com/resources/solutions/use-cases/automate-regulatory-compliance-with-advanced-ai-and-mongodb [10] Sorensen, T., et al. (2024). Collective Constitutional AI: Aligning a Language Model with Public Input. Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency. https://dl.acm.org/doi/10.1145/3630106.3658979 [11] Kundu, S., et al. (2024). C3AI: Crafting and Evaluating Constitutions for Constitutional AI. arXiv preprint arXiv:2502.15861. https://arxiv.org/html/2502.15861v1 [12] MongoDB. (2024). MongoDB Takes Steps Toward Governance In The Era Of GDPR. https://www.mongodb.com/resources/products/capabilities/mongodb-takes-steps-toward-governance-in-the-era-of-gdpr [13] OECD. (2024). AI, Data Governance and Privacy. https://www.oecd.org/en/publications/ai-data-governance-and-privacy_2476b1a4-en.html [14] National Institute of Standards and Technology. (2023). AI Risk Management Framework (AI RMF 1.0). https://www.nist.gov/itl/ai-risk-management-framework [15] U.S. Congress. (2024). Federal A.I. Governance and Transparency Act of 2024. https://www.congress.gov/bill/118th-congress/house-bill/7532/text [16] arXiv. (2024). Towards Adaptive AI Governance. https://arxiv.org/pdf/2504.00652 [17] voyage-3-large: the new state-of-the-art general-purpose embedding model. https://blog.voyageai.com/2025/01/07/voyage-3-large/ [18] Rethinking Information Retrieval in MongoDB with Voyage AI. https://www.mongodb.com/company/blog/engineering/rethinking-information-retrieval-mongodb-with-voyage-ai [19] Domain-Specific Embeddings and Retrieval: Legal Edition (voyage-law-2). https://blog.voyageai.com/2024/04/15/domain-specific-embeddings-and-retrieval-legal-edition-voyage-law-2/ [20] Scaling Vector Search with MongoDB Atlas Quantization & Voyage AI Embeddings. https://www.mongodb.com/blog/post/technical/scaling-vector-search-mongodb-atlas-quantization-voyage-ai-embeddings [21] Matryoshka Embeddings: Smarter Embeddings with Voyage AI. https://www.mongodb.com/company/blog/technical/matryoshka-embeddings-smarter-embeddings-with-voyage-ai [22] Embeddings - Anthropic. https://docs.anthropic.com/en/docs/build-with-claude/embeddings#why-do-voyage-embeddings-have-superior-quality [23] Introducing voyage-context-3: focused chunk-level details with global document context. https://blog.voyageai.com/2025/07/23/voyage-context-3/ [24] Rerankers. https://docs.voyageai.com/docs/reranker

August 19, 2025

Next →

That’s a Wrap: MongoDB’s 2025 in Review & 2026 Predictions

It’s nearly the end of the year—again! That means it’s time for an end-of-year blog post that expresses disbelief at the passage of time. Which, as the saying goes, flies when you’re having fun. And definitely when you’re as busy as MongoDB was in 2025. It was a big year for the company—and more importantly, for the tens of thousands of customers and millions of developers who rely on MongoDB’s modern data platform for their most mission-critical workloads. At MongoDB, everything we do starts with our obsession with customers and their needs, and if there’s a theme to MongoDB’s 2025, it was (and will continue to be) enabling customer innovation and helping them succeed in the AI era. So here are a few highlights of how MongoDB acted on behalf of customers in 2025. From the acquisition of Voyage AI to customer success across industries, a lot happened in 2025. Let’s go!* *Read to the end for 2026 thoughts. 2025: The (MongoDB) year that was Voyage AI, modernization, and search In February, MongoDB announced the acquisition of Voyage AI, a pioneer in embedding and reranking models, to enhance the accuracy of AI applications. Integrating Voyage AI's advanced retrieval technology with MongoDB’s modern, AI-ready data platform addresses a critical challenge: LLM model hallucinations caused by a lack of context. By improving retrieval accuracy for specialized domains like finance and law, the integration enables businesses to deploy AI for mission-critical use cases. To learn more, see the MongoDB Voyage AI page. Then, in September, we launched MongoDB AMP, an AI-powered Application Modernization Platform. AMP is designed to accelerate the transformation of legacy applications through a combination of AI-powered tooling, a proven delivery framework, and expert guidance (tools, techniques, and talent) to help enterprises reduce technical debt and modernize 2-3 times faster. Want more? Sure you do! Check out this short video. MongoDB also announced the addition of search and vector search capabilities to MongoDB Community Edition and MongoDB Enterprise Server. This allows developers to build and test AI-native applications, including those using retrieval-augmented generation (RAG), in local or on-premises environments. Previously exclusive to MongoDB Atlas, these features enable secure, hybrid deployments where sensitive data can remain on-premises while still leveraging advanced search tools. Here’s a (slightly less short) video about search and vector search on Enterprise Server. Growing and scaling with MongoDB As noted, everything we do at MongoDB starts with our obsession with customers. 2025 was another banner year for customer success and innovation—we were inspired by what organizations of every shape and size, across industries and geographies, built with MongoDB in 2025. Here are just two of the many stories our customers shared in 2025; much more can be found in my colleague Katie Palmer’s blog series, Innovating with MongoDB. Factory By combining the Atlas modern data platform with Voyage AI’s high-performance embeddings, the AI-native startup Factory—which uses AI agents called Droids to accelerate software development lifecycles for organizations—consolidated its fragmented tech stack. This enabled superior code retrieval, simplified operations, and provided the scalability needed to process billions of tokens daily. McKesson McKesson, a global pharmaceutical distributor, replaced its monolithic legacy infrastructure with MongoDB Atlas to meet strict drug tracing mandates. By adopting our modern cloud data platform, McKesson scaled its operations 300x, managing tracking data for 1.2 billion containers annually without latency, and ensuring compliance and patient safety while reducing developer complexity. For more, check out the video of McKesson at MongoDB.local NYC from September. From niche NoSQL to enterprise powerhouse As senior MongoDB engineer and Technical Fellow Ashish Kumar put it earlier this year, “through a sustained and deliberate engineering effort,” MongoDB has gone from a (seemingly) niche NoSQL solution to a trusted enterprise standard, and now delivers “the high availability, tunable consistency, ACID transactions, and robust security that enterprises demand.” A new era of leadership The face of MongoDB has also changed—our CFO, Mike Berry, joined the company in April, and Dev Ittycheria stepped down as CEO in November, after more than 11 years leading the company (including its 2017 IPO). In a LinkedIn post about his role, new MongoDB CEO CJ Desai noted that the company is “at the forefront of a new data revolution, unlocking the next wave of productivity and intelligence.” “Having spent my career building and scaling technology platforms, I’ve always been drawn to companies defined by clarity of vision, relentless organic innovation, and a customer-first culture. MongoDB exemplifies all three,” said Desai. We couldn’t agree more. Onward! Reading the 2026 tea leaves So what might 2026 bring (for MongoDB and tech at large)? Here are a handful of our leaders’ predictions: “As much as people want to talk about Artificial General Intelligence (AGI), we’re still in the phase where most AI use cases automate redundant tasks but benefit from human-in-the-loop checks. Organizations that use AI to complete work that historically is a drain on human resources—but then uses people to carefully verify what AI builds, apply governance frameworks, and maintain accountability across the data lifecycle—will be more successful.” —Pete Johnson, Field CTO, AI, MongoDB “After years of inflated expectations and unsustainable spending, the AI industry is trapped in a bubble where companies reflexively attempt to deploy LLMs at every problem, driving up costs with minimal to no return. Businesses that break free from this spending cycle are the ones that understand the need to ground LLM responses in factual data and learn from prior mistakes. We believe the best way to do this will be with highly accurate embedding models and rerankers for reliable data retrieval.” —Frank Liu, Staff Product Manager, MongoDB "In 2026, cloud independence will evolve from strategic preference to existential imperative across enterprises of every scale. The outages and disruptions of recent years have exposed a fundamental truth: in an always-on digital economy—where commerce, mobility, governance, and even public safety depend on uninterrupted access to cloud services—single-provider reliance is no longer a calculated risk, but a systemic vulnerability. Compounding this is the inexorable rise of data sovereignty. Regulatory regimes worldwide now demand precise jurisdictional control over data residency, rendering rigid cloud commitments incompatible with compliance at global scale. The defining competitive advantage will belong to organizations that transcend fragile prevention theater and engineer true infrastructural resilience: architectures inherently portable, data frictionlessly mobile, and operations autonomously sustained across heterogeneous clouds through AI-orchestrated redundancy. In short, the winners will not merely mitigate downtime—they will design systems that render the concept obsolete." —Ben Cefalo, SVP, Head of Core Products, MongoDB Happy holidays and happy New Year, everyone!

December 22, 2025