Why Multi-Agent Systems Need Memory Engineering

Mikiko Bazeley

Most multi-agent AI systems fail not because agents can't communicate, but because they can't remember. Production deployments have shown that agents tend to duplicate work, operate on inconsistent states, and burn through token budgets re-explaining context to each other—problems that scale exponentially as you add more agents.

A breakthrough came from context engineering for individual agents: "The right information at the right time" transforms agent effectiveness. But this principle breaks down catastrophically when multiple agents must coordinate without shared memory infrastructure.

To understand multi-agent coordination, we must first establish our foundation: Agent memory (and memory management) is a computational exocortex for AI agents—a dynamic, systematic process that integrates an agent’s LLM memory (context window and parametric weights) with a persistent memory management system to encode, store, retrieve, and synthesize experiences. Within this system, information is stored as memory units (also called memory blocks)—the smallest discrete, actionable pieces of memory that pair content with rich metadata such as timestamps, strength/confidence, associative links, semantic context, and retrieval hints.
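The memory-unit idea above can be made concrete with a minimal sketch. The field names here (`strength`, `links`, `context`) are illustrative choices, not a standard schema:

```python
from dataclasses import dataclass, field
import time

@dataclass
class MemoryUnit:
    """Smallest discrete piece of agent memory: content paired with rich metadata."""
    content: str
    timestamp: float = field(default_factory=time.time)
    strength: float = 1.0                        # confidence / retrieval priority
    links: list = field(default_factory=list)    # associative links to other unit ids
    context: dict = field(default_factory=dict)  # semantic context and retrieval hints

unit = MemoryUnit(
    content="User prefers weekly summary reports",
    context={"topic": "preferences", "source": "conversation"},
)
```

In a real system, each unit would be persisted to external storage and scored at retrieval time; this sketch only captures the content-plus-metadata shape.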

Agent memory is the key term, but the actual discipline that is the focal point of this piece is memory engineering. Memory engineering is the missing architectural foundation for multi-agent systems. Just as databases transformed software from single-user programs to multi-user applications, shared persistent memory systems enable AI to evolve from single-agent tools to coordinated teams capable of tackling enterprise-scale problems. The path from individual intelligence to collective intelligence runs through memory.

Figure 1. The relationship between memory engineering and context engineering.
This diagram is titled Memory engineering and context engineering. The description says memory engineering and context engineering work hand-in-hand, where memory engineering builds the persistent, intelligent storage systems that context engineering then leverages to dynamically curate the most relevant information for each AI decision. The diagram below has memory on the left and the LLM on the right, with different connectors between the two.

The memory crisis in multi-agent systems

Enterprise AI agents face a fundamental architectural mismatch: they're built on stateless large language models but must operate in stateful environments that demand continuity, learning, and coordination. Without a proper data-to-memory transformation pipeline (aggregation → encoding → storage → organization → retrieval), this mismatch creates cascading failure modes that become exponentially worse when agents work together.

Individual agent memory failures

Every production agent battles four core memory problems.

Context poisoning occurs when hallucinations contaminate future reasoning, creating a feedback loop of increasingly inaccurate responses.

Context distraction happens when too much information overwhelms the agent's decision-making process, leading to suboptimal choices.

Context confusion emerges when irrelevant information influences responses, while context clash creates inconsistencies when conflicting information exists within the same context window.

Figure 2. How context degrades over time.
This image is titled four methods for ruining context quality. There are four boxes to this diagram. The first is titled context poisoning, with a description of when a hallucination makes it into the context. The next box is titled context distraction, when the context overwhelms the training. The third box is titled context confusion, when superfluous context influences the response. The final box is titled context clash, when parts of the context disagree.

Recent research from Chroma reveals an additional critical issue: context rot—the systematic degradation of LLM performance as input length increases, even on trivially simple tasks. Their evaluation of 18 leading models, including GPT-4.1, Claude 4, and Gemini 2.5, demonstrates that performance degrades non-uniformly across context lengths, with models showing decreased accuracy on tasks as basic as text replication when processing longer inputs. The degradation is particularly pronounced when needle-question similarity decreases and when distractors are present, creating cascading failures in multi-agent environments where context pollution spreads between agents.

These problems are expensive. According to Manus AI's production data, agents solving complex tasks average 50 tool calls per task with 100:1 input-to-output token ratios, far exceeding simple chatbot interactions. With context tokens costing $0.30-$3.00 per million tokens across major LLM providers, inefficient memory management becomes prohibitively expensive at scale.
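The figures above translate into concrete per-task costs. A back-of-envelope calculation, assuming a hypothetical ~2,000 input tokens per tool call (the source doesn't specify this number):

```python
# Cost estimate for one complex agent task, using the Manus AI figures above.
tool_calls = 50
input_tokens_per_call = 2_000                     # hypothetical, for illustration
input_tokens = tool_calls * input_tokens_per_call # 100,000
output_tokens = input_tokens // 100               # 100:1 input-to-output ratio

for price_per_m in (0.30, 3.00):                  # $/million input tokens
    cost = input_tokens / 1_000_000 * price_per_m
    print(f"${price_per_m:.2f}/M tokens -> ${cost:.3f} input cost per task")
```

Even at the low end, thousands of such tasks per day—multiplied across an agent team—make redundant context an operational line item.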

Multi-agent coordination failures

When multiple agents operate without proper memory coordination, individual failures become systemic problems. Research by Cemri et al. analyzing over 200 execution traces from popular multi-agent frameworks found failure rates ranging from 40% to over 80%, with 36.9% of failures attributed to inter-agent misalignment issues (Cemri et al., "Why Do Multi-Agent LLM Systems Fail?").

Figure 3. Challenges encountered in multi-agent systems, categorized by type.
Image titled Agentic issues in-action. This image shows a variety of failure types organized in a table.

Work duplication occurs constantly—agents repeat tasks without knowing others have already completed them.

An inconsistent state means different agents operate on different versions of reality, leading to conflicting decisions and recommendations.

Communication overhead skyrockets as agents must constantly re-explain context and previous decisions to each other. Most critically, cascade failures spread one agent's context pollution to others through shared interactions.

The coordination chaos this creates is exactly what Anthropic's team encountered in their early development: "Early agents made errors like spawning 50 subagents for simple queries, scouring the web endlessly for nonexistent sources, and distracting each other with excessive updates" (Anthropic, Hadfield et al., 2025).

These coordination failures aren't just inefficient—they're architecturally inevitable without proper memory engineering. This is particularly evident in Deep Research Mode (one of three core application modes alongside Assistant and Workflow modes), where Anthropic found that multi-agent systems excel when properly architected with shared memory infrastructure.

Memory as the foundation for multi-agent coordination

Understanding multi-agent coordination requires understanding how individual agents manage memory. Like humans, every agent operates within a memory hierarchy—from immediate working memory to long-term stored knowledge to shared cultural understanding:

Figure 4. Memory types specific to multi-agent systems.
This diagram is titled Multi-agent memory: Persona, consensus, whiteboard. The top left box of this diagram is titled short-term memory and contains core identity & context, and context buffer. The top right box is titled long-term memory and contains skills, episodic, semantic, and raw data. The bottom box is titled multi-agent memory and contains short-term external memory and long-term memory.

The context window challenge

A context window represents an agent's active working memory—everything it can "see" and reason about simultaneously. This includes system prompts, tool schemas, memory units, recent conversations, files, and tool responses. Even large models with 128K token limits can be exceeded by complex agent tasks, creating performance bottlenecks and cost explosions.

Context engineering emerged as the solution for individual agents: managing what information enters the context window and how it's organized. The goal is getting "the right information at the right time" to maximize agent effectiveness while minimizing costs.
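One common context-engineering move is selecting what enters the window under a token budget. A minimal sketch, with a word count standing in for a real tokenizer and all names hypothetical:

```python
# Greedy context selection under a token budget (illustrative, not a real system).
def build_context(items, budget_tokens, count_tokens=lambda s: len(s.split())):
    """Pack the highest-relevance items that fit within the budget."""
    chosen, used = [], 0
    for text, relevance in sorted(items, key=lambda it: it[1], reverse=True):
        cost = count_tokens(text)
        if used + cost <= budget_tokens:
            chosen.append(text)
            used += cost
    return chosen

items = [
    ("system prompt for the research agent", 1.0),
    ("full transcript of an unrelated thread", 0.1),
    ("latest tool response with the answer", 0.9),
]
print(build_context(items, budget_tokens=12))
```

Production systems replace the relevance scores with embedding similarity or recency-weighted scoring, but the shape of the problem—rank, then pack within a budget—stays the same.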

Figure 5. Conversations aren’t free—the context window can become a junkyard of prompts, outputs, tool calls, metadata, failed attempts, and irrelevant information.
Image titled how context quality becomes a problem. The left side of the diagram is the LLM, which contains a few items. A few iterations later to the right is an LLM that contains far more items.

This represents the natural evolution from prompt engineering to context engineering, and now to memory engineering and memory management—the operational practice of orchestrating, optimizing, and governing an agent's memory architecture. While single-agent memory management remains an active area of research, techniques like retrieval-augmented generation (RAG) for semantic knowledge access, hierarchical summarization for compressing conversation history, dynamic context pruning for removing irrelevant information, and external memory systems like MemGPT have dramatically improved individual agent reliability and effectiveness over the past few years.

The multi-agent memory challenge

Research insight: Studies show that memory management in multi-agent systems must handle "complex context data and sophisticated interaction and history information," requiring advanced memory design. These systems need both individual agent memory capabilities and sophisticated mechanisms for "sharing, integrating, and managing information across the different agents" (LLM Multi-Agent Systems: Challenges and Open Problems, Section 4).

Most memory engineering techniques developed to date focus on optimizing individual agents, not multi-agent systems. But coordination requires fundamentally new memory structures and patterns that single-agent systems never needed. Multi-agent systems demand innovations like consensus memory (a specialized form of procedural memory) for verified team procedures, persona libraries (extensions of semantic memory's persona memory) for role-based coordination, and whiteboard methods (implementations of shared memory configured for short-term collaboration)—structures that emerge only when agents work together.

Figure 6. How persona, consensus, and whiteboard memory work together.
This diagram is titled multi-agent memory: Persona, consensus, whiteboard. Starting on the left, short-term internal memory receives information from the long-term memory knowledge retrieval & storage, as well as long-term memory agent team alignment. It also interacts with the multi-agent memory's short-term external memory.

More importantly, multi-agent systems create opportunities to invest in collective memory that improves both current performance and future agent capabilities. Shared external memory enables several critical capabilities:

Persistent state across agent interactions ensures continuity when agents hand off tasks or collaborate on long-running projects.

Atomic operations provide consistent updates when multiple agents need to modify a shared state simultaneously.

Conflict resolution handles situations where agents attempt contradictory updates to the same information.

Performance optimization through caching and indexing reduces redundant operations across agent teams.

Context windows become shared resources requiring careful management.

Core memory alignment ensures agents share essential state and objectives.

Selective context sharing propagates relevant information between agents without overwhelming their individual context windows.

Memory block coordination provides synchronized access to shared memory blocks.

Cost optimization maximizes KV-cache hit rates across agent interactions, reducing the exponential cost growth that kills multi-agent deployments.
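Several of the capabilities above—persistent state, atomic operations, consistent concurrent updates—can be illustrated with a toy shared store. This is a single-process sketch using a lock; a production system would use a database with real transactions, and every name here is hypothetical:

```python
import threading

class SharedMemory:
    """Toy shared store: persistent state across agents with atomic updates."""
    def __init__(self):
        self._data, self._lock = {}, threading.Lock()

    def update(self, key, fn, default=None):
        # Atomic read-modify-write: no agent ever observes a partial update.
        with self._lock:
            self._data[key] = fn(self._data.get(key, default))
            return self._data[key]

    def read(self, key):
        with self._lock:
            return self._data.get(key)

store = SharedMemory()
# Two "agents" append findings to a shared task log without clobbering each other.
store.update("findings", lambda xs: xs + ["agent-a: source verified"], default=[])
store.update("findings", lambda xs: xs + ["agent-b: summary drafted"], default=[])
print(store.read("findings"))
```

The key design point is that agents submit an update *function* rather than a value, so the read-modify-write cycle happens entirely inside the critical section.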

The economic impact of smart context management in multi-agent systems isn’t trivial, as Anthropic found in their multi-agent deep research deployment: "In our data, agents typically use about 4× more tokens than chat interactions, and multi-agent systems use about 15× more tokens than chats" (Anthropic, Hadfield et al., 2025).

Yet when coordination and memory work together, the results are remarkable. Anthropic's research system provides compelling evidence: "We found that a multi-agent system with Claude Opus 4 as the lead agent and Claude Sonnet 4 subagents outperformed single-agent Claude Opus 4 by 90.2% on our internal research eval" (Anthropic, Hadfield et al., 2025). This dramatic improvement shows the multiplicative potential of well-coordinated agent teams.

The 5 pillars of multi-agent memory engineering

Successful multi-agent memory engineering requires architectural foundations that extend beyond single-agent patterns. These five pillars provide the complete framework for scalable multi-agent systems.

Figure 7. Five pillars of engineering memory for multi-agent systems.
Image titled five pillars: engineering memory. The first pillar is persistence (write context): store information outside the context window. The next is optimization (compress context): maximize information density within token constraints. The third is retrieval (select context): intelligently choose what enters the context window. The fourth is separation (isolate context): separate contexts to prevent interference and optimize performance. The final pillar is resolution (sync context): ensure multiple agents maintain consistent shared context during concurrent updates.

1. Persistence architecture (storage and state management)

Multi-agent systems need sophisticated storage patterns that enable coordinated state management across agent teams.

Memory units structured as YAML or JSON documents in systems like MongoDB provide the foundation for complex multi-agent state management. These memory units—structured containers with metadata and relationships—can be configured as either short-term or long-term shared memory depending on use case requirements.

Shared Todo.md patterns extend the proven individual agent pattern of constantly updated objectives to team-level coordination. A shared objective tracking system ensures all agents work toward aligned goals while maintaining visibility into team progress.

Cross-agent episodic memory captures interaction history and decision patterns between agents. This enables agents to learn from past coordination successes and failures, improving future collaboration effectiveness.

Procedural memory evolution stores workflows and coordination protocols that improve over time. As agent teams encounter new scenarios, they can update shared procedures, creating institutional memory that benefits the entire system.
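The patterns above can be sketched as document shapes. These field names are illustrative, not a prescribed schema; the documents are shown as plain dicts, shaped for a document store such as MongoDB:

```python
import time, json

# Hypothetical memory-unit document for a shared document store.
memory_unit = {
    "type": "episodic",
    "scope": "shared",   # visible to the whole agent team
    "content": "Agent researcher-1 completed the market-sizing subtask",
    "metadata": {
        "timestamp": time.time(),
        "strength": 1.0,
        "links": ["todo:market-sizing"],
    },
}

# A shared Todo-style objective tracker lives in the same store.
shared_todo = {
    "type": "todo",
    "scope": "shared",
    "objectives": [
        {"task": "market-sizing", "status": "done", "owner": "researcher-1"},
        {"task": "competitor-scan", "status": "open", "owner": None},
    ],
}
print(json.dumps(shared_todo["objectives"][0]))
```

With a real MongoDB deployment, a document like `memory_unit` would be persisted via PyMongo's `collection.insert_one(memory_unit)`, and the todo document updated in place as agents claim and complete tasks.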

2. Retrieval intelligence (selection and querying)

Retrieving the right information at the right time becomes exponentially more complex with multiple agents accessing shared memory concurrently.

Embedding-based retrieval uses vector similarity to find relevant cross-agent memory, but must account for agent-specific contexts and capabilities. A customer service agent and a technical support agent need different information about the same customer issue.

Agent-aware querying tailors memory selection based on individual agent capabilities and roles. The system understands which agents can act on specific types of information and prioritizes accordingly.

Temporal coordination manages time-sensitive information sharing. When one agent discovers urgent information, the memory system must propagate this to relevant agents quickly while avoiding information overload for agents working on unrelated tasks.

Resource orchestration coordinates access across multiple knowledge bases, APIs, and external systems. Rather than each agent independently querying resources, the memory system can optimize queries and cache results for team benefit.
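Embedding-based retrieval combined with agent-aware filtering can be sketched in a few lines. The toy 2-D vectors and role sets below are stand-ins for real embeddings and access policies:

```python
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def retrieve(query_vec, units, agent_role, k=2):
    """Rank shared memory by similarity, restricted to units the role may act on."""
    visible = [u for u in units if agent_role in u["roles"]]
    return sorted(visible, key=lambda u: cosine(query_vec, u["vec"]), reverse=True)[:k]

units = [
    {"text": "ticket history",  "vec": [1.0, 0.0], "roles": {"support", "tech"}},
    {"text": "stack trace",     "vec": [0.9, 0.1], "roles": {"tech"}},
    {"text": "billing records", "vec": [0.0, 1.0], "roles": {"support"}},
]
print([u["text"] for u in retrieve([1.0, 0.0], units, agent_role="support")])
```

A support agent and a technical agent issuing the same query get different result sets—the role filter runs before similarity ranking, so specialized knowledge never leaks into the wrong context window.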

3. Performance optimization (compression and caching)

Optimization becomes critical when context costs multiply across agent teams.

Hierarchical summarization compresses inter-agent communication efficiently. Rather than storing complete conversation transcripts between agents, the system can create layered summaries that preserve essential information while reducing storage and retrieval costs.

Selective preservation maintains restoration paths for complex coordination scenarios. Even when compressing information, the system preserves references to original sources, enabling agents to access full context when needed.

Intelligent eviction implements memory lifecycle management through forgetting (gradual strength degradation) rather than deleting, reducing redundant information while preserving coordination state. The system lowers the strength attribute of outdated memory units while keeping their structure intact for potential reactivation.

Cross-agent cache optimization implements shared KV-cache strategies that benefit the entire agent team. When one agent processes information, the results can be cached for other agents with similar contexts, dramatically reducing costs.
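The forgetting-over-deleting idea described above reduces to a decay pass over stored units. A minimal sketch, with the decay rate and activation floor chosen arbitrarily for illustration:

```python
def decay(units, rate=0.5, floor=0.05):
    """Weaken rather than delete: degrade strength, deactivate below the floor."""
    for u in units:
        u["strength"] *= rate
        u["active"] = u["strength"] >= floor
    return units

units = [
    {"id": "m1", "strength": 1.0},
    {"id": "m2", "strength": 0.08},
]
decay(units)
# m1 stays active (0.5); m2 drops below the floor (0.04) but is kept for reactivation.
print([(u["id"], u["active"]) for u in units])
```

Because deactivated units keep their content and metadata, a later retrieval hit can restore their strength instead of re-deriving the information from scratch.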

4. Coordination boundaries (isolation and access control)

Effective boundaries prevent context pollution while enabling necessary coordination.

Agent specialization creates domain-specific memory isolation. A financial analysis agent and a marketing agent can share high-level project information while maintaining separate specialized knowledge bases.

Memory management agents handle cross-team memory operations as a dedicated responsibility. Rather than every agent managing memory independently, specialized agents can optimize memory operations for the entire team.

Workflow orchestration coordinates context across specialized agent teams. The system understands how information flows between different agent roles and can manage context propagation accordingly.

Session boundaries isolate memory by project, user, or task domain. This prevents information leakage between unrelated workstreams while enabling rich context within specific collaboration boundaries.
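Session and domain boundaries can be enforced with simple namespacing. A sketch under the assumption that each project/role pair gets its own namespace (the slash-delimited naming is just a convention chosen here):

```python
class ScopedMemory:
    """Session/domain isolation: keys are namespaced so workstreams can't leak."""
    def __init__(self):
        self._data = {}

    def put(self, namespace, key, value):
        self._data.setdefault(namespace, {})[key] = value

    def get(self, namespace, key, default=None):
        return self._data.get(namespace, {}).get(key, default)

mem = ScopedMemory()
mem.put("project-a/finance", "forecast", "Q3 draft")
mem.put("project-b/marketing", "campaign", "launch plan")
# The marketing workstream cannot see finance state, and vice versa.
print(mem.get("project-b/marketing", "forecast"))
```

Cross-boundary sharing then becomes an explicit operation—copying a unit into a shared namespace—rather than an accident of global state.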

5. Conflict resolution (handling simultaneous updates)

Multi-agent systems must gracefully handle situations where agents attempt contradictory or simultaneous updates to shared memory.

Atomic operations ensure that critical updates to memory units happen entirely or not at all. When multiple agents operate on shared memory units simultaneously, atomic operations prevent partial updates that could leave the system in an inconsistent state.

Version control patterns track changes to shared memory over time, enabling agents to understand how information evolved and resolve conflicts based on temporal precedence or agent authority levels.

Consensus mechanisms handle situations where agents have conflicting information about the same topic. The system must determine which information is authoritative and how to propagate corrections to agents operating on outdated knowledge.

Priority-based resolution resolves conflicts based on agent roles, information recency, or confidence levels. A specialized technical agent's assessment might override a general-purpose agent's conclusion about a technical issue.

Rollback and recovery enable the system to revert problematic changes when conflicts create an inconsistent state. If a memory update causes downstream coordination failures, the system can roll back to a known-good state and retry with better conflict resolution.
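Version control, conflict detection, and rollback can be combined in an optimistic-concurrency sketch: each write names the version it read, stale writes are rejected, and history enables reverting. All class and method names here are illustrative:

```python
class VersionedStore:
    """Optimistic concurrency: writes name the version they read; stale writes fail."""
    def __init__(self):
        self._data = {}     # key -> (version, value)
        self._history = {}  # key -> prior (version, value) entries, for rollback

    def read(self, key):
        return self._data.get(key, (0, None))

    def write(self, key, expected_version, value):
        version, current = self._data.get(key, (0, None))
        if version != expected_version:
            return False  # conflict: another agent wrote first; caller must re-read
        self._history.setdefault(key, []).append((version, current))
        self._data[key] = (version + 1, value)
        return True

    def rollback(self, key):
        version, value = self._history[key].pop()
        self._data[key] = (version, value)

store = VersionedStore()
v, _ = store.read("status")
assert store.write("status", v, "researching")    # agent A wins the race
assert not store.write("status", v, "drafting")   # agent B is stale, must re-read
```

Priority-based resolution slots in at the conflict point: instead of always rejecting the stale write, the store could compare agent authority or confidence before deciding which value survives.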

Figure 8. Analogies for memory engineering.
This image is titled memory analogies. The first box is titled persistence (write context), and creates memories. The second box is titled optimization (compress context), selectively remembers memories. The third is retrieval (select context), gets memories. The fourth is separation (isolate context), cleanly organizes streams of thoughts. The final is resolution (sync context), synchronizes shared memory updates.

Measuring multi-agent memory success

Successful multi-agent memory engineering requires understanding what good coordination looks like and how it relates to the five architectural pillars. Rather than focusing on individual agent performance, we must measure emergent behaviors that only appear when agents work together effectively.

What success looks like

Seamless coordination makes multi-agent systems reliable (through consistent access to accurate historical context), believable (through trustworthy inter-agent interactions), and capable (through leveraging accumulated collective knowledge)—the RBC framework that defines successful agent memory implementation.

Collective intelligence emerges when agent teams consistently outperform individual agents on complex tasks. The shared memory system enables capabilities that no single agent could achieve alone.

Cost-effective scaling occurs when adding agents to a team reduces per-task costs rather than multiplying them. Effective memory sharing prevents the exponential cost growth that kills most multi-agent deployments.

Resilient operations maintain continuity when individual agents fail or new agents join the team. The shared memory system preserves institutional knowledge and enables smooth transitions.

Adaptive learning allows agent teams to improve over time through accumulated experience. Shared memory becomes an investment that benefits current and future agent deployments.

Figure 9. High-level signals for measuring memory engineering quality.
This image is titled signals of memory-engineered multi-agents. The first box is titled persistence (persistent architecture success), consistent state across agents & zero data loss. The second box is optimization (performance optimization success), sub-linear cost scaling—effective caching & compression should decrease per-agent costs. The third box is retrieval (retrieval intelligence success), effective & fast retrieval for each agent. The fourth box is separation (coordination boundaries success), agents maintain specialization while still contributing to the overall task. The final box is resolution (conflict resolution success), synchronized shared memory updates.

Persistence architecture success shows up as a consistent state across all agents and zero data loss during coordination. You measure this through state synchronization rates and the absence of coordination conflicts caused by inconsistent storage.

Retrieval intelligence success manifests as agents quickly finding exactly the information they need without information overload. Effective retrieval means agents spend time acting rather than searching.

Performance optimization success appears as sub-linear cost scaling as you add agents and tasks. Well-optimized systems show decreasing per-agent costs through effective caching and compression.

Coordination boundaries success prevents context pollution while enabling necessary information sharing. You see this in specialized agents maintaining their expertise while contributing to team objectives.

Conflict resolution success handles simultaneous updates gracefully and maintains system consistency even when agents discover contradictory information. The system should rarely require manual intervention to resolve conflicts.

The ultimate measure

The true test of multi-agent memory engineering is whether agent teams tackle problems that individual agents cannot solve, at costs that make deployment viable. Success means transitioning from "agents helping humans" to "agent teams solving problems independently"—the difference between tools and teammates.

Organizations implementing sophisticated memory engineering will achieve 3× decision speed improvement and 30% operational cost reduction by 2029 (Gartner prediction), demonstrating the strategic value of proper memory architecture.

The path forward

Memory engineering represents the missing infrastructure layer for production multi-agent systems. Just as relational databases enabled the transition from single-user desktop applications to multi-user web applications, shared memory systems enable the transition from single-agent tools to multi-agent intelligence.

The companies succeeding with AI agents today have figured out memory architecture, not just prompt engineering. They understand that agent coordination requires the same foundational infrastructure thinking that built the modern web: persistent state, atomic operations, conflict resolution, and performance optimization.

Research validates this approach: Enterprises implementing proper memory engineering achieve 18% ROI above cost-of-capital thresholds (IBM Institute for Business Value), while systems without memory architecture continue to struggle with the coordination failures that plague 40-80% of multi-agent deployments.

Key takeaways

Multi-agent systems fail because of memory problems, not communication problems. The real issue isn't that agents can't talk to each other—it's that they can't remember and coordinate state effectively.

Memory engineering is the key differentiator for production multi-agent systems. Companies succeeding with AI agents have figured out memory architecture, not just prompt engineering.

Individual agent memory patterns extend naturally to multi-agent coordination. The same principles (write, select, compress, isolate) that solve single-agent context problems solve multi-agent coordination problems.

Shared external memory is essential infrastructure. Just like databases enabled web applications to scale, shared memory systems enable AI agents to coordinate at enterprise scale.

To explore memory engineering further, start experimenting with memory architectures using MongoDB Atlas or review our detailed tutorials available at AI Learning Hub.