Présentation d'Atlas Stream Processing - Simplifier le chemin vers des applications réactives et event-driven

MongoDB
September 14, 2023 | Updated: September 15, 2023

Aujourd'hui, nous sommes heureux d'annoncer l'avant-première privée (Private Preview) d'Atlas Stream Processing !

Le monde évolue de plus en plus vite et vos applications doivent pouvoir tenir le rythme. Les applications réactives et event-driven donnent vie à des expériences numériques pour vos clients et accélèrent le temps de compréhension et d'action pour l'entreprise. Pensez à :

avertir vos utilisateurs dès que l'état de leur livraison change,
bloquer les transactions frauduleuses lors du traitement des paiements
analyser la télémétrie des capteurs au fur et à mesure qu'elle est générée afin de détecter les défaillances potentielles de l'équipement et d'y remédier avant les pannes coûteuses.

Dans chacun de ces exemples, les données perdent de leur valeur au fil des secondes. Elles doivent être requêtées et faire face à une action en continu et ce avec une faible latence. Pour ce faire, les développeurs se tournent de plus en plus vers des applications event-driven alimentées par des données en continu, de manière à ce qu'elles puissent instantanément réagir et répondre à l'évolution constante du monde qui les entoure. Atlas Stream Processing aidera les développeurs à passer plus rapidement à des applications event-driven.

Au fil des ans, les développeurs ont adopté la base de données MongoDB parce qu'ils apprécient la flexibilité et la facilité d'utilisation de document model, ainsi que l'API de requête MongoDB qui leur permet de travailler avec des données en tant que code. Ces principes fondamentaux éliminent radicalement les frictions liées au développement de logiciels et d'applications. Aujourd'hui, nous appliquons ces mêmes principes aux données en continu. Atlas Stream Processing redéfinit l'expérience des développeurs qui travaillent avec des streams complexes de données à haute vitesse et à évolution rapide, et unifie la façon dont les développeurs travaillent avec des données en mouvement et au repos.

Bien que les produits et technologies existants aient apporté de nombreuses innovations en matière de flux et de traitement des flux, nous pensons que MongoDB est naturellement bien adapté pour aider les développeurs à relever certains défis clés restants. Ces défis comprennent la difficulté de travailler avec des données variables, à fort volume et à grande vitesse, les frais généraux contextuels liés à l'apprentissage de nouveaux outils, langages et API, ainsi que la maintenance opérationnelle supplémentaire et la fragmentation qui peuvent être introduites par des technologies ponctuelles dans des piles d'applications complexes.

Présentation de l'Atlas Stream Processing

Atlas Stream Processing permet de traiter des flux de données complexes à grande vitesse avec quelques avantages uniques pour le développeur :

Atlas Stream Processing est basé sur le modèle documentaire, ce qui lui confère une grande souplesse dans le traitement des structures de données imbriquées et complexes qui sont courantes dans les streams d'événements. Cela évite les étapes de prétraitement et permet aux développeurs de travailler de façon intuitive et facile avec des données aux structures complexes. Tout comme la base de données le permet.
Cela unifie l'expérience de travail à travers tout type de données, en offrant une plateforme unique - à travers l'API, le langage de requête et les outils - pour traiter des données de streams riches et complexes, parallèlement aux données d'application critiques de votre base de données.
Atlas Stream Processing est entièrement managé dans MongoDB Atlas, en s'appuyant sur un définisseur de services intégrés déjà robuste. Quelques appels d'API et quelques lignes de code suffisent pour mettre en place un processeur de flux, une base de données et une couche de service d'API sur l'ensemble des principaux fournisseurs cloud.
Atlas Stream Processing est entièrement managé dans MongoDB Atlas, en s'appuyant sur un définisseur de services intégrés déjà robuste. Quelques appels d'API et quelques lignes de code suffisent pour mettre en place un processeur de flux, une base de données et une couche de service d'API sur l'ensemble des principaux fournisseurs cloud.

Comment fonctionne Atlas Stream Processing ?

Atlas Stream Processing se connecte à vos données critiques, qu'elles se trouvent dans MongoDB (via change stream) ou dans une plateforme de streaming comme Apache Kafka. Les développeurs peuvent se connecter facilement et de manière transparente à Confluent cloud, Amazon MSK, Redpanda, Azure Event Hubs, ou à l'auto-managé Kafka en utilisant le protocole Kafka wire. En s'intégrant au pilote Kafka natif, Atlas Stream Processing offre des performances natives à faible latence.

En plus de notre partenariat stratégique de longue date avec Confluent, nous sommes heureux d'annoncer les partenariats avec AWS, Microsoft, Redpanda et Google, dès le lancement.

Atlas Stream Processing fournit ensuite 3 fonctionnalités clés nécessaires pour transformer votre stream de données en une expérience client différenciée. Examinons-les un par un.

Traitement en continu

Tout d'abord, les développeurs peuvent désormais utiliser le modèle d'agrégation de MongoDB pour traiter en continu des flux de données riches et complexes provenant de plateformes de streaming d'événements telles qu'Apache Kafka. Cela ouvre de nouvelles voies puissantes pour le requêtage, l'analyse et l'accès aux données en continu sans aucun des retards inhérents au traitement par lots. Avec modèle d'agrégation, vous pouvez filtrer et grouper les données, agréger les streams d'événements à grande vitesse en informations exploitables sur le temps d'état Windows, ce qui permet d'enrichir les expériences d'application en temps réel.

Validation continue

Ensuite, Atlas Stream Processing offre aux développeurs des mécanismes robustes et natifs pour traiter les problèmes de données incorrectes qui peuvent autrement causer des ravages dans les applications. Les potentiels problèmes comprennent la transmission de résultats inexacts à l'application, la perte de données et les temps d'arrêt de l'application. Atlas Stream Processing résout ces problèmes afin de garantir le traitement et le partage des données en continu de manière fiable entre les applications event-driven.

Atlas Stream Processing :

Apporte une validation en continue du schéma afin de vérifier que les événements sont correctement formés avant d'être traités, par exemple en rejetant les événements dont les champs sont manquants ou qui contiennent des plages de valeurs non valides.
Détecte l'altération des messages
Détecte les données arrivées tardivement qui ont manqué une fenêtre de traitement.

Les pipelines d'Atlas Stream Processing peuvent être configurées avec une Dead Letter Queue (DLQ) dans laquelle les événements qui échouent à la validation sont acheminés. Cela évite aux développeurs d'avoir à construire et à maintenir leurs propres implémentations personnalisées. Les problèmes peuvent être rapidement résolus et le risque que des données manquantes ou corrompues entraînent l'arrêt de l'ensemble de l'application est minimisé.

Fusion continue

Les données traitées peuvent ensuite être matérialisées en continu dans des vues gérées dans des collections de bases de données Atlas. On peut considérer qu'il s'agit d'une requête "push". Les applications peuvent extraire les résultats (via des requêtes) de la vue en utilisant l'API de requête MongoDB ou l'interface Atlas SQL. La fusion continue des mises à jour des collections est un moyen très efficace de maintenir des vues analytiques fraîches des données pour soutenir la prise de décision et l'action humaine et automatisée. Outre les vues matérialisées, les développeurs ont également la possibilité de publier les événements traités dans des systèmes de diffusion en continu tels qu'Apache Kafka.

Création d'un Stream Processor

Nous allons vous montrer à quel point il est facile de construire un processeur de flux dans MongoDB Atlas. Avec Atlas Stream Processing, vous pouvez utiliser la même syntaxe de pipeline d'aggrégation pour un stream processor que celle que vous connaissez pour la base de données. Nous présentons ci-dessous une instance de traitement de stream simple, du début à la fin. Il suffit de quelques lignes de code.

Tout d'abord, nous allons écrire une pipeline d'agrégation qui définit une source pour vos données, effectue une validation pour s'assurer que les données ne proviennent pas de l'adresse IP localhost/127.0.0.1, crée une fenêtre de basculement pour collecter des données de messages groupés toutes les minutes, puis fusionne ces données nouvellement traitées dans une collection MongoDB dans Atlas.

Ensuite, nous allons créer notre Stream Processor appelé "netattacks" en spécifiant notre pipeline nouvellement définie p ainsi que dlq comme arguments. Ceci lancera le traitement souhaité et, grâce au Dead Letter Queue (DLQ), stocke en toute sécurité les données non valides en vue d'une inspection, d'un débogage ou d'un nouveau traitement ultérieur.

Enfin, nous pouvons le démarrer. C'est tout ce qu'il faut pour construire un stream processor dans MongoDB Atlas.

Demander un aperçu privé

Nous sommes impatients de mettre ce produit entre vos mains et de voir ce que vous en ferez. Pour en savoir plus sur Atlas Stream Processing et demander un accès anticipé ici pour participer à l'aperçu privé une fois qu'il sera ouvert aux développeurs.

Nouveau sur MongoDB ? Commencez gratuitement dès aujourd'hui en créant votre compte MongoDB Atlas.

Sphère de sécurité

Le développement, la publication et le calendrier des caractéristiques ou fonctionnalités décrites pour nos produits demeurent à notre entière discrétion. Cette information est simplement destinée à décrire l'orientation générale des produits et ne doit pas être invoquée pour prendre une décision d'achat, et n'est pas un engagement, une promesse ou une obligation légale de fournir un matériel, un code ou une fonctionnalité.

← Previous

Einführung von Atlas Vector Search: Entwickeln Sie intelligente Anwendungen mit Semantic Search und KI für alle Arten von Daten

Wir freuen uns, Ihnen mitteilen zu können, dass Atlas Vector Search jetzt allgemein verfügbar ist. Vector Search unterstützt jetzt Produktions-Workloads, sodass Sie auch weiterhin intelligente Anwendungen auf Basis von Semantic Search und generativer KI erstellen können, während der Ressourcenverbrauch optimiert und die Leistung mit Search Nodes verbessert wird. Die vollständige Ankündigung und die Liste der Vorteile finden Sie unten. Die Zeit ist endlich gekommen. Künstliche Intelligenz hat sich nach links verschoben. Was einst in unternehmensweiten Data-Science- und Machine-Learning-Teams entwickelt wurde und oft dort feststeckte, ist jetzt für Entwickler auf der ganzen Welt leicht verfügbar. Um jedoch die unglaubliche Leistungsfähigkeit dieser neuen Tools nutzen zu können, müssen sie auf einer zuverlässigen, zusammensetzbaren und eleganten Datenplattform aufbauen. Gleichzeitig sind diese neuen Funktionen, wie wir alle gesehen haben, nur so gut, wie die Daten oder die „Grundwahrheit“, auf die sie zugreifen müssen. Aus diesem Grund freuen wir uns, der MongoDB Atlas moderne Datenbank eine weitere Funktion hinzuzufügen, um das volle Potenzial Ihrer Daten auszuschöpfen und KI-Anwendungen voranzutreiben. MongoDB freut sich, heute unsere aufregende neue Vector Search Funktion bekannt zu geben, die den Anforderungen von Daten in allen Formen gerecht wird und es unseren Partnern ermöglicht, diese unglaublichen neuen Funktionen zu nutzen. Auf unserer KI-Ressourcenseite erfahren Sie mehr über die Entwicklung von KI-gestützten Apps mit MongoDB. Was ist die Fähigkeit? Für diejenigen unter Ihnen, die es nicht kennen: Vector Search ist eine Funktion, mit der Sie Ihre Daten auf der Grundlage der Semantik oder der Bedeutung der Daten und nicht auf der Grundlage der Daten selbst abfragen können. Möglich wird dies dadurch, dass man jede Form von Daten numerisch als Vektor darstellen kann, die dann mit Hilfe hochentwickelter Algorithmen miteinander verglichen werden können. Der erste Schritt besteht darin, die Quelldaten, egal ob Text, Audio, Bild oder Video, mithilfe eines „Kodierungsmodells“ in „Vektoren“ oder „Einbettungen“ umzuwandeln. Dank der jüngsten Fortschritte in der künstlichen Intelligenz sind diese Vektoren nun besser in der Lage, die Bedeutung von Daten zu erfassen, indem sie niedrigerdimensionale Daten in einen höherdimensionalen Raum projizieren, der mehr Kontext zu den Daten enthält. Sobald diese Daten in diese numerischen Darstellungen umgewandelt wurden, können Sie Abfragen durchführen, um ähnliche Werte mithilfe eines ANN-Algorithmus (Approximate Nearest Neighbors) zu finden, der es Ihren Abfragen ermöglicht, sehr schnell Daten mit ähnlichen Vektoren zu finden. Auf diese Weise können Sie Anfragen wie „Gib mir Filme mit dem Gefühl der Traurigkeit“ oder „Gib mir Bilder, die aussehen wie ...“ erfüllen. Diese Fähigkeit eröffnet Ihnen eine ganz neue Gruppe von Möglichkeiten. In welcher Beziehung steht dies zu unserer Plattform? Da diese Funktionalität nativ in MongoDB Atlas integriert ist, müssen Sie Ihre Daten nicht kopieren und transformieren, keinen neuen Stack und keine neue Syntax erlernen oder eine völlig neue Infrastruktur managen. Mit Atlas Vector Search von MongoDB ist das alles nicht nötig. Sie können diese leistungsstarken neuen Funktionen innerhalb einer erstklassigen und kampferprobten Plattform nutzen, um Anwendungen schneller als je zuvor zu entwickeln. Viele der mit der Nutzung von KI und Vector Search verbundenen Herausforderungen ergeben sich aus der Komplexität, die mit der sicheren Offenlegung Ihrer Anwendungsdaten verbunden ist. Diese Aufgaben erhöhen die Reibungspunkte für die Entwicklererfahrung und erschweren das Entwickeln, Debuggen und Warten Ihrer Anwendungen. MongoDB beseitigt all diese Herausforderungen und bringt gleichzeitig die Leistungsfähigkeit von Vector Search auf eine Plattform, die organisch vertikal und horizontal skaliert, um jeden Workload zu bewältigen, die Sie ihr auferlegen. Schließlich ist nichts davon ohne Garantien in Bezug auf Sicherheit und Verfügbarkeit von Bedeutung, und das Engagement von MongoDB für eine sichere Datenverwaltungslösung zusammen mit hoher Verfügbarkeit durch Redundanz und automatischem Failover sorgt dafür, dass Ihre Anwendung nie aus dem Takt kommt. Neu bei MongoDB.local London Bei .Local London freuen wir uns, die Einführung einer speziellen Vektorsuch-Aggregationsstufe ankündigen zu können, die über $vectorSearch aufgerufen werden kann. Diese neue Aggregationsstufe führt ein paar neue Konzepte ein, die die Vektorsuche noch leistungsfähiger und einfacher als je zuvor machen. Mit $vectorSearch können Sie auch einen Vorfilter mit MQL-Syntax (z. B. $gte, $eq usw.) verwenden, die beim Durchlaufen des Index Dokumente herausfiltert, was zu konsistenten Ergebnissen und hoher Leistung führt. Jeder Entwickler, der MongoDB versteht, wird diese Filterfunktion problemlos nutzen können! Schließlich führen wir auch zwei Möglichkeiten ein, Ihre Ergebnisse innerhalb der Aggregationsphase zu optimieren, sowohl einen „numCandidates“ -Parameter als auch einen „limit“ -Parameter. Mit diesen Parametern können Sie einstellen, wie viele Dokumente für die ungefähre Suche nach dem nächsten Nachbarn in Frage kommen sollen, und dann die Anzahl der gewünschten Ergebnisse mit dem „Limit“ einschränken. Wie interagiert dies mit der Umgebung? Die Menge an Innovationen im Bereich der künstlichen Intelligenz ist erstaunlich und es ist beeindruckend zu sehen, welche rasanten Fortschritte die Open Source Community macht. Es gibt enorme Fortschritte bei Open-Source-Sprachmodellen sowie bei den verschiedenen Methoden, mit denen sie in Anwendungen integriert werden können. Angesichts der enormen Leistungsfähigkeit, die künstliche Intelligenz bietet, war es noch nie so wichtig, eine solide Abstraktion für diese Funktion zu haben, um Entwicklern die Flexibilität zu bieten, die sie benötigen. Vor diesem Hintergrund freuen wir uns, Ihnen mitteilen zu können, dass wir in LangChain und LlamaIndex mehrere Funktionen unterstützen, von der Unterstützung von Vector Search bis hin zur Chat-Protokollierung und Dokumentindizierung. Wir schreiten hier schnell voran und werden weiterhin neue Funktionen für die führenden Anbieter veröffentlichen. Zusammenfassung Trotz alledem geht es gerade erst los. Wir bei MongoDB setzen uns dafür ein, Entwicklern dabei zu helfen, die nächste Generation von KI-fähigen Anwendungen zu entwickeln – mit der besten moderne Datenbank auf dem Markt. Wir werden uns auch mit weiteren Frameworks und Plugin-Architekturen befassen, die wir unterstützen können. Aber wie immer sind Sie als Entwickler der wichtigste Teil dieser Gleichung. Wir werden mit der Community sprechen und Wege finden, wie wir Ihnen am besten helfen können, und sicherstellen, dass wir Ihre Bedürfnisse bei jedem Schritt erfüllen. Geh los und entwickel! Um mehr über Atlas Vector Search zu erfahren und herauszufinden, ob es die richtige Lösung für Sie ist, schauen Sie sich unsere Dokumentation , unser Whitepaper und unsere Tutorials an oder legen Sie noch heute los .

September 14, 2023

Next →

Build AI Agents Worth Keeping: The Canvas Framework

Why 95% of enterprise AI agent projects fail Development teams across enterprises are stuck in the same cycle: They start with "Let's try LangChain" before figuring out what agent to build. They explore CrewAI without defining the use case. They implement RAG before identifying what knowledge the agent actually needs. Months later, they have an impressive technical demo showcasing multi-agent orchestration and tool calling—but can't articulate ROI or explain how it solves actual business needs. According to McKinsey's latest research, while nearly eight in 10 companies report using generative AI, fewer than 10% of use cases deployed ever make it past the pilot stage . MIT researchers studying this challenge identified a " gen AI divide "—a gap between organizations successfully deploying AI and those stuck in perpetual pilots. In their sample of 52 organizations, researchers found patterns suggesting failure rates as high as 95% (pg.3). Whether the true failure rate is 50% or 95%, the pattern is clear: Organizations lack clear starting points, initiatives stall after pilot phases, and most custom enterprise tools fail to reach production. 6 critical failures killing your AI agent projects The gap between agentic AI's promise and its reality is stark. Understanding these failure patterns is the first step toward building systems that actually work. 1. The technology-first trap MIT's research found that while 60% of organizations evaluated enterprise AI tools, only 5% reached production (pg.6)—a clear sign that businesses struggle to move from exploration to execution. Teams rush to implement frameworks before defining business problems. While most organizations have moved beyond ad hoc approaches ( down from 19% to 6% , according to IBM), they've replaced chaos with structured complexity that still misses the mark. Meanwhile, one in four companies taking a true "AI-first" approach—starting with business problems rather than technical capabilities—report transformative results. The difference has less to do with technical sophistication and more about strategic clarity. 2. The capability reality gap Carnegie Mellon's TheAgentCompany benchmark exposed the uncomfortable truth: Even our best AI agents would make terrible employees . The best AI model (Claude 3.5 Sonnet) completes only 24% of office tasks , with 34.4% success when given partial credit . Agents struggle with basic obstacles, such as pop-up windows, which humans navigate instinctively. More concerning, when faced with challenges, some agents resort to deception , like renaming existing users instead of admitting they can't find the right person. These issues demonstrate fundamental reasoning gaps that make autonomous deployment dangerous in real business environments, rather than just technical limitations. 3. Leadership vacuum The disconnect is glaring: Fewer than 30% of companies report CEO sponsorship of the AI agenda despite 70% of executives saying agentic AI is important to their future . This leadership vacuum creates cascading failures—AI initiatives fragment into departmental experiments, lack authority to drive organizational change, and can't break through silos to access necessary resources. Contrast this with Moderna, where CEO buy-in drove the deployment of 750+ AI agents and radical restructuring of HR and IT departments. As with the early waves of Big Data, data science, then machine learning adoption, leadership buy-in is the deciding factor for the survival of generative AI initiatives. 4. Security and governance barriers Organizations are paralyzed by a governance paradox: 92% believe governance is essential, but only 44% have policies (SailPoint, 2025). The result is predictable—80% experienced AI acting outside intended boundaries, with top concerns including privileged data access (60%), unintended actions (58%), and sharing privileged data (57%). Without clear ethical guidelines, audit trails, and compliance frameworks, even successful pilots can't move to production. 5. Infrastructure chaos The infrastructure gap creates a domino effect of failures. While 82% of organizations already use AI agents, 49% cite data concerns as primary adoption barriers (IBM). Data remains fragmented across systems, making it impossible to provide agents with complete context. Teams end up managing multiple databases—one for operational data, another for vector data and workloads, a third for conversation memory—each with different APIs and scaling characteristics. This complexity kills momentum before agents can actually prove value. 6. The ROI mirage The optimism-reality gap is staggering. Nearly 80% of companies report no material earnings impact from gen AI (McKinsey), while 62% expect 100%+ ROI from deployment (PagerDuty). Companies measure activity (number of agents deployed) rather than outcomes (business value created). Without clear success metrics defined upfront, even successful implementations look like expensive experiments. The AI development paradigm shift: from data-first to product-first There's been a fundamental shift in how successful teams approach agentic AI development, and it mirrors what Shawn Wang (Swyx) observed in his influential " Rise of the AI Engineer " post about the broader generative AI space. The old way: data → model → product In the traditional paradigm practiced during the early years of machine learning, teams would spend months architecting datasets, labeling training data, and preparing for model pre-training. Only after training custom models from scratch could they finally incorporate these into product features. The trade-offs were severe: massive upfront investment, long development cycles, high computational costs, and brittle models with narrow capabilities. This sequential process created high barriers to entry—only organizations with substantial ML expertise and resources could deploy AI features. Figure 1. The Data → Model → Product Lifecycle. Traditional AI development required months of data preparation and model training before shipping products. The new way: product → data → model The emergence of foundation models changed everything. Figure 2. The Product → Data → Model Lifecycle. Foundation model APIs flipped the traditional cycle, enabling rapid experimentation before data and model optimization. Powerful LLMs became commoditized through providers like OpenAI and Anthropic. Now, teams could: Start with the product vision and customer need. Identify what data would enhance it (examples, knowledge bases, RAG content). Select the appropriate model that could process that data effectively. This enabled zero-shot and few-shot capabilities via simple API calls. Teams could build MVPs in days, define their data requirements based on actual use cases, then select and swap models based on performance needs. Developers now ship experiments quickly, gather insights to improve data (for RAG and evaluation), then fine-tune only when necessary. This democratized cutting-edge AI to all developers, not just those with specialized ML backgrounds. The agentic evolution: product → agent → data → model But for agentic systems, there's an even more important insight: Agent design sits between product and data. Figure 3. The Product → Agent → Data → Model Lifecycle. Agent design now sits between product and data, determining downstream requirements for knowledge, tools, and model selection. Now, teams follow this progression: Product: Define the user problem and success metrics. Agent: Design agent capabilities, workflows, and behaviors. Data: Determine what knowledge, examples, and context the agent needs. Model: Select external providers and optimize prompts for your data. With external model providers, the "model" phase is really about selection and integration rather than deployment. Teams choose which provider's models best handle their data and use case, then build the orchestration layer to manage API calls, handle failures, and optimize costs. The agent layer shapes everything downstream—determining what data is needed (knowledge bases, examples, feedback loops), what tools are required (search, calculation, code execution), and ultimately, which external models can execute the design effectively. This evolution means teams can start with a clear user problem, design an agent to solve it, identify necessary data, and then select appropriate models—rather than starting with data and hoping to find a use case. This is why the canvas framework follows this exact flow. The canvas framework: A systematic approach to building AI agents Rather than jumping straight into technical implementation, successful teams use structured planning frameworks. Think of them as "business model canvases for AI agents"—tools that help teams think through critical decisions in the right order. Two complementary frameworks directly address the common failure patterns: Figure 4. The Agentic AI Canvas Framework. A structured five-phase approach moving from business problem definition through POC, prototype, production canvas, and production agent deployment. Please see the “Resources” section at the end for links to the corresponding templates, hosted in the gen AI Showcase. Canvas #1 - The POC canvas for validating your agent idea The POC canvas implements the product → agent → data → model flow through eight focused squares designed for rapid validation: Figure 5. The Agent POC Canvas V1. Eight focused squares implementing the product → agent → data → model flow for rapid validation of AI agent concepts. Phase 1: Product validation—who needs this and why? Before building anything, you must validate that a real problem exists and that users actually want an AI agent solution. This phase prevents the common mistake of building impressive technology that nobody needs. If you can't clearly articulate who will use this and why they'll prefer it to current methods, stop here. table, th, td { border: 1px solid black; border-collapse: collapse; } th, td { padding: 5px; } Square Purpose Key Questions Product vision & user problem Define the business problem and establish why an agent is the right solution. Core problem: What specific workflow frustrates users today? Target users: Who experiences this pain and how often? Success vision: What would success look like for users? Value hypothesis: Why would users prefer an agent to current solutions? User validation & interaction User Validation & Interaction Map how users will engage with the agent and identify adoption barriers. User journey: What's the complete interaction from start to finish? Interface preference: How do users want to interact? Feedback mechanisms: How will you know it's working? Adoption barriers: What might prevent users from trying it? Phase 2: Agent design—what will it do and how? With a validated problem, design the agent's capabilities and behavior to solve that specific need. This phase defines the agent's boundaries, decision-making logic, and interaction style before any technical implementation. The agent design directly determines what data and models you'll need, making this the critical bridge between problem and solution. table, th, td { border: 1px solid black; border-collapse: collapse; } th, td { padding: 5px; } Square Purpose Key Questions Agent capabilities & workflow Agent Capabilities & Workflow Design what the agent must do to solve the identified problem. Core tasks: What specific actions must the agent perform? Decision logic: How should complex requests be broken down? Tool requirements: What capabilities does the agent need? Autonomy boundaries: What can it decide versus escalate? Agent interaction & memory Agent Interaction & Memory Establish communication style and context management. Conversation flow: How should the agent guide interactions? Personality and tone: What style fits the use case? Memory requirements: What context must persist? Error handling: How should confusion be managed? Phase 3: Data requirements—what knowledge does it need? Agents are only as good as their knowledge base, so identify exactly what information the agent needs to complete its tasks. This phase maps existing data sources and gaps before selecting models, ensuring you don't choose technology that can't handle your data reality. Understanding data requirements upfront prevents the costly mistake of selecting models that can't work with your actual information. table, th, td { border: 1px solid black; border-collapse: collapse; } th, td { padding: 5px; } Square Purpose Key Questions Knowledge requirements & sources Identify essential information and where to find it. Essential knowledge: What information must the agent have to complete tasks? Data sources: Where does this knowledge currently exist? Update frequency: How often does this information change? Quality requirements: What accuracy level is needed? Data collection & enhancement strategy Plan data gathering and continuous improvement. Collection strategy: How will initial data be gathered? Enhancement priority: What data has the biggest impact? Feedback loops: How will interactions improve the data? Integration method: How will data be ingested and updated? Phase 4: External model integration—which provider and how? Only after defining data needs should you select external model providers and build the integration layer. This phase tests whether available models can handle your specific data and use case while staying within budget. The focus is on prompt engineering and API orchestration rather than model deployment, reflecting how modern AI agents actually get built. table, th, td { border: 1px solid black; border-collapse: collapse; } th, td { padding: 5px; } Square Purpose Key Questions Provider selection & prompt engineering Choose external models and optimize for your use case. Provider evaluation: Which models handle your requirements best? Prompt strategy: How should you structure requests for optimal results? Context management: How should you work within token limits? Cost validation: Is this economically viable at scale? API integration & validation Build orchestration and validate performance. Integration architecture: How do you connect to providers? Response processing: How do you handle outputs? Performance testing: Does it meet requirements? Production readiness: What needs hardening? Figure 6. The Agent POC Canvas V1 (Detailed). Expanded view with specific guidance for each of the eight squares covering product validation, agent design, data requirements, and external model integration. Unified data architecture: solving the infrastructure chaos Remember the infrastructure problem—teams managing three separate databases with different APIs and scaling characteristics? This is where a unified data platform becomes critical. Agents need three types of data storage: Application database: For business data, user profiles, and transaction history Vector store: For semantic search, knowledge retrieval, and RAG Memory store: For agent context, conversation history, and learned behaviors Instead of juggling multiple systems, teams can use a unified platform like MongoDB Atlas that provides all three capabilities—flexible document storage for application data, native vector search for semantic retrieval, and rich querying for memory management—all in a single platform. This unified approach means teams can focus on prompt engineering and orchestration rather than model infrastructure, while maintaining the flexibility to evolve their data model as requirements become clearer. The data platform handles the complexity while you optimize how external models interact with your knowledge. For embeddings and search relevance, specialized models like Voyage AI can provide domain-specific understanding, particularly for technical documentation where general-purpose embeddings fall short. The combination of unified data architecture with specialized embedding models addresses the infrastructure chaos that kills projects. This unified approach means teams can focus on agent logic rather than database management, while maintaining the flexibility to evolve their data model as requirements become clearer. Canvas #2 - The production canvas for scaling your validated AI agent When a POC succeeds, the production canvas guides the transition from "it works" to "it works at scale" through 11 squares organized following the same product → agent → data → model flow, with additional operational concerns: Figure 7. The Productionize Agent Canvas V1. Eleven squares guiding the transition from validated POC to production-ready systems, addressing scale, architecture, operations, and governance. Phase 1: Product and scale planning Transform POC learnings into concrete business metrics and scale requirements for production deployment. This phase establishes the economic case for investment and defines what success looks like at scale. Without clear KPIs and growth projections, production systems become expensive experiments rather than business assets. table, th, td { border: 1px solid black; border-collapse: collapse; } th, td { padding: 5px; } Square Purpose Key Questions Business case & scale planning Translate POC validation into production metrics. Proven value: What did the POC validate? Business KPIs: What metrics measure ongoing success? Scale requirements: How many users and interactions? Growth strategy: How will usage expand over time? Production requirements & constraints Define performance standards and operational boundaries. Performance standards: Response time, availability, throughput? Reliability requirements: Recovery time and failover? Budget constraints: Cost limits and optimization targets? Security needs: Compliance and data protection requirements? Phase 2: Agent architecture Design robust systems that handle complex workflows, multiple agents, and inevitable failures without disrupting users. This phase addresses the orchestration and fault tolerance that POCs ignore but production demands. The architecture decisions here determine whether your agent can scale from 10 users to 10,000 without breaking. table, th, td { border: 1px solid black; border-collapse: collapse; } th, td { padding: 5px; } Square Purpose Key Questions Robust agent architecture Design for complex workflows and fault tolerance. Workflow orchestration: How do you manage multi-step processes? Multi-agent coordination: How do specialized agents collaborate? Fault tolerance: How do you handle failures gracefully? Update rollouts: How do you update without disruption? Production memory & context systems Implement scalable context management. Memory architecture: Session, long-term, and organizational knowledge? Context persistence: Storage and retrieval strategies? Cross-session continuity: How do you maintain user context? Memory lifecycle management: Retention, archival, and cleanup? Phase 3: Data infrastructure Build the data foundation that unifies application data, vector storage, and agent memory in a manageable platform. This phase solves the "three database problem" that kills production deployments through complexity. A unified data architecture reduces operational overhead while enabling the sophisticated retrieval and context management that production agents require. table, th, td { border: 1px solid black; border-collapse: collapse; } th, td { padding: 5px; } Square Purpose Key Questions Data architecture & management Build a unified platform for all data types. Platform architecture: Application, vector, and memory data? Data pipelines: Ingestion, processing, and updates? Quality assurance: Validation and freshness monitoring? Knowledge governance: Version control and approval workflows? Knowledge base & pipeline operations Maintain and optimize knowledge systems. Update strategy: How does knowledge evolve? Embedding approach: Which models for which content? Retrieval optimization: Search relevance and reranking? Operational monitoring: Pipeline health and costs? Phase 4: Model operations Implement strategies for managing multiple model providers, fine-tuning, and cost optimization at production scale. This phase covers API management, performance monitoring, and the continuous improvement pipeline for model performance. The focus is on orchestrating external models efficiently rather than deploying your own, including when and how to fine-tune. table, th, td { border: 1px solid black; border-collapse: collapse; } th, td { padding: 5px; } Square Purpose Key Questions Model strategy & optimization Manage providers and fine-tuning strategies. Provider selection: Which models for which tasks? Fine-tuning approach: When and how to customize? Routing logic: Base versus fine-tuned model decisions? Cost controls: Caching and intelligent routing? API management & monitoring Handle external APIs and performance tracking. API configuration: Key management and failover? Performance Tracking: Accuracy, latency, and costs? Fine-tuning pipeline: Data collection for improvement? Version control: A/B testing and rollback strategies? Phase 5: Hardening and operations Add the security, compliance, user experience, and governance layers that transform a working system into an enterprise-grade solution. This phase addresses the non-functional requirements that POCs skip but enterprises demand. Without proper hardening, even the best agents remain stuck in pilot purgatory due to security or compliance concerns. table, th, td { border: 1px solid black; border-collapse: collapse; } th, td { padding: 5px; } Square Purpose Key Questions Security & compliance Implement enterprise security and regulatory controls. Security implementation: Authentication, encryption, and access management? Access control: User and system access management? Compliance framework: Which regulations apply? Audit capabilities: Logging and retention requirements? User experience & adoption Drive usage and gather feedback. Workflow integration: How do you fit existing processes? Adoption strategy: Rollout and engagement plans? Support systems: Documentation and help channels? Feedback integration: How does user input drive improvement? Continuous improvement & governance Ensure long-term sustainability. Operational procedures: Maintenance and release cycles? Quality gates: Testing and deployment standards? Cost management: Budget monitoring and optimization? Continuity planning: Documentation and team training? Figure 8. The Productionize Agent Canvas V1 (Detailed). Expanded view with specific guidance for each of the eleven squares covering scale planning, architecture, data infrastructure, model operations, and hardening requirements. Next steps: start building AI agents that deliver ROI MIT's research found that 66% of executives want systems that learn from feedback , while 63% demand context retention (pg.14). The dividing line between AI and human preference is memory, adaptability, and learning capability. The canvas framework directly addresses the failure patterns plaguing most projects by forcing teams to answer critical questions in the right order—following the product → agent → data → model flow that successful teams have discovered. For your next agentic AI initiative: Start with the POC canvas to validate concepts quickly. Focus on user problems before technical solutions. Leverage AI tools to rapidly prototype after completing your canvas. Only scale what users actually want with the production canvas. Choose a unified data architecture to reduce complexity from day one. Remember: The goal isn't to build the most sophisticated agent possible—it's to build agents that solve real problems for real users in production environments. For hands-on guidance on memory management, check out our webinar on YouTube, which covers essential concepts and proven techniques for building memory-augmented agents. Head over to the MongoDB AI Learning Hub to learn how to build and deploy AI applications with MongoDB. Resources Download POC Canvas Template (PDF) Download Production Canvas Template (PDF) Download Combined POC + Production Canvas (Excel) - Get both canvases in a single excel file, with example prompts and blank templates. Full reference list McKinsey & Company . (2025). "Seizing the agentic AI advantage." ttps://www.mckinsey.com/capabilities/quantumblack/our-insights/seizing-the-agentic-ai-advantage MIT NANDA . (2025). "The GenAI Divide: State of AI in Business 2025." Report Gartner . (2025). "Gartner Predicts Over 40% of Agentic AI Projects Will Be Canceled by End of 2027." https://www.gartner.com/en/newsroom/press-releases/2025-06-25-gartner-predicts-over-40-percent-of-agentic-ai-projects-will-be-canceled-by-end-of-2027 IBM . (2025). "IBM Study: Businesses View AI Agents as Essential, Not Just Experimental." https://newsroom.ibm.com/2025-06-10-IBM-Study-Businesses-View-AI-Agents-as-Essential,-Not-Just-Experimental Carnegie Mellon University . (2025). "TheAgentCompany: Benchmarking LLM Agents." https://www.cs.cmu.edu/news/2025/agent-company Swyx . (2023). "The Rise of the AI Engineer." Latent Space. https://www.latent.space/p/ai-engineer SailPoint . (2025). "SailPoint research highlights rapid AI agent adoption, driving urgent need for evolved security." https://www.sailpoint.com/press-releases/sailpoint-ai-agent-adoption-report SS&C Blue Prism . (2025). "Generative AI Statistics 2025." https://www.blueprism.com/resources/blog/generative-ai-statistics-2025/ PagerDuty . (2025). "State of Digital Operations Report." https://www.pagerduty.com/newsroom/2025-state-of-digital-operations-study/ Wall Street Journal . (2024). "How Moderna Is Using AI to Reinvent Itself." https://www.wsj.com/articles/at-moderna-openais-gpts-are-changing-almost-everything-6ff4c4a5

September 23, 2025