Boost Connected Car Developments with MongoDB Atlas and AWS
August 11, 2025
As vehicles continue to evolve from mechanical systems to connected, software-defined platforms, the automotive industry is continuously being reshaped by data. With modern cars generating terabytes of sensor data daily, a key challenge facing the industry is how to extract timely, actionable insight from that data. And a recent survey by McKinsey underscored the degree to which strong connectivity is important to car buyers—close to 40% of US survey respondents indicated that they are willing to switch OEMs over better connectivity options. Though connectivity preferences vary widely by country, autonomous driving and safety features are top of mind for many customers.
In such a landscape, OEMs need to offer new innovative use cases on top of the customer data. For example, one of MongoDB’s large automotive clients is combining car telemetry data with engine noise to perform faster diagnostics and maintenance services. Combining car telemetry data and Internet of Things (IoT) infrastructure with generative AI unlocks enormous potential for the auto manufacturers, from predictive maintenance and remote diagnostics to context-aware driver assistance, smart infotainment, and usage-based insurance models.
Imagine a vehicle that not only warns of a failing battery but also proactively recommends the nearest certified service center with the right parts in stock. A fleet manager might analyze driving behavior across hundreds of trucks to optimize fuel efficiency and reduce accident risks. And with improved data, manufacturers could aggregate warranty and performance data across regions to detect early signs of systemic issue, responding before small defects become expensive recalls. Insurance providers, meanwhile, might use real-time driving profiles to offer policies tailored to individual habits, replacing static risk models with dynamic pricing.
To enable such use cases, organizations require a scalable, flexible and secure data infrastructure. MongoDB Atlas not only offers a flexible document data model but also built-in time series support, high availability, geospatial indexing, and horizontal scalability to handle millions of connected vehicles and associated use cases and services. Combined with AWS services for IoT, edge processing, machine learning, and generative AI, this stack becomes a robust foundation for intelligent mobility.
This blog post explores how enterprises can build such a connected car architecture using MongoDB Atlas, Atlas Vector Search, AWS IoT Greengrass, Amazon Bedrock, and LangChain—as shown in Figure 1. We will convert raw automotive telemetry into real-time business value for drivers, technicians, and fleet managers using an example of car maintenance business workflow.

The limitations of traditional maintenance models
Vehicle maintenance still follows two basic patterns: reactive and scheduled. In the reactive model, service is initiated only after a problem has already impacted car performance. At that point, it is too late to avoid costly repairs. Scheduled maintenance is more proactive but is often inefficient, leading to unnecessary servicing that proves costly for the driver and does not reflect actual wear and usage.
The automotive sector needs to shift toward predictive and personalized care, relying on the connected car data that is being collected in real time by OEMs. But achieving this requires a cloud-native data infrastructure that can support continuous ingestion and real-time processing of this data.
From raw sensor data to driving insight
The connected vehicle data journey begins at the edge. Vehicle operational data—from engine RPM and temperature, to battery voltage, tire pressure, and onboard diagnostic codes—can be processed locally on the car using AWS IoT Greengrass, a service from AWS that enables local decision-making even without constant cloud connectivity.
From there, the data flows into AWS IoT Core and is published to Amazon MSK (Managed Streaming for Apache Kafka). Atlas Stream Processing—which ensures scalable, fault-tolerant stream processing—connects to MSK and ingests this data into MongoDB Atlas, where it is stored using a schema modeled on the Vehicle Signal Specification (VSS), a standard developed by the COVESA alliance.
VSS is a widely adopted open data model that helps normalize vehicle signals and improve interoperability, and it provides a hierarchical, standardized format for structuring vehicle data. It defines a semantic tree of signal, such as Vehicle.Speed
, Vehicle.Powertrain.Engine.RPM
, or Vehicle.Cabin.Door.FrontLeft.IsOpen
to ensure consistency and interoperability across makes, models, and applications. This consistency is critical for large-scale data analysis, cross-platform integration, and AI training.
MongoDB, an active member of the COVESA community, is particularly well-suited to implement VSS. Our document-oriented data model allows developers to store deeply nested, flexible JSON structures without enforcing rigid and normalized schemas. This is especially useful when working with evolving vehicle software platforms or optional equipment packages, trim levels, etc., that alter the signal tree. Whether a car has two doors or four, a combustion or an electric drive, MongoDB can seamlessly adapt to its VSS-defined structure without structural rework, saving time and money for the OEMs.
Once vehicle data lands in MongoDB Atlas, a series of event-driven triggers enable real-time reactions. Atlas Triggers can detect when an engine temperature exceeds safe thresholds and immediately invoke an AWS Lambda function to log the incident, notify support teams via Amazon EventBridge, or create a maintenance task in a service management system.
A strong data tiering strategy is important for connected vehicle use cases. For longer-term trend analysis, vehicle data can be exported to Amazon S3 for model training in Amazon SageMaker. These models can forecast component wear, detect behavioral anomalies, or estimate the Remaining Useful Life (RUL) of key systems. Once the model is trained, it can infer directly on the MongoDB data and feed prediction results back into the database, closing the loop.
The alerts and raw telemetry can live inside MongoDB time series collections, which are optimized for high-speed time series data storage and processing. Time series collections also come with Window Functions that enable operations on a specified span of documents or a window in time.
Empowering technicians with AI and vector search
Once an alert is raised, we can use gen AI to enhance customer and technician experience in dealing with and resolving the identified issue. Traditional diagnostic workflows involve sifting through manuals, logs, and systems of record. Now, with Amazon Bedrock and Atlas Vector Search, technicians can simply ask natural-language questions using a chat assistant embedded in a mobile or web application.
Unstructured data such as service manuals, historical record, and technical bulletins are vectorized into arrays of embeddings. These embeddings are indexed and stored in MongoDB Atlas. Once stored and indexed, the technician can query “What is the root cause of the service engine light?” and Atlas Vector Search can search through the vector embeddings and retrieve the most relevant, semantically aligned documents. These results can be fed into large language models exposed by AWS Bedrock to generate the response in a conversational language and tone.
MongoDB’s vector search capability integrates seamlessly with traditional metadata search, combining structured queries (e.g., vehicle ID, timestamp) with semantic matching. This unified approach enhances technician productivity and shortens repair cycles—resulting in positive customer engagement.
To expose this data (and these insights) to different users, we can leverage AWS AppSync as a managed GraphQL interface. Through AppSync, users can query live telemetry, view predicted maintenance needs, or trigger actions like assigning a technician or updating a vehicle’s diagnostic state—ensuring consistency between backend services and user-facing applications.
Business impact across automotive domains
The potential applications of this architecture span the entire automotive value chain.
For example, fleet operators could benefit from predictive service scheduling, improving uptime while reducing costs. Manufacturers would gain insights into failure patterns, enabling them to make data-driven decisions about component design or supplier quality. Dealerships can improve first-time fix rates with AI-guided diagnostics, while insurance companies could implement usage-based models grounded in real driving behavior. Even suppliers and logistics chains could benefit, using aggregated data to anticipate demand and optimize inventory levels.
Smart vehicles, smart connectivity
MongoDB’s high-performance, scalable database—paired with the IoT, AI, and machine learning capabilities of AWS—creates a responsive, resilient connected car platform.
As vehicles grow smarter, so too must the systems that manage their data. MongoDB’s alignment with the VSS standard ensures that automotive data remains interoperable, searchable, and AI-ready. Atlas Vector Search ensures efficient retrieval of context stored in unstructured data, and when paired with AWS services like IoT Greengrass, SageMaker, Bedrock, and AppSync, this architecture allows enterprises to scale to millions of connected vehicles with confidence. For more information on how to model data in MongoDB using VSS specification, check out our other article.
To see these concepts in action, visit our GitHub repository for a hands-on experience and detailed instructions. To learn more about MongoDB’s role in the manufacturing industry, please visit our manufacturing and automotive page.