Revolutionizing Inventory Classification with Generative AI

Humza Akhtar and Rami Pinto Prieto
July 16, 2025

In today's volatile geopolitical environment, the global automotive industry faces compounding disruptions that require a fundamental rethink of data and operations strategy. After decades of low import taxes, the return of tariffs as a tool of economic negotiations has led the global automotive industry to delay model-year transitions and disrupt traditional production and release cycles. As of June 2025, only 3% of US automotive inventory comprises next-model-year vehicles—less than half the number seen at this time in previous years.

This severe decline in new-model availability, compounded by a 12.2% year-over-year drop in overall inventory, is pressuring consumer pricing and challenging traditional dealer inventory management. In this environment of constrained supply, better tools are urgently needed to classify and control vehicle, spare part, and raw material inventories for both dealers and manufacturers.

Traditionally, dealerships and automakers have relied on ABC analysis to segment and control inventory by value. This widely used method classifies items into Category A, B, or C. For example, Category A items typically represent just 20% of stock but drive 80% of sales, while Category C items might comprise half the inventory yet contribute only 5% to the bottom line. This approach effectively helps prioritize resource allocation and promotional efforts.

Figure 1. ABC analysis for inventory classification.

Diagram showing the ABC analysis for inventory classification. At the top of the diagram are 3 boxes. Box one is category A, which accounts for 20% of total inventory and 80% of total sales. Category b is 30% of total inventory and 15% of total sales. Category c is 50% of inventory and 5% of total sales. At the bottom is a line chart showing the spend totals of each category, with A at 80%, B at 15%, and C at 5%.

While ABC analysis is known for its ease of use, it has been criticized for its focus on dollar usage. For example, not all Category C items are necessarily low-priority, as some may be next-model-year units arriving early or aging stock affected by shifting consumer preferences. Other criteria—such as lead-time, commonality, obsolescence, durability, inventory cost, and order size requirements—have also been recognized as critical for inventory classification. A multi-criteria inventory classification (MCIC) methodology, therefore, adds additional criteria to dollar usage. MCIC can be achieved with methods like statistical clustering or unsupervised machine learning techniques.

Yet, a significant blind spot remains: the vast amount of unstructured data that organizations must deal with; unstructured data accounts for an estimated 80% of the world's total.

Traditional ABC analysis—and even MCIC—often overlook the growing influence of insights gleaned from unstructured sources like customer sentiment and product reviews on digital channels. But now, valuable intelligence from reviews, social media posts, and dealer feedback can be vectorized and transformed into actionable features using large language models (LLMs). For instance, analyzing product reviews can yield qualitative metrics like the probability of recommending or repurchasing a product, or insights into customer expectations vs. the reality of ownership. This textual analysis can also reveal customers' product perspectives, directly informing future demand.

By integrating these signals into inventory classification models, businesses can gain a deeper understanding of true product value and demand elasticity. This fusion of structured and unstructured data represents a crucial shift from reactive inventory management to predictive and customer-centric decision-making. In this blog post, we propose a novel methodology to convert unstructured data into powerful feature sets for augmenting inventory classification models.

Figure 2. Transforming unstructured data into features for machine learning models.

On the left side of this diagram is a box for unstructured data, which contains things like customer text reviews, audio/video reviews, and social media mentions. This data is then transformed using LLM, and goes to features. Feature input then send the data to a box labeled re run inventory classification model.

How MongoDB enables AI-driven inventory classification

So, how does MongoDB empower the next generation of AI-driven inventory classification? It all comes down to four crucial steps, and MongoDB provides the robust technology and features to support every single one.

Figure 3. Methodology and requirements for gen AI-powered inventory classification.

Diagram showing four steps with methodology and requirements for gen AI-powered inventory classification. First step is create and store vector embeddings from unstructured data, with the requirements of an embedding model and vector database. Step two is design and store evaluation criteria with requirements of large language model and operational database. Step three is create an agentic application to perform transformation based on criteria, with the requirements of large language model, vector search, and time series database. The final step is re run the inventory classification model, with the requirements of classification model and time series database.

Step 1: Create and store vector embeddings from unstructured data

MongoDB Atlas enables modern vector search workflows. Unstructured data like product reviews, supplier notes, or customer support transcripts can be vectorized via embedding models (such as Voyage AI models) and ingested into MongoDB Atlas, where they are stored next to the original text chunks. This data then becomes searchable using MongoDB Atlas Vector Search, which allows you to run native semantic search queries directly inside the database.

Unlike solutions that require separate databases for structured and vector data, MongoDB stores them side by side using the flexible document model, enabling unified access via one API. This reduces system complexity, technical debt, and infrastructure footprint—and allows for low-latency semantic searches.

Figure 4. Product reviews can be stored as vector embeddings in MongoDB Atlas.

At The left of this diagram is an image representing product reviews history, this connects to the embedder, which then sends data via vectors to the vectorized reviews collection. The bottom of this diagram shows what a return function would look like in code format.

Step 2: Design and store evaluation criteria

In a gen AI-powered inventory classification system, evaluation criteria are no longer a set of static rules stored in a spreadsheet. Instead, the criteria are dynamic and data-backed, and are generated via an AI agent using structured and unstructured data—and enriched by domain experts using business objectives and constraints.

As shown in Figure 5, the criteria for features like “Product Durability” can be defined based on relevant unstructured data stored in MongoDB (product reviews, audit reports) as well as structured data like inventory turnover and sales history. Such criteria are not just instructions or rules, but are knowledge objects with structure and semantic depth.

The AI agent uses tools such as generate_criteria and embed_criteria tool and iterates over each product in the inventory. It leverages the LLM to create the criteria definition and uses an embedding model (e.g., voyage-3-large) to generate embeddings of each definition.

MongoDB Atlas is uniquely suited to store these dynamic criteria. Each rule is modeled as a flexible JSON document containing the name of the feature, criteria definition, data sources use, and the embeddings. Since there are different types of products (different car models/makes and different car parts), the documents can evolve over time without requiring schema migrations and be queried and retrieved by the AI agent in real time. MongodB Atlas provides all the necessary tools for this design—a flexible document model database, vector search, and full search tools—that can be leveraged by the AI agent to create the criteria.

Figure 5. Unstructured and structured data are used by the AI agent to create criteria for feature generation.

The top left of this diagram includes the business objects, which flow into the experts domain knowledge, which connects to the AI agent. On the middle left is structured and unstructured data, which also connects to the AI agent. The AI agent then sends data to the criteria folder, as well as to the LLM, tools, and embedder.

Step 3: Create an agentic application to perform transformation based on the criteria

In the third step, we have another AI agent that operates over products, criteria, and unstructured data to generate enriched feature sets. This agent iterates over every product and uses MongoDB Atlas Vector Search to find relevant customer reviews to apply the criteria to and calculate a numerical feature score. The new features are added to the original features JSON document in MongoDB. In Figure 6, the agent has created “durability” and “criticality” features from the product reviews.

MongoDB Atlas is the ideal foundation for this agentic architecture. Again, it provides the agent the tools it needs for features to evolve, adding new dimensions without requiring schema redesign. This results in an adaptive classification dataset that contains both structured and unstructured data.

Figure 6. An AI agent enriches product features with vectorized review data to generate new features.

On the top left of this diagram are product and criteria folders, which connect to the AI agent. On the bottom left are the vectorized review, which flow through vector search to connect to the AI agent. From the agent, data flows to support traditional quantitative features and quantitative + gen AI extracted qualitative features.

Step 4: Rerun the inventory classification model with new features added

As a final step, the inventory classification domain experts can assign or balance weights to existing and new features, choose a classification technique, and rerun inventory classification to find new inventory classes.

Figure 7 shows the process where generative AI features are used in the existing inventory classification algorithm.

Figure 7. Domain experts can rerun classification after balancing weights.

At the top of this diagram are three separate areas. The first is quantitative + qualitative features, these flow to selected features, which then go to new inventory classification. From each component, lines go to the bottom of the diagram, which is the experts domain knowledge and business objectives.

Figure 8 shows the solution in action. The customer satisfaction score is created by LLM a using customer reviews vectorized collection and then utilized in the inventory classification model with a new weight of 0.2.

Figure 8. Inventory classification using generative AI.

Screen grab of the MongoDB inventory optimization dashboard.

Driving smarter inventory decisions

As the automotive industry navigates slowing sales and uneven inventory, traditional inventory classification techniques also need to evolve. Though such techniques provide a solid foundation, they fall short in the face of geopolitical uncertainty, tariff-driven supply shifts, and fast-evolving consumer expectations.

By combining structured sales and consumption data with unstructured insights, and enabling agentic AI using MongoDB, the automotive industry can enable a new era of inventory intelligence where products are dynamically classified based on all available data—both structured and unstructured.

Clone the GitHub repository if you are interested in trying out this solution yourself. To learn more about MongoDB’s role in the manufacturing industry, please visit our manufacturing and automotive webpage.

← Previous

Introducing MongoDB’s Multimodal Search Library For Python

AI applications increasingly rely on a variety of different data types—text, images, charts, and complex documents—to drive rich user experiences. For developers building these applications, determining how to effectively search and retrieve information that spans these data types presents a challenge. Developers have to consider different chunking strategies, figure out how to incorporate figures and tables, and manage context that could bleed across chunks. To simplify this, we're excited to announce the public preview of MongoDB’s Multimodal Search Python Library . This new library makes it easy to build sophisticated applications using multimodal data, providing a single interface for integrating MongoDB Atlas Vector Search , AWS S3, and Voyage AI's multimodal embedding model voyage-multimodal-3 . The library handles: Processing and storage: It interacts with S3 for storing PDFs from a URL or referring to a PDF already stored in S3. PDFs are then turned into single-page images and stored in S3. Generating embeddings: Images use voyage-multimodal-3 to produce high-quality embeddings. Vector indexing: Finally, it indexes the embeddings using Atlas Vector Search and provides a reference back to S3. The power of multimodal Traditional search methods often struggle when dealing with documents that contain text alongside visual elements like charts and graphs, which are common in research papers, financial reports, and more. Developers typically need to build complex, custom pipelines to handle image storage, embedding generation, and vector indexing. Our Multimodal Search Library abstracts this complexity away, using the best-in-class voyage-multimodal-3. It empowers developers to build applications that can understand and search the content of images just as easily as text. This enables accurate and efficient information retrieval and richer user experiences when working with either multimodal data or PDFs with visually rich documents. Figure 1. Traditional chunking vs. multimodal embedding. Imagine you're a financial analyst sifting through hundreds of annual reports—dense PDFs filled with text, tables, and charts—to find a specific trend. With our Multimodal Search Library, you can simply ask a question in natural language, like: " Show me all the charts illustrating revenue growth over the past three years ." The library will process the query and retrieve pages containing the relevant charts from your corpus of knowledge. Likewise, consider an e-commerce platform with a large product catalog. A shopper might be looking for a specific style of shoes but may not know the right keywords to describe exactly what they are looking for. By leveraging multimodal search, the user could upload an image of the shoes they like, and the application finds visually similar in-stock items, creating a seamless product discovery journey. Learn how to get started To get started, you’ll need: A MongoDB Atlas cluster ( sign up for the free tier) A MongoDB collection in that cluster A MongoDB Atlas Vector Search index A Voyage AI API key ( sign up ) An S3 bucket ( sign up ) Installation and setup First, we’ll ensure that we can connect to MongoDB Atlas, AWS S3, and Voyage AI. pip install pymongo-voyageai-multimodal import os from pymongo import MongoClient from pymongo_voyageai_multimodal import PyMongoVoyageAI client = PyMongoVoyageAI.from_connection_string( connection_string=os.environ["MONGODB_ATLAS_CONNECTION_STRING"], database_name="db_name", collection_name="collection_name", s3_bucket_name=os.environ["S3_BUCKET_NAME"], voyageai_api_key=os.environ["VOYAGEAI_API_KEY"], ) Adding documents Next, we’ll add relevant documents for embedding generation. from pymongo_voyageai_multimodal import TextDocument, ImageDocument text = TextDocument(text="foo", metadata={"baz": "bar"}) images = client.url_to_images( "https://www.fdrlibrary.org/documents/356632/390886/readingcopy.pdf" ) documents = [text, images[0], images[1]] ids = ["1", "2", "3"] client.add_documents(documents=documents, ids=ids) Performing search Finally, we’ll search for content most semantically similar to our query. results = client.similarity_search(query="example", k=1) for doc in results: print(f"* {doc['id']} [{doc['inputs']}]") Loading data already stored in S3 Developers can also query against documents already stored in S3. See more information in the documentation . import os from pymongo_voyageai_multimodal import PyMongoVoyageAI client = PyMongoVoyageAI( voyageai_api_key=os.environ["VOYAGEAI_API_KEY"], s3_bucket_name=os.environ["S3_BUCKET_NAME"], mongo_connection_string=os.environ["MONGODB_URI"], collection_name="test", database_name="test_db", ) query = "The consequences of a dictator's peace" url = "s3://my-bucket-name/readingcopy.pdf" images = client.url_to_images(url) resp = client.add_documents(images) client.wait_for_indexing() data = client.similarity_search(query, extract_images=True) print(f"Found {len(data)} relevant pages") client.close() A few important notes: Automatic updates to source data are not supported. Changes to indexed data need to be made via application code calling the client using the add_documents and delete functions. This library is primarily meant to support integrating multimodal embeddings and MongoDB Atlas on relatively static datasets. It is not intended to support sophisticated aggregation pipelines that combine multiple stages or data that updates frequently. voyage-multimodal-3 is the only embedding model supported directly, and AWS is the only cloud provider supported directly. Ready to try it yourself? Check out the Github project today to get started. Learn more in our documentation , and please share feedback . We can't wait to see what you build!

July 16, 2025

Next →

Cars24 Improves Search For 300 Million Users With MongoDB Atlas

The Indian multinational online car marketplace Cars24 serves 300 million users globally. The company offers services that span sales, insurance, maintenance, financing, and more, reshaping the entire car ownership journey. Speaking at MongoDB .local Bengaluru in July 2025 , Pradeep Sharma, Head of Technology at Cars24, shared how MongoDB has been a key driver of Car24’s digital transformation journey. Specifically, he highlighted two recent use cases that show how MongoDB Atlas has helped Cars24 scale, improve its search capabilities, and reduce its architectural complexity. Matching the growing scale with simplified and expanded search Cars24 has operations in multiple countries, and a diverse customer base. Over the years, the company has used customer data, behavior analytics, and operational workflows to build, evolving from being a platform for buying and selling cars, to an end-to-end ecosystem, supported by a hub of interconnected systems. At the start of its journey, Cars24 relied on legacy databases for managing and searching data, such as Postgres. Their relational database set-up would store information, synchronize the data to a separate “bolt-on” search engine (such as Elasticsearch), manually indexing it, and then querying the index. While initially effective for a small application ecosystem, these processes became bottlenecked as the organization’s services grew. Multiple engineering teams piped data into a single search index, which often resulted in synchronization challenges and overwhelming administrative overhead. Cars24 faced three core limitations with this setup: Lower developer productivity: Exponential effort was spent maintaining pipelines and synchronizing procedures. Developers had little bandwidth for building business features or innovation. Architectural complexity: Ensuring data sync consistency required multiple pipelines and race logic. This led to inefficiencies in real-time dashboard updates for agents. Operational overhead: Maintaining separate systems for database and search—alongside provisioning, patching, scaling, and monitoring—strained resources. Seeking an integrated approach, Cars24 embraced MongoDB Atlas, hosted on Google Cloud . MongoDB Atlas would serve as a single, consistent, modern database and embedded search solution, powered by Apache Lucene. MongoDB Atlas Search also enabled Cars24 to run queries directly in the database. This eliminated the need to synchronise data between systems while delivering real-time results. This unified approach allowed the company’s developers to transition from managing complex synchronization mechanisms to building applications. Furthermore, the reduced administrative overhead enabled Cars24 to consolidate the team’s efforts, and to streamline query execution across the ecosystem. Thanks to MongoDB Atlas and MongoDB Atlas Search, Cars24 was able to: Avoid "synchronization tax”: Switching to MongoDB Atlas eliminated the need for data synchronization and the additional tooling this mandated. Real-time searches can be performed from a single interface and workflow. Deliver new search features faster: By using a single, unified API across database and search operations, new features can be delivered rapidly. Work with a fully managed platform: With MongoDB Atlas, Cars24’s engineers can focus more on application development and building products, rather than thinking about managing indexes, syncing, and more. Following this successful migration, Cars24 decided to also use MongoDB Atlas to replace one of its legacy databases, ArangoDB. The switch to MongoDB Atlas eliminated major roadblocks for other critical search capabilities. From ArangoDB to MongoDB: Streamlined operations and 50% cost savings As Cars24 scaled new services globally, it encountered limitations with its geospatial search solution, which was based on ArangoDB. This included performance bottlenecks, weak transactions as it was difficult to guarantee consistent data operations, and a limited ecosystem which meant that scaling developer onboarding and troubleshooting became increasingly onerous. Moving to MongoDB Atlas enabled Cars24 to transition its geospatial services, consolidating its data storage and search capabilities under a single, versatile platform. “We now have a highly available architecture, and an amazing team at MongoDB that has our back,” said Sharma. MongoDB offered a proven architecture for high availability, scalability, and real-world production readiness: Enhanced scalability: MongoDB’s ability to scale massive workloads supports Cars24’s growing global presence. Reliable transactions: MongoDB provides robust multi-document ACID transactions across shards, meeting mission-critical needs. Streamlined operations: MongoDB offers a single platform that is not limited to a database only. By consolidating its geospatial search workload under MongoDB, Cars24 has reduced maintenance and operational overhead. Not only did Cars24 cut costs in half by moving to MongoDB, but the widespread market adoption of MongoDB Atlas also means that Cars24 can continue to rapidly onboard developers familiar with MongoDB, a recruiting priority for Cars24’s growing development team. “To give you an idea, one of our business units had a developer team of less than 10 about a year ago. Now they are a triple-digit team,” said Sharma. “If we are going to keep introducing new developers, for a product coming up or scaling up, it becomes very important to focus on the community skills and support provided by our technology partner.” “Now that we have moved from ArangoDB to MongoDB Atlas, our developers are the happiest,” he added. Cars24 is now looking to consolidate even more of its application and data workflows under MongoDB Atlas. With the growing number of developers joining Cars24’s engineering teams, plans are to utilize MongoDB Atlas further to enhance productivity, scalability, and data-driven insights. Visit the MongoDB Atlas Learning Hub to learn more about Atlas. To learn more about MongoDB Atlas Search, visit our product page .

October 12, 2025