Boost the Accuracy of E-commerce Search Results with Atlas Vector Search

Vittal Pai, Francesco Baldissera, and Ashwin Gangadhar
October 11, 2023 | Updated: February 14, 2024

Artificial Intelligence’s (AI) growth has led to transformative advancements in the retail industry, including natural language processing, image recognition, and data analysis. These capabilities are pivotal to enhancing the efficiency and accuracy of e-commerce search results.

E-commerce, characterized by its vast product catalogs and diverse customer base, generates enormous amounts of data every day. From user preferences and search histories to product reviews and purchase patterns — and add to that images, video, and audio associated with product campaigns and user search — the data is both a goldmine and a challenge.

Traditional search mechanisms, which rely on exact keyword matches, are inadequate at handling such nuanced and voluminous data. This is where vector search comes into play as the perfect data mining tool.

As a sophisticated search mechanism, it leverages AI-driven algorithms to understand the intrinsic relationships between data points. This enables it to discern complex patterns, similarities, and contexts that conventional keyword-based searches might overlook.

Let’s dig deeper into the differences between traditional keyword matching search and vector search, and answer questions like: What type of queries does vector search improve in the retail search landscape? What are the challenges associated with it? And how can your business tap into the competitive advantage it represents?

Check out our AI resource page to learn more about building AI-powered apps with MongoDB.

Traditional Keyword Matching vs. Vector Search

Traditional search functionalities for e-commerce platforms — keyword matching, typo tolerance, autocomplete, highlighting, facets, and scoring — are often built in-house or implemented on top of typical search engines like Apache Lucene, AtlasSearch, or ElasticSearch, relying heavily on metadata textual descriptions.

While this has served the industry well for years, it often falls short of understanding the nuanced needs of modern consumers. For instance, a customer might be looking for a "blue floral summer dress," but if the product description lacks these terms, it might not appear in the search results, even if it perfectly matches the visual description.

Graph displaying higher accuracy contextual search results. Bottom of graph reads: By computing the embeddings of the query and defining a distance metric, we can simply look up the nearest documents to our query. — Figure 1: As embeddings encode numerically the meaning of documents, semantically close documents will be geometrically close as well.

Vector search is a method that finds similar items in a dataset based on their vector representations, and offers a more efficient and accurate way to sift through large datasets. Instead of relying on exact matches, it uses mathematical techniques to measure the similarity between vectors, allowing it to retrieve items that are semantically similar to the user's query, even if the query and the item descriptions don't contain exact keyword matches.

Chart displaying the Vector Search Data Workflow. Data starts with the client and is then either written or read, as plain text, to the encoder. From there it flows into the vector and is stores as new field in documents. Finally, it is sent to the Vector Search Engine, which flows information back to the client. — Figure 2: Data flow diagram showcasing how applications, vector embedding algorithms, and search engines work together at a high level.

One great thing about Vector search is that by encoding any type of data, i.e. text, images or sound, you can perform queries on top of that, creating a much more comprehensive way of improving the relevance of your search results.

Let’s explore examples of queries that involve context, intent, and similarity.

Visual similarity queries

Query: "Find lipsticks in shades similar to this coral lipstick."

Vector Search Benefit: Vector search can recognize the color tone and undertones of the specified lipstick and suggest similar shades from the same or different brands.

Data type: image or text

Contextual queries

Query: "Affordable running shoes for beginners."

Vector Search Benefit: Vector search can consider both the price range and the context of "beginners," leading to relevant shoe suggestions tailored to the user's experience level and budget.

Data type: text, audio (voice)

Natural language queries

Query: "Show me wireless noise-canceling headphones under $100."

Vector Search Benefit: Capture intent. Vector search can parse the query's intent to filter headphones with specific features (wireless, noise-canceling) and a price constraint, offering products that precisely match the request.

Data type: text, audio (voice)

Complementary product queries

Query: "Match this dress with elegant heels and a clutch."

Vector Search Benefit: Vector search can comprehend the user's request to create a coordinated outfit by suggesting shoes and accessories that complement the selected dress.

Data type: text, audio (voice), image

Challenging landscape, flexible stack

Now that we've explored different queries and their associated data types that could be used in vector embeddings for search, we can see how much more information can be used to deliver more accurate results and fuel growth.

Let’s consider some of the challenges associated with a vector search solution data workflow and how MongoDB Atlas Vector Search helps bridge the gap between challenges and opportunities.

Data overload

The sheer volume of products and user-generated data can be overwhelming, making it challenging to offer relevant search results. By embedding different types of data inputs like images, audio (voice), and text queries for later use with vector search, we can simplify this workload.

Storing your vector encoding in the same shared operational data layer your applications are built on top of, but also generating search indexes based on those vectors, makes it simple to add context to your application search functionalities.

Using Atlas Vector Search combined with MongoDB App Services, you can reduce operational overhead by creating a trigger that could “see” when a new document is created in your collections and automatically make the call to the embedding API of your preference, pushing the document to it and storing the retrieved embedding data in the same document stored in your collection.

Image split in half with a example Atlas Vector Search query on the left and explainer text on the right. The text reads: In an Atlas Search index definition, simply add a field of type knnVector to let Atlas Search expect a vector. Also define the distance metric. Supported distance metrics: euclidean, cosine, and dotProduct. — Figure 3: Storing vectors with the data simplifies the overall architecture of your application. As the number of documents or vectors grows, efficient indexing structures ensure that search performance remains reasonable.

By simply creating an index based on the embedded data field, you can leverage the optimized retrieval of the data, reduce the computational load, and accelerate its performance, especially for nearest neighbor search tasks, where the goal is to find items that are most similar to a given query.

Altogether, the combination of MongoDB Vector Search capabilities with App Services and indexing provides a robust and scalable solution to achieve real-time responsiveness. An indexed vector search database can provide rapid query results, making it suitable for applications like recommendation engines or live search interfaces.

Changing consumer behavior

Developing an effective vector search solution involves understanding the nuances of the retail domain. Retailers must consider factors like seasonality, trends, and user behavior to improve the accuracy of search results.

To overcome this challenge, retailers will need to be able to adjust their business model by categorizing their product catalogs and user data according to different criteria, for example:

Chart as an example for a retailer categorizing their product catalog and user data into different criteria. This chart separates the inputs into Marketing Channel, Customer Data, and Business data. Under the Marketing Channel, you have Marketing campaigns, engagement campaigns, content type, devices, behavioral data, click-through rate, and cost per click. Under Customer data there is customer lifetime value, cost of acquisition, RFM, demographics and personas, geographic locations, languages, and behavioral data. Finally, under business data you have seasonality of the products, conversion funnel stage, product attributes, inventory turnover, price sensitivity, available to promise, and bundle or promotion.

So as you can see all this vast amount of information can be embedded to build more comprehensive criteria for relevance, but first it needs to be properly captured and organized. This is where the value of the flexible document model comes into play.

The document model allows you to define different fields and attributes for each category of data. This can be used to capture the various categorization criteria. Retailers could also utilize embedded subdocuments to associate relevant information with products or customers. For instance, you can embed a subdocument containing marketing campaign data, engagement channels, and geographic location within products to track their performance.

As categorization criteria evolve, dynamic schema evolution allows you to add or modify fields without disrupting existing data. This flexibility easily accommodates changing business needs.

Retailers may also use embedded arrays to record purchase history for customers. Each array element can represent a transaction, including product details and purchase date, facilitating segmentation based on recency and frequency.

By embedding all these different data types, and leveraging the flexible capabilities of the document model, retailers can create a comprehensive and dynamic system that effectively categorizes data according to diverse criteria in a fast and resilient way. This enables personalized search experiences and enhanced customer engagement in the e-commerce space.

Sitting on a goldmine

Every retailer worldwide now realizes that with their customer data, they are sitting on a goldmine. Using the proper enabling technologies would allow them to build better experiences for their customers while infusing their applications with automated, data-driven decision-making.

Retailers offering more intuitive and contextual search results can ensure their customers find what they're looking for by personalizing the relevance of their search results, enhancing satisfaction, and increasing the likelihood of successful transactions.

The future of e-commerce search lies in harnessing the power of technologies like Atlas Vector Search, as it’s not only another vector search database, but also an extended product for the developer data platform, providing them with an integrated set of data and application services.

For retailers, the message is clear: to offer unparalleled shopping experiences, embracing and integrating vector search functionalities with a performant and reliant platform that simplifies your data organization and storage is not just beneficial, it's essential.

Learn more and discover How to Implement Databricks Workflows and Atlas Vector Search for Enhanced E-commerce Search Accuracy with our developer guide, and check out our GitHub repository explaining the full code for deploying an AI-Enhanced e-commerce search solution

← Previous

Multi-Cloud Data Resilience with MongoDB Atlas

MongoDB Atlas is architected to ensure that data remains safe and secure at all times, offering automated database resilience against hardware failures and regional outages. One of the reasons why MongoDB Atlas is able to provide high levels of resilience and availability is because it's the only developer data platform that's available on all three major public cloud platforms, AWS, Microsoft Azure, and Google Cloud. In fact, deploying Atlas to multiple cloud providers or regions is a simple matter of choosing how many nodes you want to deploy and in which cloud providers. And, once you're in the cloud, you can span databases cross-cloud without having to worry about setting up complicated ETL processes, so you're never locked into one cloud provider or running the risk of concentrating all your data in a single location. By utilizing Atlas to distribute data across multiple clouds, businesses can quickly and painlessly achieve high service levels for critical applications with virtually no latency. In the event of an outage, the self-healing process kicks in — automatically electing a secondary member to take the reins within seconds without operations to the database being affected — all without any manual intervention. Having access to multiple regions across the world provides businesses with the flexibility to adhere to data sovereignty requirements without needing to compromise on availability; cloud providers that offer just one region in a specific geography can leave users vulnerable to system disruptions, but multi-cloud clusters enable organizations to deploy additional nodes in regions not provided by their primary cloud provider. You also have the option, should you need it, of using another cloud provider in countries where a provider may only have one data center in a given region. Geo-resilience in Atlas MongoDB Atlas puts you in control of where your data is stored, with more than 110 regions across AWS, Google Cloud, and Microsoft Azure — to ensure your managed databases are close enough to your application servers for fast response times. Atlas is designed to ensure maximum uptime, no matter which region or cloud provider you're using. It provides built-in geographic resilience for multi-zone, multi-region, and multi-cloud clusters with built-in data resilience features: Atlas takes proactive measures to ensure the resilience of single-region clusters, automatically distributing replica set members across different cloud availability zones for maximum protection and maximum uptime. Atlas helps businesses stay resilient in the face of regional failure by leveraging multi-region clusters to replicate data across geographic boundaries and keep operations running smoothly. For added protection and peace of mind, multi-cloud clusters provide the perfect solution to address cloud provider failure and ensure data replication across multiple clouds. With AWS Availability Zones, Google Regions and Zones, and Azure Availability Zones, each independent zone is made up of one or more discrete data centers, all equipped with redundant power, and networking. A cloud region refers to the actual geographical site within a cloud service provider's infrastructure where a cluster or replica set is deployed. Regardless of the number of zones present in a cloud region, MongoDB Atlas will always deploy replica sets with at least three members to ensure the highest levels of availability and data durability. Atlas gives you complete flexibility when it comes to configuring multi-region protection. A multi-region cluster can be hosted in multiple regions within a single cloud provider or multiple regions across multiple cloud providers. Cloud provider disruptions can take various forms, from relatively minor capacity constraints to devastating outages that can wreak havoc on your application deployments. To reduce the risk of a major outage, organizations should consider the benefits of distributing their data across multiple clouds to maximize database and application resilience. With multi-cloud clusters, you can easily access the powerful and unique tools and services within AWS, Google Cloud, and Azure — giving you global reach, low-latency performance, regional data security, and resilient data replication and migration. Plus, Atlas takes the hassle out of the equation, automatically distributing your data across clouds for maximum fault tolerance and giving you the freedom to explore cross-cloud migration options at any time. Multi-cloud clusters provide the same features as single-cloud, multi-region clusters — such as continuous cloud backups, automated data tiering, and workload isolation for data analytics and visualization — but they also offer the added advantage of increased cross-cloud resilience. For more information on multi-cloud features in MongoDB Atlas, download our Data Resilience Strategy with MongoDB Atlas whitepaper. Find out more about deploying multi-cloud clusters from our documentation.

October 10, 2023

Next →

Building Modern Applications Faster: New Capabilities at MongoDB.local NYC 2024

Today, we kicked off MongoDB.local NYC and unveiled new capabilities across our developer data platform. The updates and capabilities announced today pave the way for a new era of app modernization and will allow developers to unleash the full potential of transformative technology like AI. Here’s an overview of our announcements, from a comprehensive update to MongoDB to AI-powered intelligent developer experiences: This post is also available in: Deutsch , Français , Español , Português , Italiano , 한국어 , 简体中文 . Modern applications need a modern database Cutting-edge modern applications must deliver both an exceptional experience and additional revenue. To meet these demands, developers require a database solution that offers optimal performance, scale, and operational resilience—while maintaining cost efficiency. So today, we’re thrilled to announce the preview of MongoDB 8.0 —the next evolution of MongoDB’s modern database. MongoDB 8.0 is focused on delivering unparalleled performance, scalability, security, and operational resilience to support the creation of next-generation applications, including sophisticated AI-driven solutions. It provides optimal performance by dramatically increasing query performance, improving resilience during periods of heavy load, making scalability easier and more cost-effective, and making time series collections faster and more efficient. Modernizing your next application with MongoDB is now easier As application modernization projects gain momentum, migrations are becoming a pressing reality for development and database teams. Transitioning from legacy relational systems to modern databases like MongoDB is essential to keeping up with technological shifts like AI. However, modernization and migrations have many challenges, from converting complex schemas and translating large amounts of application code to keeping databases in sync during long modernization projects. Announced in June 2023, MongoDB Relational Migrator streamlines the migration process by automating tasks like schema design, data migrations, and application code generation. Maintaining data synchronization is paramount in long-running modernization projects—where legacy relational databases must coexist with MongoDB until the project is complete. Today, we are pleased to announce that MongoDB Relational Migrator is now integrated with Confluent Cloud to support long-running change data capture (CDC) sync jobs. These jobs ensure operational resilience and observability, addressing the complexities of phased transitions without the added burden of managing Apache Kafka independently. Furthermore, migrating from legacy relational databases often involves significant effort in rewriting SQL queries, stored procedures, and triggers, which has traditionally been time-consuming and difficult. Now available in public preview, an AI-powered SQL Query Converter Tool has been introduced to MongoDB Relational Migrator that automates the process of converting existing SQL queries, stored procedures, and triggers to work with MongoDB in languages like JavaScript, Java, or C#. This streamlined approach—paired with MongoDB professional services—enables a simplified migration process that can scale effectively. Helping developers build faster with confidence on MongoDB We recognize the vital role that developers play in the success of every project, which is why we’re dedicated to making their MongoDB experience as seamless as possible. Frameworks are a great way for developers to boost productivity, improve code consistency and quality, and ultimately deliver code faster. For the C# developer community, we are pleased to announce that the MongoDB Provider for Entity Framework Core (EF Core) is now generally available . This allows C# developers building with EF Core to unlock the full power of MongoDB's developer data platform—while still using the EF Core APIs and design patterns they already know and love. And, recognizing the needs of the PHP community, we’re also proud to introduce the Laravel Aggregation Builder . This feature simplifies the process of building complex aggregation queries within Laravel, the most popular framework among PHP developers. By enhancing the integration of MongoDB with Laravel, we aim to boost productivity and ease the complexity of query operations, ensuring PHP developers can also enjoy an optimized development experience with MongoDB. Generating queries and visualizations with AI Since its initial release in 2015, MongoDB Compass has helped developers quickly build and debug queries and aggregations for their application code. Today, MongoDB Compass introduces an AI-powered, natural language query experience , making it even easier for developers to use MongoDB’s powerful Query API. Now generally available, this feature lets developers use natural language to generate executable MongoDB Query API syntax for everything from simple queries to sophisticated aggregations through an intelligent and guided experience. For example, a developer can input "Filter vacation rentals by location, group the remaining documents by number of bedrooms, and calculate the average nightly rental price," MongoDB Compass will suggest code to execute the stages of the aggregation pipeline. Data visualizations are a powerful way of understanding application data, and embedding charts into user-facing applications further enhances their utility and appeal to developers. However, creating visualizations is often hampered by the need for in-depth knowledge of the dataset and proficiency in using business intelligence tools—skills that many developers may not have. Now available in public preview, we introduced an easy-to-use visualization tool with generative AI capabilities in MongoDB Atlas Charts . Using natural language prompts, developers can easily render charts and build dashboards, making visualizing data and enriching their apps simple and fast. For example, developers can input ‘Show me the list of movies released in the last year sorted by genre,’ and MongoDB Atlas Charts will gather data and quickly generate the requested visualization. Today’s announcements underscore MongoDB’s commitment to helping developers innovate quickly and easily. For more about the MongoDB.local NYC 2024 updates, check out the product announcements page on our website.

May 2, 2024