Introducing voyage-3.5 and voyage-3.5-lite: Improved Quality for a New Retrieval Frontier
May 20, 2025 | Updated: August 18, 2025
TL;DR – We’re excited to introduce voyage-3.5 and voyage-3.5-lite, the latest generation of our embedding models. These models offer improved retrieval quality over voyage-3 and voyage-3-lite at the same price, setting a new frontier for price-performance. Both models support embeddings in 2048, 1024, 512, and 256 dimensions, with multiple quantization options enabled by Matryoshka learning and quantization-aware training. voyage-3.5 and voyage-3.5-lite outperform OpenAI-v3-large by 8.26% and 6.34%, respectively, on average across evaluated domains, with 2.2x and 6.5x lower costs, respectively, and a 1.5x smaller embedding dimension. Compared with OpenAI-v3-large (float, 3072), voyage-3.5 (int8, 2048) and voyage-3.5-lite (int8, 2048) reduce vector database costs by 83% while achieving higher retrieval quality.
Note to readers: voyage-3.5 and voyage-3.5-lite are currently available through the Voyage AI APIs directly or through the private preview of automated text embedding in Atlas Vector Search. For access, sign up for Voyage AI or register your interest in the Atlas Vector Search private preview.
We’re excited to introduce voyage-3.5 and voyage-3.5-lite, which maintain the same sizes as their predecessors—voyage-3 and voyage-3-lite—but offer improved quality for a new retrieval frontier.
As we see in the figure below, voyage-3.5 improves retrieval quality over voyage-3 by 2.66%, and voyage-3.5-lite improves over voyage-3-lite by 4.28%—both maintaining a 32K context length and their respective price points of $0.06 and $0.02 per 1M tokens.

voyage-3.5 and voyage-3.5-lite also outperform OpenAI-v3-large by 8.26% and 6.34%, respectively, with voyage-3.5 also outperforming Cohere-v4 by 1.63%. voyage-3.5-lite achieves retrieval quality within 0.3% of Cohere-v4 at 1/6 the cost. Both models advance the cost-performance ratio of embedding models to a new state-of-the-art through an improved mixture of training data, distillation from voyage-3-large, and the use of Voyage AI rerankers.
Matryoshka embeddings and quantization: voyage-3.5 and voyage-3.5-lite support 2048-, 1024-, 512-, and 256-dimensional embeddings, enabled by Matryoshka learning, along with multiple quantization options (32-bit floating point, signed and unsigned 8-bit integer, and binary precision) while minimizing quality loss. Compared with OpenAI-v3-large (float, 3072), voyage-3.5 and voyage-3.5-lite (both int8, 2048) reduce vector database costs by 83% while outperforming it by 8.25% and 6.35%, respectively. Going further, with binary precision and 1024 dimensions, voyage-3.5 and voyage-3.5-lite cut vector database costs by 99% relative to OpenAI-v3-large (float, 3072) while still outperforming it by 3.63% and 1.29%, respectively.
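The 83% and 99% figures follow from per-vector storage arithmetic. As a quick sanity check, here is a short Python sketch; it counts raw vector bytes only and ignores index overhead and vendor-specific pricing, so treat it as an approximation:

```python
# Per-vector storage, in bytes (raw vectors only; index overhead ignored).
openai_float_3072 = 3072 * 4    # float32 = 4 bytes/dim -> 12,288 bytes
voyage_int8_2048 = 2048 * 1     # int8   = 1 byte/dim  ->  2,048 bytes
voyage_binary_1024 = 1024 // 8  # binary = 1 bit/dim   ->    128 bytes

print(f"int8, 2048 vs. float, 3072: {1 - voyage_int8_2048 / openai_float_3072:.0%} smaller")     # 83%
print(f"binary, 1024 vs. float, 3072: {1 - voyage_binary_1024 / openai_float_3072:.0%} smaller")  # 99%
```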

Evaluation details
Datasets: We evaluate on 100 datasets spanning eight domains: technical documentation, code, law, finance, web reviews, multilingual, long documents, and conversations. Each dataset consists of a corpus (e.g., technical documentation, court opinions) and queries (e.g., questions, summaries). The following table lists the datasets in the eight categories, except multilingual, which includes 62 datasets covering 26 languages. A list of all evaluation datasets is available in this spreadsheet.
| Category | Descriptions | Datasets |
| --- | --- | --- |
| TECH | Technical documentation | Cohere, 5G, OneSignal, LangChain, PyTorch |
| CODE | Code snippets, docstrings | LeetCodeCPP-rtl, LeetCodeJava-rtl, LeetCodePython-rtl, HumanEval-rtl, MBPP-rtl, DS1000-referenceonly-rtl, DS1000-rtl, APPS-rtl |
| LAW | Cases, court opinions, statutes, patents | LeCaRDv2, LegalQuAD, LegalSummarization, AILA casedocs, AILA statutes |
| FINANCE | SEC filings, finance QA | RAG benchmark (Apple-10K-2022), FinanceBench, TAT-QA-rtl, Finance Alpaca, FiQA-Personal-Finance-rtl, Stock News Sentiment, ConvFinQA-rtl, FinQA-rtl, HC3 Finance |
| WEB | Reviews, forum posts, policy pages | Huffpostsports, Huffpostscience, Doordash, Health4CA |
| LONG-CONTEXT | Long documents on assorted topics: government reports, academic papers, and dialogues | NarrativeQA, QMSum, SummScreenFD, WikimQA |
| CONVERSATION | Meeting transcripts, dialogues | Dialog Sum, QA Conv, HQA |
Models: We evaluate voyage-3.5 and voyage-3.5-lite alongside several alternatives, including: OpenAI-v3 small (text-embedding-3-small) and large (text-embedding-3-large), Cohere-v4 (embed-v4.0), voyage-3-large, voyage-3, and voyage-3-lite.
Metrics: Given a query, we retrieve the top 10 documents based on cosine similarity and report the normalized discounted cumulative gain (NDCG@10), a standard metric for retrieval quality that rewards ranking relevant documents near the top.
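For readers who want to run the same kind of measurement on their own corpus, here is a minimal NDCG@10 sketch in Python with NumPy. It is illustrative rather than our evaluation harness; the array names and the 0/1 relevance labels are assumptions:

```python
import numpy as np

def ndcg_at_k(query_emb, doc_embs, relevance, k=10):
    """NDCG@k for one query. relevance: per-document labels (e.g., 0/1)."""
    # Rank documents by cosine similarity (normalize, then dot product).
    q = query_emb / np.linalg.norm(query_emb)
    d = doc_embs / np.linalg.norm(doc_embs, axis=1, keepdims=True)
    ranking = np.argsort(-(d @ q))[:k]

    # DCG@k with linear gains (identical to exponential gains for 0/1 labels).
    discounts = 1.0 / np.log2(np.arange(2, len(ranking) + 2))
    dcg = float(relevance[ranking] @ discounts)

    # Ideal DCG@k: the best achievable ordering of the same labels.
    ideal = np.sort(relevance)[::-1][:k]
    idcg = float(ideal @ (1.0 / np.log2(np.arange(2, len(ideal) + 2))))
    return dcg / idcg if idcg > 0 else 0.0

# Example: the single relevant document is ranked by the embeddings.
# ndcg_at_k(q, docs, np.array([0.0, 1.0, 0.0, 0.0]))
```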
Results
All the evaluation results are available in this spreadsheet.
Domain-specific quality: The bar charts below illustrate the average retrieval quality of voyage-3.5 and voyage-3.5-lite with full precision and 2048 dimensions, both overall and for each domain. voyage-3.5 outperforms OpenAI-v3-large, voyage-3, and Cohere-v4 by an average of 8.26%, 2.66%, and 1.63%, respectively, across domains. voyage-3.5-lite outperforms OpenAI-v3-large and voyage-3-lite by an average of 6.34% and 4.28%, respectively, across domains.

Binary rescoring: In some cases, users retrieve an initial set of documents (e.g., 100 in our evaluation) using binary embeddings and then rescore them with full-precision embeddings, as sketched below. For voyage-3.5 and voyage-3.5-lite, this binary rescoring approach yields up to 6.38% and 6.89% improvements, respectively, in retrieval quality over standard binary retrieval.
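The two-stage idea can be sketched in a few lines of Python with NumPy. This is one common way to implement binary rescoring, not necessarily the exact procedure used in our evaluation: stage one ranks packed binary document embeddings by Hamming distance to the binarized query, and stage two rescores the candidates by taking the dot product of the full-precision query embedding with the candidates' binary vectors mapped to ±1. All names here are illustrative:

```python
import numpy as np

def retrieve_with_binary_rescoring(query_float, doc_bits, k_initial=100, k_final=10):
    """query_float: (dim,) full-precision query embedding.
    doc_bits: (n_docs, dim // 8) packed binary document embeddings (np.packbits)."""
    # Stage 1: coarse retrieval by Hamming distance to the binarized query.
    query_bits = np.packbits((query_float > 0).astype(np.uint8))
    hamming = np.unpackbits(doc_bits ^ query_bits, axis=1).sum(axis=1)
    candidates = np.argsort(hamming)[:k_initial]

    # Stage 2: rescore candidates with the full-precision query embedding
    # against their binary vectors mapped from {0, 1} to {-1, +1}.
    signs = np.unpackbits(doc_bits[candidates], axis=1).astype(np.float32) * 2 - 1
    order = np.argsort(-(signs @ query_float))[:k_final]
    return candidates[order]
```

Because only packed bits are stored for the corpus, this keeps the 99% storage savings of binary embeddings while recovering much of the full-precision quality.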
Try voyage-3.5 and voyage-3.5-lite!
Interested in getting started today via the Voyage API? The first 200 million tokens are free. Visit the docs to learn more.
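As a minimal getting-started sketch with the voyageai Python package (the output_dimension and output_dtype parameters shown for requesting Matryoshka dimensions and quantized outputs are our reading of the API; consult the docs for the authoritative names and defaults):

```python
import voyageai

vo = voyageai.Client()  # reads the VOYAGE_API_KEY environment variable

docs = [
    "voyage-3.5 is a general-purpose embedding model.",
    "Binary rescoring retrieves with binary embeddings, then rescores.",
]

# Full-precision document embeddings.
result = vo.embed(docs, model="voyage-3.5", input_type="document")
print(len(result.embeddings), len(result.embeddings[0]))

# Compact variant: shorter, quantized embeddings (assumed parameter names).
compact = vo.embed(
    docs,
    model="voyage-3.5-lite",
    input_type="document",
    output_dimension=512,   # Matryoshka: 2048, 1024, 512, or 256
    output_dtype="int8",    # or "float", "uint8", "binary", "ubinary"
)
```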
Interested in using voyage-3.5 or voyage-3.5-lite alongside MongoDB Atlas? Register your interest in the private preview for automated embedding in Atlas Vector Search.