Vector Search Dimensions limited to 2048

I noticed this documentation describing the limits on vector dimensions after investigating a problem with a query.

Is this a new change to the documentation? The vector search still works with my database containing vectors of 4096 dimensions.

Will the number of dimensions supported by Atlas change or will I have to migrate to another vector database provider?

I re-embedded the 700k documents because the product is that good. I hope more dimensions will be supported in the future.

3 Likes

Hi @Daniel_Marco,

Glad you solved it. For other people reading this thread in the future: you can have an array of any size in MongoDB (although is not good practice to have unbounded arrays, or very large ones). As a vector is an array, in theory it can have any size, but the vector index used only supports 2048 dimensions max as you can read in the docs.

Thanks for posting your question and your own solution!

1 Like

@Diego_Freniche amazing, now i have to re-embed all my documents with another model with lower dimensions.

Do you have any plans on supporting 4096 dimensions?

@Daniel_Marco and @Nice_Guy thank you both for your questions, and apologies for not addressing this sooner. I’m Henry, one of the product managers supporting Atlas Vector Search, and I should be able to highlight forthcoming updates to this limit.

We hope to update our dimension limit from 2048 to 4096 within the next few weeks. We’ve seen increased demand for this recently with the high performance of Mistral-derivative embedding models (4096 dims) and OpenAI’s new text-embedding-3-large model (3072 dims by default), and are moving quickly to make this change and kick the tires on our indexing system to make sure there are no surprises at this increased scale of vector comparisons.

I will respond here when this update has landed, and hope that this context is helpful in the meantime.

4 Likes

Waiting for that too

Waiting here as well.

Thank you all for your patience. The vector dimension limit has now been increased to 4096. This will apply to all new clusters, and all clusters without maintenance windows. Those with maintenance windows will have this update applied during their next scheduled window. The updates to our docs are in progress, but this functionality is available now.

One additional caveat worth noting when working with the text-embedding-3-large model is that we only support vector comparisons between query and indexed vectors of the same dimension, so be sure to set these properly if you choose to modify the default dimensionality.

3 Likes

When setting this up in the UI, I assume we can ignore the warning “Number of vector dimensions. Value can be between 1 and 2048, both inclusive.”?

image

Hi @Will_Calderwood , thanks for raising this! You are correct, this warning should be ignored. I’ve filed a ticket to address this bug.

1 Like