Azure AI Search: Do I need to truncate dimensions when using AzureOpenAIEmbeddingSkill with Dimension parameter set?

mathias Herbaux 0 Reputation points
2025-03-05T16:26:37.3466667+00:00

Hello,

Following the documentation here: https://learn.microsoft.com/en-us/azure/search/vector-search-how-to-configure-compression-storage

we can use lower-dimension vectors if our embedding model is text-embedding-3-large or text-embedding-3-small.

My understanding might be incorrect but:

  • With the AzureOpenAIEmbeddingSkill, we can specify the number of dimensions of the vectors we want to store in our vector field.
  • And with binary quantization compression, we can set truncationDimension to truncate vectors to the desired dimension so that they match our vector field.

I'm using the AzureOpenAIEmbeddingSkill with dimensions set to 512.

My vector field is a Collection(Edm.Single) with dimensions set to 512.

I'm using binary quantization compression.
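
To make this concrete, here is my setup sketched as REST-style payloads, expressed in Python with requests purely for illustration. This is a minimal sketch: the endpoint, key, api-version, and names such as content_vector, my-bq, and my-profile are placeholders, not my real values.

```python
import requests

# Placeholders -- replace with real values. Use an API version that
# supports the skill's "dimensions" property and binary quantization
# (check the docs linked above).
SERVICE = "https://<search-service>.search.windows.net"
API_VERSION = "<api-version>"
HEADERS = {"Content-Type": "application/json", "api-key": "<admin-key>"}

# The skill: embeddings are produced at 512 dimensions directly.
# (Shown here for illustration; it belongs in the indexer's skillset
# definition, which is omitted.)
embedding_skill = {
    "@odata.type": "#Microsoft.Skills.Text.AzureOpenAIEmbeddingSkill",
    "context": "/document",
    "resourceUri": "https://<aoai-resource>.openai.azure.com",
    "deploymentId": "text-embedding-3-large",
    "modelName": "text-embedding-3-large",
    "dimensions": 512,  # the skill itself returns 512-dim vectors
    "inputs": [{"name": "text", "source": "/document/content"}],
    "outputs": [{"name": "embedding", "targetName": "content_vector"}],
}

# The index: a Collection(Edm.Single) field with matching dimensions,
# and binary quantization with no truncationDimension, since the
# vectors already arrive at 512 dimensions.
index = {
    "name": "my-index",
    "fields": [
        {"name": "id", "type": "Edm.String", "key": True},
        {
            "name": "content_vector",
            "type": "Collection(Edm.Single)",
            "dimensions": 512,
            "vectorSearchProfile": "my-profile",
            "searchable": True,
        },
    ],
    "vectorSearch": {
        "algorithms": [{"name": "my-hnsw", "kind": "hnsw"}],
        "compressions": [{"name": "my-bq", "kind": "binaryQuantization"}],
        "profiles": [
            {"name": "my-profile", "algorithm": "my-hnsw", "compression": "my-bq"}
        ],
    },
}

resp = requests.put(
    f"{SERVICE}/indexes/my-index?api-version={API_VERSION}",
    headers=HEADERS,
    json=index,
)
print(resp.status_code)
```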

I don't see the point of truncating via the binary quantization compression. In my understanding, it would only make sense if I did not set a dimension on the AzureOpenAIEmbeddingSkill, in which case the skill would return vectors with a dimension greater than 512.

Is it correct?

Azure AI Search

Accepted answer
  1. santoshkc 13,340 Reputation points Microsoft External Staff
    2025-03-10T03:43:07.65+00:00

    Hi @mathias Herbaux,

    Thank you for reaching out to Microsoft Q&A forum!

    Azure OpenAI’s text-embedding-3 models support Matryoshka Representation Learning (MRL), which allows multi-level compression for faster searches and reduced storage costs. In Azure AI Search, MRL can be used alongside binary or scalar quantization, enabling dimensionality reduction through the truncationDimension property.

    Since you're setting dimensions=512 in AzureOpenAIEmbeddingSkill, your embeddings are already produced at the reduced size, so truncation isn't needed. truncationDimension is only useful when the stored embeddings are larger than the size you want to index (by default, text-embedding-3-small outputs 1,536-dimensional vectors and text-embedding-3-large outputs 3,072). In your case, binary quantization is still beneficial for optimizing storage and query performance, but truncation isn't required.
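
    To make the contrast concrete, here is a minimal sketch of the case where truncation does matter, using REST-style payload shapes with placeholder names (truncationDimension requires an API version that supports MRL truncation, so please check the docs for the exact version): the skill omits dimensions, the field is declared at the model's full output size, and the compression truncates the indexed vectors.

    ```python
    # Sketch of the case where truncationDimension matters: no "dimensions"
    # on the skill, so text-embedding-3-large emits full-size vectors.
    embedding_skill = {
        "@odata.type": "#Microsoft.Skills.Text.AzureOpenAIEmbeddingSkill",
        "context": "/document",
        "resourceUri": "https://<aoai-resource>.openai.azure.com",
        "deploymentId": "text-embedding-3-large",
        "modelName": "text-embedding-3-large",
        # no "dimensions" -> full-size (3,072-dim) embeddings
        "inputs": [{"name": "text", "source": "/document/content"}],
        "outputs": [{"name": "embedding", "targetName": "content_vector"}],
    }

    # The field must match what the skill emits (3,072 dimensions), while
    # the compression truncates the indexed vectors down to 512.
    vector_field = {
        "name": "content_vector",
        "type": "Collection(Edm.Single)",
        "dimensions": 3072,
        "vectorSearchProfile": "my-profile",
        "searchable": True,
    }

    compression = {
        "name": "my-bq",
        "kind": "binaryQuantization",
        # must be smaller than the field's declared dimensions
        "truncationDimension": 512,
    }
    ```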

    I hope this helps! Thank you.

    1 person found this answer helpful.
