What's new in Azure AI Search

Article
12/11/2024

Azure Cognitive Search is now Azure AI Search. Learn about the latest updates to Azure AI Search functionality, docs, and samples.

Note

Preview features are announced here, but we also maintain a preview features list so you can find them in one place.

December 2024

Item	Type	Description
RAG chat with Azure AI Search + Python	Template	An AI application template for building a RAG solution using Azure AI Search and Python.

November 2024

Item	Type	Description
Network security perimeter	Security	Join a search service to a network security perimeter to control network access to your search service. The Azure portal and the Management REST APIs in the 2024-06-01-preview can be used to view and reconcile network security perimeter configurations.
Shared private link support for Azure AI service connections	Security	Connections to Azure AI for built-in skills processing can now be private using a shared private link on the connection.
Rescoring options for compressed vectors	Relevance	You can set options to rescore with original vectors instead of compressed vectors. Applies to HNSW and exhaustive KNN vector algorithms, using binary and scalar compression. Available in the Create or Update Index (2024-11-01-preview), the Azure portal, and in the Azure SDK beta packages that provide this feature.
Store fewer vector instances	vector search	In vector compression scenarios, you can omit storage of full precision vectors if you don't need them for rescoring. Available in the Create or Update Index (2024-11-01-preview), the Azure portal, and in the Azure SDK beta packages that provide this feature.
Query rewrite in the semantic reranker	Relevance	You can set options on a semantic query to rewrite the query input into a revised or expanded query that generates more relevant results from the L2 ranker. Available in the Search Documents (2024-11-01-preview), the Azure portal, and in the Azure SDK beta packages that provide this feature.
New semantic ranker models	Relevance	Semantic ranker runs with improved models in all supported regions. There is no change to APIs or the Azure portal experience.
Document Layout skill	Applied AI (skills)	A new skill used to analyze a document for structure and provide structure-aware (paragraph) chunking. This skill calls Document Intelligence and uses the Document Intelligence layout model. Available in selected regions through the Create or Update Skillset (2024-11-01-preview), the Azure portal, and in the Azure SDK beta packages that provide this feature.
Keyless billing for Azure AI skills processing	Applied AI (skills)	You can now use a managed identity and roles for a keyless connection to Azure AI services for built-in skills processing. This capability removes restrictions for having both search and AI services in the same region. Available in the Create or Update Skillset (2024-11-01-preview), the Azure portal, and in the Azure SDK beta packages that provide this feature.
Markdown parsing mode	Indexer data source	With this parsing mode, indexers can generate one-to-one or one-to-many search documents from Markdown files in Azure Storage and OneLake. Available in the Create or Update Indexer (2024-11-01-preview), the Azure portal, and in the Azure SDK beta packages that provide this feature.
2024-11-01-preview	API	Preview release of REST APIs for query rewrite, Document Layout skill, keyless billing for skills processing, Markdown parsing mode, and rescoring options for compressed vectors.
Portal support for structured data	Feature	The Import and vectorize data wizard now supports Azure SQL, Azure Cosmos DB, and Azure Table Storage.

October 2024

Item	Type	Description
Lower the dimension requirements for MRL-trained text embedding models on Azure OpenAI	Feature	Text-embedding-3-small and Text-embedding-3-large are trained using Matryoshka Representation Learning (MRL). This allows you to truncate the embedding vectors to fewer dimensions, and adjust the balance between vector index size usage and retrieval quality. A new `truncationDimension` in the 2024-09-01-preview enables access to MRL compression in text embedding models. This can only be configured for new vector fields.
Unpack `@search.score` to view subscores in hybrid search results	Feature	You can investigate Reciprocal Rank Fusion (RRF) ranked results by viewing the individual query subscores of the final merged and scored result. A new `debug` property unpacks the search score. `QueryResultDocumentSubscores`, `QueryResultDocumentRerankerInput`, and `QueryResultDocumentSemanticField` provide the extra detail. These definitions are available in the 2024-09-01-preview.
Target filters in a hybrid search to just the vector queries	Feature	A filter on a hybrid query involves all subqueries on the request, regardless of type. You can override the global filter to scope the filter to a specific subquery. The new `filterOverride` parameter is available on hybrid queries using the 2024-09-01-preview.
Text Split skill (token chunking)	Applied AI (skills)	This skill has new parameters that improve data chunking for embedding models. A new `unit` parameter lets you specify token chunking. You can now chunk by token length, setting the length to a value that makes sense for your embedding model. You can also specify the tokenizer and any tokens that shouldn't be split during data chunking. The new `unit` parameter and query subscore definitions are found in the 2024-09-01-preview.
2024-09-01-preview	API	Preview release of REST APIs for truncated dimensions in text-embedding-3 models, targeted vector filtering for hybrid queries, RRF subscore details for debugging, and token chunking for Text Split skill.
Portal support for customer-managed key encryption (CMK)	Feature	When you create new objects in the Azure portal, you can now specify CMK-encryption and select an Azure Key Vault to provide the key.

August 2024

Item	Type	Description
Debug Session improvements	feature	There are two important improvements. First, you can now debug integrated vectorization and data chunking workloads. Second, Debug Sessions is redesigned for a more streamlined presentation of skills and mappings. You can select an object in the flow, and view or edit its details in a side panel. The previous tabbed layout is fully replaced with more context-sensitive information on the page.
2024-07-01	API	Stable release of REST APIs for generally available vector data types, vector compression, and integrated vectorization during indexing and queries.
Integrated vectorization	Feature	Announcing general availability. Skills-driven data chunking and embedding during indexing.
Vectorizers	Feature	Announcing general availability. Text-to-vector conversion during query execution. Both Azure OpenAI vectorizer and custom Web API vectorizer are generally available.
AzureOpenAIEmbedding skill	Feature	Announcing general availability. A skill type that calls an Azure OpenAI embedding model to generate embeddings during indexing.
Index projections	Feature	Announcing general availability. A component of a skillset definition that defines the shape of a secondary index, supporting a one-to-many index pattern, where content from an enrichment pipeline can target multiple indexes.
Binary and Scalar quantization	Feature	Announcing general availability. Compress vector index size in memory and on disk using built-in quantization.
Narrow data types	Feature	Announcing general availability. Assign a smaller data type on vector fields, assuming incoming data is of that data type.
Import and vectorize data wizard	Azure portal	Announcing general availability. A wizard that creates a full indexing pipeline that includes data chunking and vectorization. The wizard creates all necessary objects and configurations. This release adds wizard support for Azure Data Lake in Azure Storage.
stored property	Feature	Announcing general availability. Boolean that reduces storage of vector indexes by not storing retrievable vectors.
vectorQueries.Weight property	Feature	Announcing general availability. Specify the relative weight of each vector query in a search operation.

July 2024

Item	Type	Description
Chat with your data	Accelerator	A solution accelerator for the RAG pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to create conversational search experiences. The code with sample data is available for use case scenarios such as financial advisor and contract review and summarization.
Conversational Knowledge Mining	Accelerator	A solution accelerator built on top of Azure AI Search, Azure Speech and Azure OpenAI services that allows customers to extract actionable insights from post-contact center conversations.
Build your own copilot	Accelerator	Create your own custom copilot solution that empowers Client Advisor to harness the power of generative AI across both structured and unstructured data. Help our customers to optimize daily tasks and foster better interactions with more clients.

June 2024

Item	Type	Description
Image search in the Azure portal	Feature	Search explorer now supports image search. In a vector index that has vectorized image content, you can drop images into Search Explorer to query for a match.

May 2024

Item	Type	Description
Higher capacity and more vector quota at every tier (same billing rate)	Infrastructure	For most regions, partition sizes are now even larger for Standard 2 (S2), Standard 3 (S3), and Standard 3 High Density (S3 HD) for services created after April 3, 2024. To get the larger partitions, create a new service in a region that provides newer infrastructure. Storage Optimized tiers (L1 and L2) also have more capacity. L1 and L2 customers must create a new service to benefit from the higher capacity. There's no in-place upgrade at this time. Extra capacity is now available in more regions: Germany North, Germany West Central, South Africa North, Switzerland West, and Azure Government (Texas, Arizona, and Virginia).
OneLake integration (preview)	Feature	New indexer for OneLake files and OneLake shortcuts. If you use Microsoft Fabric and OneLake for data access to Amazon Web Services (AWS) and Google data sources, use this indexer to import external data into a search index. This indexer is available through the Azure portal, the 2024-05-01-preview REST API, and Azure SDK beta packages.
Vector relevance hybrid query relevance	Feature	Four enhancements improve vector and hybrid search relevance. First, you can now set thresholds on vector search results to exclude low-scoring results. Second, changes in the query architecture apply scoring profiles at the end of the query pipeline for every query type. Document boosting is a common scoring profile, and it now works as expected on vector and hybrid queries. Third, you can set `MaxTextRecallSize` and `countAndFacetMode` in hybrid queries to control the quantity of BM25-ranked search results that flow into the hybrid ranking model. Fourth, for vector and hybrid search, you can weight a vector query to have boost or diminish its importance in a multiquery request.
Binary vectors support	Feature	`Collection(Edm.Byte)` is a new supported data type. This data type opens up integration with the Cohere v3 binary embedding models and custom binary quantization. Narrow data types lower the cost of large vector datasets. See Index binary data for vector search for more information.
Azure AI Vision multimodal embeddings skill (preview)	Skill	New skill that's bound to the multimodal embeddings API of Azure AI Vision. You can generate embeddings for text or images during indexing. This skill is available through the Azure portal and the 2024-05-01-preview REST API.
Azure AI Vision vectorizer (preview)	Vectorizer	New vectorizer connects to an Azure AI Vision resource using the multimodal embeddings API to generate embeddings at query time. This vectorizer is available through the Azure portal and the 2024-05-01-preview REST API.
Azure AI Foundry model catalog vectorizer (preview)	Vectorizer	New vectorizer connects to an embedding model deployed from the Azure AI Foundry model catalog. This vectorizer is available through the Azure portal and the 2024-05-01-preview REST API. How to implement integrated vectorization using models from Azure AI Foundry.
AzureOpenAIEmbedding skill (preview) supports more models on Azure OpenAI	Skill	Now supports text-embedding-3-large and text-embedding-3-small, along with text-embedding-ada-002 from the previous update. New `dimensions` and `modelName` properties make it possible to specify the various embedding models on Azure OpenAI. Previously, the dimensions limits were fixed at 1,536 dimensions, applicable to text-embedding-ada-002 only. The updated skill is available through the Azure portal and the 2024-05-01-preview REST API.
Azure portal updates	Portal	Import and vectorize data wizard now supports OneLake indexers as a data source. For embeddings, it also supports connections to Azure AI Vision multimodal, Azure AI Foundry model catalog, and more embedding models on Azure OpenAI. When adding a field to an index, you can choose a binary data type. Search explorer now defaults to 2024-05-01-preview and supports the new preview features for vector and hybrid queries.
2024-05-01-preview	API	New preview version of the Search REST APIs provides new skills and vectorizers, new binary data type, OneLake files indexer, and new query parameters for more relevant results. See Upgrade REST APIs if you have existing code written against the 2023-07-01-preview and need to migrate to this version.
Azure SDK beta packages	API	Review the changelogs of the following Azure SDK beta packages for new feature support: Azure SDK for Python, Azure SDK for .NET, Azure SDK for Java
Python code samples	Samples	New end-to-end samples demonstrate integration with Cohere Embed v3, integration with OneLake and cloud data platforms on Google and AWS, and integration with Azure AI Vision multimodal APIs.

April 2024

Item	Type	Description
Security update addressing information disclosure	API	GET responses no longer return connection strings or keys. Applies to GET Skillset, GET Index, and GET Indexer. This change helps protect your Azure assets integrated with AI Search from unauthorized access.
More storage on Basic and Standard tiers	Infrastructure	Basic now supports up to three partitions and three replicas. Basic and Standard (S1, S2, S3) tiers have significantly more storage per partition, at the same per-partition billing rate. Extra capacity is subject to regional availability and applies to new search services created after April 3, 2024. Currently, there's no in-place upgrade, so you must create a new search service to get the extra storage.
More quota for vectors	Infrastructure	Vector quotas are also higher on new services created after April 3, 2024 in selected regions.
Vector quantization, narrow vector data types, and a new `stored` property (preview)	Feature	Collectively, these three features add vector compression and smarter storage options. First, scalar quantization reduces vector index size in memory and on disk. Second, narrow data types reduce per-field storage by storing smaller values. Third, you can use `stored` to opt-out of storing the extra copy of a vector that's used only for search results. If you don't need vectors in a query response, you can set `stored` to false to save on space.
2024-03-01-preview Search REST API	API	New preview version of the Search REST APIs for the new data types, vector compression properties, and vector storage options.
2024-03-01-preview Management REST API	API	New preview version of the Management REST APIs for control plane operations.
2023-07-01-preview deprecation announcement	API	Deprecation announced on April 8, 2024. It becomes unsupported on July 8, 2024. This was the first REST API that offered vector search support. Newer API versions have a different vector configuration. You should migrate to a newer version as soon as possible.

February 2024

Item	Type	Description
New dimension limits	Feature	For vector fields, maximum dimension limits are now `3072`, up from `2048`.

2023 announcements

Month	Type	Announcement
November	Feature	Vector search, generally available. The previous restriction on customer-managed keys (CMK) is now lifted. Prefiltering and exhaustive K-nearest neighbor algorithm are also now generally available.
November	Feature	Semantic ranker, generally available
November	Feature	Integrated vectorization (preview) adds data chunking and text-to-vector conversions during indexing, and also adds text-to-vector conversions at query time.
November	Feature	Import and vectorize data wizard (preview) automates data chunking and vectorization. It targets the 2023-10-01-Preview REST API.
November	Feature	Index projections (preview) defines the shape of a secondary index, used for a one-to-many index pattern, where content from an enrichment pipeline can target multiple indexes.
November	API	2023-11-01 Search REST API is stable version of the Search REST APIs for vector search and semantic ranking. See Upgrade REST APIs for migration steps to generally available features.
November	API	2023-11-01 Management REST API adds APIs that enable or disable semantic ranker.
November	Skill	Azure OpenAI Embedding skill (preview) connects to a deployed embedding model on your Azure OpenAI resource to generate embeddings during skillset execution.
November	Skill	Text Split skill (preview) updated in 2023-10-01-Preview to support native data chunking.
November	Video	How vector search and semantic ranking improve your GPT prompts explains how hybrid retrieval gives you optimal grounding data for generating useful AI responses and enables search over both concepts and keywords.
November	Sample	Role-based access control in Generative AI applications explains how to use Microsoft Entra ID and Microsoft Graph API to roll out granular user permissions on chunked content in your index.
October	Sample	"Chat with your data" solution accelerator. End-to-end RAG pattern that uses Azure AI Search as a retriever. It provides indexing, data chunking, and orchestration.
October	Feature	Exhaustive K-Nearest Neighbors (KNN) scoring algorithm for similarity search in vector space. Available in the 2023-10-01-Preview REST API only.
October	Feature	Prefilters in vector search evaluate filter criteria before query execution, reducing the amount of content that needs to be searched. Available in the 2023-10-01-Preview REST API only, through a new `vectorFilterMode` property on the query that can be set to `preFilter` (default) or `postFilter`, depending on your requirements.
October	API	2023-10-01-Preview Search REST API, breaking changes the definition for vector fields and vector queries.
August	Feature	Enhanced semantic ranking. Upgraded models are rolling out for semantic reranking, and availability is extended to more regions. Maximum unique token counts doubled from 128 to 256.
July	Sample	Vector demo (Azure SDK for JavaScript). Uses Node.js and the @azure/search-documents 12.0.0-beta.2 library to generate embeddings, create and load an index, and run several vector queries.
July	Sample	Vector demo (Azure SDK for .NET). Uses the Azure.Search.Documents 11.5.0-beta.3 library to generate embeddings, create and load an index, and run several vector queries. You can also try this sample from the Azure SDK team.
July	Sample	Vector demo (Azure SDK for Python) Uses the latest beta release of the azure.search.documents to generate embeddings, create and load an index, and run several vector queries. Visit the azure-search-vector-samples/demo-python repo for more vector search demos.
June	Feature	Vector search public preview.
June	Feature	Semantic search availability, available on the Basic tier.
June	API	2023-07-01-Preview Search REST API. Support for vector search.
May	Feature	Azure RBAC (role-based access control, generally available).
May	API	2022-09-01 Management REST API, with support for configuring search to use Azure roles. The Az.Search module of Azure PowerShell and Az search module of the Azure CLI are updated to support search service authentication options. You can also use the Terraform provider to configure authentication options (see this Terraform quickstart for details).
April	Sample	Multi-region deployment of Azure AI Search for business continuity and disaster recovery. Deployment scripts that fully configure a multi-regional solution for Azure AI Search, with options for synchronizing content and request redirection if an endpoint fails.
March	Sample	ChatGPT + Enterprise data with Azure OpenAI and Azure AI Search (GitHub). Python code and a template for combining Azure AI Search with the large language models in OpenAI. For background, see this Tech Community blog post: Revolutionize your Enterprise Data with ChatGPT. Key points: Use Azure AI Search to consolidate and index searchable content. Query the index for initial search results. Assemble prompts from those results and send to the gpt-35-turbo (preview) model in Azure OpenAI. Return a cross-document answer and provide citations and transparency in your customer-facing app so that users can assess the response.

Previous year's announcements

Service rebrand

This service has had multiple names over the years. Here they are in reverse chronological order:

Azure AI Search (November 2023) Renamed to align with Azure AI services and customer expectations.
Azure Cognitive Search (October 2019) Renamed to reflect the expanded (yet optional) use of cognitive skills and AI processing in service operations.
Azure Search (March 2015) The original name.

Service updates

Service update announcements for Azure AI Search can be found on the Azure web site.

Feature rename

Semantic search was renamed to semantic ranker in November 2023 to better describe the feature, which provides L2 ranking of an existing result set.

Share via