What's new in Azure AI Search
Azure Cognitive Search is now Azure AI Search. Learn about the latest updates to Azure AI Search functionality, docs, and samples.
Note
Preview features are announced here, but we also maintain a preview features list so you can find them in one place.
December 2024
Item | Type | Description |
---|---|---|
RAG chat with Azure AI Search + Python | Template | An AI application template for building a RAG solution using Azure AI Search and Python. |
November 2024
Item | Type | Description |
---|---|---|
Network security perimeter | Security | Join a search service to a network security perimeter to control network access to your search service. The Azure portal and the Management REST APIs in the 2024-06-01-preview can be used to view and reconcile network security perimeter configurations. |
Shared private link support for Azure AI service connections | Security | Connections to Azure AI for built-in skills processing can now be private using a shared private link on the connection. |
Rescoring options for compressed vectors | Relevance | You can set options to rescore with original vectors instead of compressed vectors. Applies to HNSW and exhaustive KNN vector algorithms, using binary and scalar compression. Available in the Create or Update Index (2024-11-01-preview), the Azure portal, and in the Azure SDK beta packages that provide this feature. |
Store fewer vector instances | vector search | In vector compression scenarios, you can omit storage of full precision vectors if you don't need them for rescoring. Available in the Create or Update Index (2024-11-01-preview), the Azure portal, and in the Azure SDK beta packages that provide this feature. |
Query rewrite in the semantic reranker | Relevance | You can set options on a semantic query to rewrite the query input into a revised or expanded query that generates more relevant results from the L2 ranker. Available in the Search Documents (2024-11-01-preview), the Azure portal, and in the Azure SDK beta packages that provide this feature. |
New semantic ranker models | Relevance | Semantic ranker runs with improved models in all supported regions. There is no change to APIs or the Azure portal experience. |
Document Layout skill | Applied AI (skills) | A new skill used to analyze a document for structure and provide structure-aware (paragraph) chunking. This skill calls Document Intelligence and uses the Document Intelligence layout model. Available in selected regions through the Create or Update Skillset (2024-11-01-preview), the Azure portal, and in the Azure SDK beta packages that provide this feature. |
Keyless billing for Azure AI skills processing | Applied AI (skills) | You can now use a managed identity and roles for a keyless connection to Azure AI services for built-in skills processing. This capability removes restrictions for having both search and AI services in the same region. Available in the Create or Update Skillset (2024-11-01-preview), the Azure portal, and in the Azure SDK beta packages that provide this feature. |
Markdown parsing mode | Indexer data source | With this parsing mode, indexers can generate one-to-one or one-to-many search documents from Markdown files in Azure Storage and OneLake. Available in the Create or Update Indexer (2024-11-01-preview), the Azure portal, and in the Azure SDK beta packages that provide this feature. |
2024-11-01-preview | API | Preview release of REST APIs for query rewrite, Document Layout skill, keyless billing for skills processing, Markdown parsing mode, and rescoring options for compressed vectors. |
Portal support for structured data | Feature | The Import and vectorize data wizard now supports Azure SQL, Azure Cosmos DB, and Azure Table Storage. |
October 2024
Item | Type | Description |
---|---|---|
Lower the dimension requirements for MRL-trained text embedding models on Azure OpenAI | Feature | Text-embedding-3-small and Text-embedding-3-large are trained using Matryoshka Representation Learning (MRL). This allows you to truncate the embedding vectors to fewer dimensions, and adjust the balance between vector index size usage and retrieval quality. A new truncationDimension in the 2024-09-01-preview enables access to MRL compression in text embedding models. This can only be configured for new vector fields. |
Unpack @search.score to view subscores in hybrid search results |
Feature | You can investigate Reciprocal Rank Fusion (RRF) ranked results by viewing the individual query subscores of the final merged and scored result. A new debug property unpacks the search score. QueryResultDocumentSubscores , QueryResultDocumentRerankerInput , and QueryResultDocumentSemanticField provide the extra detail. These definitions are available in the 2024-09-01-preview. |
Target filters in a hybrid search to just the vector queries | Feature | A filter on a hybrid query involves all subqueries on the request, regardless of type. You can override the global filter to scope the filter to a specific subquery. The new filterOverride parameter is available on hybrid queries using the 2024-09-01-preview. |
Text Split skill (token chunking) | Applied AI (skills) | This skill has new parameters that improve data chunking for embedding models. A new unit parameter lets you specify token chunking. You can now chunk by token length, setting the length to a value that makes sense for your embedding model. You can also specify the tokenizer and any tokens that shouldn't be split during data chunking. The new unit parameter and query subscore definitions are found in the 2024-09-01-preview. |
2024-09-01-preview | API | Preview release of REST APIs for truncated dimensions in text-embedding-3 models, targeted vector filtering for hybrid queries, RRF subscore details for debugging, and token chunking for Text Split skill. |
Portal support for customer-managed key encryption (CMK) | Feature | When you create new objects in the Azure portal, you can now specify CMK-encryption and select an Azure Key Vault to provide the key. |
August 2024
Item | Type | Description |
---|---|---|
Debug Session improvements | feature | There are two important improvements. First, you can now debug integrated vectorization and data chunking workloads. Second, Debug Sessions is redesigned for a more streamlined presentation of skills and mappings. You can select an object in the flow, and view or edit its details in a side panel. The previous tabbed layout is fully replaced with more context-sensitive information on the page. |
2024-07-01 | API | Stable release of REST APIs for generally available vector data types, vector compression, and integrated vectorization during indexing and queries. |
Integrated vectorization | Feature | Announcing general availability. Skills-driven data chunking and embedding during indexing. |
Vectorizers | Feature | Announcing general availability. Text-to-vector conversion during query execution. Both Azure OpenAI vectorizer and custom Web API vectorizer are generally available. |
AzureOpenAIEmbedding skill | Feature | Announcing general availability. A skill type that calls an Azure OpenAI embedding model to generate embeddings during indexing. |
Index projections | Feature | Announcing general availability. A component of a skillset definition that defines the shape of a secondary index, supporting a one-to-many index pattern, where content from an enrichment pipeline can target multiple indexes. |
Binary and Scalar quantization | Feature | Announcing general availability. Compress vector index size in memory and on disk using built-in quantization. |
Narrow data types | Feature | Announcing general availability. Assign a smaller data type on vector fields, assuming incoming data is of that data type. |
Import and vectorize data wizard | Azure portal | Announcing general availability. A wizard that creates a full indexing pipeline that includes data chunking and vectorization. The wizard creates all necessary objects and configurations. This release adds wizard support for Azure Data Lake in Azure Storage. |
stored property | Feature | Announcing general availability. Boolean that reduces storage of vector indexes by not storing retrievable vectors. |
vectorQueries.Weight property | Feature | Announcing general availability. Specify the relative weight of each vector query in a search operation. |
July 2024
Item | Type | Description |
---|---|---|
Chat with your data | Accelerator | A solution accelerator for the RAG pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to create conversational search experiences. The code with sample data is available for use case scenarios such as financial advisor and contract review and summarization. |
Conversational Knowledge Mining | Accelerator | A solution accelerator built on top of Azure AI Search, Azure Speech and Azure OpenAI services that allows customers to extract actionable insights from post-contact center conversations. |
Build your own copilot | Accelerator | Create your own custom copilot solution that empowers Client Advisor to harness the power of generative AI across both structured and unstructured data. Help our customers to optimize daily tasks and foster better interactions with more clients. |
June 2024
Item | Type | Description |
---|---|---|
Image search in the Azure portal | Feature | Search explorer now supports image search. In a vector index that has vectorized image content, you can drop images into Search Explorer to query for a match. |
May 2024
Item | Type | Description |
---|---|---|
Higher capacity and more vector quota at every tier (same billing rate) | Infrastructure | For most regions, partition sizes are now even larger for Standard 2 (S2), Standard 3 (S3), and Standard 3 High Density (S3 HD) for services created after April 3, 2024. To get the larger partitions, create a new service in a region that provides newer infrastructure. Storage Optimized tiers (L1 and L2) also have more capacity. L1 and L2 customers must create a new service to benefit from the higher capacity. There's no in-place upgrade at this time. Extra capacity is now available in more regions: Germany North, Germany West Central, South Africa North, Switzerland West, and Azure Government (Texas, Arizona, and Virginia). |
OneLake integration (preview) | Feature | New indexer for OneLake files and OneLake shortcuts. If you use Microsoft Fabric and OneLake for data access to Amazon Web Services (AWS) and Google data sources, use this indexer to import external data into a search index. This indexer is available through the Azure portal, the 2024-05-01-preview REST API, and Azure SDK beta packages. |
Vector relevance hybrid query relevance |
Feature | Four enhancements improve vector and hybrid search relevance. First, you can now set thresholds on vector search results to exclude low-scoring results. Second, changes in the query architecture apply scoring profiles at the end of the query pipeline for every query type. Document boosting is a common scoring profile, and it now works as expected on vector and hybrid queries. Third, you can set MaxTextRecallSize and countAndFacetMode in hybrid queries to control the quantity of BM25-ranked search results that flow into the hybrid ranking model. Fourth, for vector and hybrid search, you can weight a vector query to have boost or diminish its importance in a multiquery request. |
Binary vectors support | Feature | Collection(Edm.Byte) is a new supported data type. This data type opens up integration with the Cohere v3 binary embedding models and custom binary quantization. Narrow data types lower the cost of large vector datasets. See Index binary data for vector search for more information. |
Azure AI Vision multimodal embeddings skill (preview) | Skill | New skill that's bound to the multimodal embeddings API of Azure AI Vision. You can generate embeddings for text or images during indexing. This skill is available through the Azure portal and the 2024-05-01-preview REST API. |
Azure AI Vision vectorizer (preview) | Vectorizer | New vectorizer connects to an Azure AI Vision resource using the multimodal embeddings API to generate embeddings at query time. This vectorizer is available through the Azure portal and the 2024-05-01-preview REST API. |
Azure AI Foundry model catalog vectorizer (preview) | Vectorizer | New vectorizer connects to an embedding model deployed from the Azure AI Foundry model catalog. This vectorizer is available through the Azure portal and the 2024-05-01-preview REST API. How to implement integrated vectorization using models from Azure AI Foundry. |
AzureOpenAIEmbedding skill (preview) supports more models on Azure OpenAI | Skill | Now supports text-embedding-3-large and text-embedding-3-small, along with text-embedding-ada-002 from the previous update. New dimensions and modelName properties make it possible to specify the various embedding models on Azure OpenAI. Previously, the dimensions limits were fixed at 1,536 dimensions, applicable to text-embedding-ada-002 only. The updated skill is available through the Azure portal and the 2024-05-01-preview REST API. |
Azure portal updates | Portal | Import and vectorize data wizard now supports OneLake indexers as a data source. For embeddings, it also supports connections to Azure AI Vision multimodal, Azure AI Foundry model catalog, and more embedding models on Azure OpenAI. When adding a field to an index, you can choose a binary data type. Search explorer now defaults to 2024-05-01-preview and supports the new preview features for vector and hybrid queries. |
2024-05-01-preview | API | New preview version of the Search REST APIs provides new skills and vectorizers, new binary data type, OneLake files indexer, and new query parameters for more relevant results. See Upgrade REST APIs if you have existing code written against the 2023-07-01-preview and need to migrate to this version. |
Azure SDK beta packages | API | Review the changelogs of the following Azure SDK beta packages for new feature support: Azure SDK for Python, Azure SDK for .NET, Azure SDK for Java |
Python code samples | Samples | New end-to-end samples demonstrate integration with Cohere Embed v3, integration with OneLake and cloud data platforms on Google and AWS, and integration with Azure AI Vision multimodal APIs. |
April 2024
Item | Type | Description |
---|---|---|
Security update addressing information disclosure | API | GET responses no longer return connection strings or keys. Applies to GET Skillset, GET Index, and GET Indexer. This change helps protect your Azure assets integrated with AI Search from unauthorized access. |
More storage on Basic and Standard tiers | Infrastructure | Basic now supports up to three partitions and three replicas. Basic and Standard (S1, S2, S3) tiers have significantly more storage per partition, at the same per-partition billing rate. Extra capacity is subject to regional availability and applies to new search services created after April 3, 2024. Currently, there's no in-place upgrade, so you must create a new search service to get the extra storage. |
More quota for vectors | Infrastructure | Vector quotas are also higher on new services created after April 3, 2024 in selected regions. |
Vector quantization, narrow vector data types, and a new stored property (preview) |
Feature | Collectively, these three features add vector compression and smarter storage options. First, scalar quantization reduces vector index size in memory and on disk. Second, narrow data types reduce per-field storage by storing smaller values. Third, you can use stored to opt-out of storing the extra copy of a vector that's used only for search results. If you don't need vectors in a query response, you can set stored to false to save on space. |
2024-03-01-preview Search REST API | API | New preview version of the Search REST APIs for the new data types, vector compression properties, and vector storage options. |
2024-03-01-preview Management REST API | API | New preview version of the Management REST APIs for control plane operations. |
2023-07-01-preview deprecation announcement | API | Deprecation announced on April 8, 2024. It becomes unsupported on July 8, 2024. This was the first REST API that offered vector search support. Newer API versions have a different vector configuration. You should migrate to a newer version as soon as possible. |
February 2024
Item | Type | Description |
---|---|---|
New dimension limits | Feature | For vector fields, maximum dimension limits are now 3072 , up from 2048 . |
2023 announcements
Month | Type | Announcement |
---|---|---|
November | Feature | Vector search, generally available. The previous restriction on customer-managed keys (CMK) is now lifted. Prefiltering and exhaustive K-nearest neighbor algorithm are also now generally available. |
November | Feature | Semantic ranker, generally available |
November | Feature | Integrated vectorization (preview) adds data chunking and text-to-vector conversions during indexing, and also adds text-to-vector conversions at query time. |
November | Feature | Import and vectorize data wizard (preview) automates data chunking and vectorization. It targets the 2023-10-01-Preview REST API. |
November | Feature | Index projections (preview) defines the shape of a secondary index, used for a one-to-many index pattern, where content from an enrichment pipeline can target multiple indexes. |
November | API | 2023-11-01 Search REST API is stable version of the Search REST APIs for vector search and semantic ranking. See Upgrade REST APIs for migration steps to generally available features. |
November | API | 2023-11-01 Management REST API adds APIs that enable or disable semantic ranker. |
November | Skill | Azure OpenAI Embedding skill (preview) connects to a deployed embedding model on your Azure OpenAI resource to generate embeddings during skillset execution. |
November | Skill | Text Split skill (preview) updated in 2023-10-01-Preview to support native data chunking. |
November | Video | How vector search and semantic ranking improve your GPT prompts explains how hybrid retrieval gives you optimal grounding data for generating useful AI responses and enables search over both concepts and keywords. |
November | Sample | Role-based access control in Generative AI applications explains how to use Microsoft Entra ID and Microsoft Graph API to roll out granular user permissions on chunked content in your index. |
October | Sample | "Chat with your data" solution accelerator. End-to-end RAG pattern that uses Azure AI Search as a retriever. It provides indexing, data chunking, and orchestration. |
October | Feature | Exhaustive K-Nearest Neighbors (KNN) scoring algorithm for similarity search in vector space. Available in the 2023-10-01-Preview REST API only. |
October | Feature | Prefilters in vector search evaluate filter criteria before query execution, reducing the amount of content that needs to be searched. Available in the 2023-10-01-Preview REST API only, through a new vectorFilterMode property on the query that can be set to preFilter (default) or postFilter , depending on your requirements. |
October | API | 2023-10-01-Preview Search REST API, breaking changes the definition for vector fields and vector queries. |
August | Feature | Enhanced semantic ranking. Upgraded models are rolling out for semantic reranking, and availability is extended to more regions. Maximum unique token counts doubled from 128 to 256. |
July | Sample | Vector demo (Azure SDK for JavaScript). Uses Node.js and the @azure/search-documents 12.0.0-beta.2 library to generate embeddings, create and load an index, and run several vector queries. |
July | Sample | Vector demo (Azure SDK for .NET). Uses the Azure.Search.Documents 11.5.0-beta.3 library to generate embeddings, create and load an index, and run several vector queries. You can also try this sample from the Azure SDK team. |
July | Sample | Vector demo (Azure SDK for Python) Uses the latest beta release of the azure.search.documents to generate embeddings, create and load an index, and run several vector queries. Visit the azure-search-vector-samples/demo-python repo for more vector search demos. |
June | Feature | Vector search public preview. |
June | Feature | Semantic search availability, available on the Basic tier. |
June | API | 2023-07-01-Preview Search REST API. Support for vector search. |
May | Feature | Azure RBAC (role-based access control, generally available). |
May | API | 2022-09-01 Management REST API, with support for configuring search to use Azure roles. The Az.Search module of Azure PowerShell and Az search module of the Azure CLI are updated to support search service authentication options. You can also use the Terraform provider to configure authentication options (see this Terraform quickstart for details). |
April | Sample | Multi-region deployment of Azure AI Search for business continuity and disaster recovery. Deployment scripts that fully configure a multi-regional solution for Azure AI Search, with options for synchronizing content and request redirection if an endpoint fails. |
March | Sample | ChatGPT + Enterprise data with Azure OpenAI and Azure AI Search (GitHub). Python code and a template for combining Azure AI Search with the large language models in OpenAI. For background, see this Tech Community blog post: Revolutionize your Enterprise Data with ChatGPT. Key points: Use Azure AI Search to consolidate and index searchable content. Query the index for initial search results. Assemble prompts from those results and send to the gpt-35-turbo (preview) model in Azure OpenAI. Return a cross-document answer and provide citations and transparency in your customer-facing app so that users can assess the response. |
Previous year's announcements
Service rebrand
This service has had multiple names over the years. Here they are in reverse chronological order:
- Azure AI Search (November 2023) Renamed to align with Azure AI services and customer expectations.
- Azure Cognitive Search (October 2019) Renamed to reflect the expanded (yet optional) use of cognitive skills and AI processing in service operations.
- Azure Search (March 2015) The original name.
Service updates
Service update announcements for Azure AI Search can be found on the Azure web site.
Feature rename
Semantic search was renamed to semantic ranker in November 2023 to better describe the feature, which provides L2 ranking of an existing result set.