December 2024
These features and Azure Databricks platform improvements were released in December 2024.
Note
Releases are staged. Your Azure Databricks account might not be updated until a week or more after the initial release date.
Databricks Runtime 16.1 is GA
December 20, 2024
Databricks Runtime 16.1 and Databricks Runtime 16.1 ML are now generally available.
See Databricks Runtime 16.1 and Databricks Runtime 16.1 for Machine Learning.
The default format for new notebooks is now IPYNB (Jupyter) format
December 20, 2024
The default format for new notebooks you create in your Azure Databricks workspace is now IPYNB (.ipynb
). Previously, the default format for notebooks was Source (.py, .sql, .scala, .r)
. To change the default format, use the Default file format for notebooks setting in the Developer pane of your workspace user settings. See Notebook formats.
Databricks-hosted models for Databricks Assistant is in public preview
December 19, 2024
You can now use Azure Databricks-hosted models to power Databricks Assistant in Azure Databricks. This feature is in public preview.
See Use a Databricks-hosted model for Databricks Assistant.
Python code executor for AI agents (Public Preview)
December 19, 2024
You can now quickly give your AI agents the ability to run Python code. Databricks now offers a pre-built Unity Catalog function that can be used by an AI agent as a tool to expand their capabilities beyond language generation.
See Code interpreter AI agent tools.
databricks-agents SDK 0.13.0 release
December 18, 2024
Version 0.13.0 of the databricks-agents
SDK has been released to PyPI, containing the following changes:
- Honor the current active Databricks CLI profile and MLflow model registry URI when calling
agents.deploy()
and otherdatabricks.agents
APIs. In particular, you can now specify a combination ofDATABRICKS_CONFIG_PROFILE=my-profile
andMLFLOW_REGISTRY_URI=databricks-uc://my-profile
before callingagents.deploy()
to specify the Databricks CLI profile to use to deploy and access agents. - In
mlflow.evaluate()
, only run retrieval and guidelines metrics if retrieval and guideline context are present, respectively. - Add secret-based authentication to clients for
mlflow.evaluate()
.
External groups are now labeled and immutable
December 18, 2024
External groups are groups that are created in Azure Databricks from Microsoft Entra ID. These groups are created using a SCIM provisioning connector and stay in sync with Microsoft Entra ID. External groups are now explicitly labeled as External
and can no longer be updated from the Azure Databricks account console or workspace admin settings page by default. To update external group membership from the Azure Databricks UI, an account admin can disable Immutable external groups in the account console preview page.
See Types of groups in Azure Databricks.
vector_search
now supports vector embedding inputs
December 17, 2024
The AI Function vector_search
now supports vector embedding inputs. You can now use the query_text
or query_vector
parameters to search for specific text or vector embeddings in vector indexes. See vector_search function.
Specify response formats for ai_query
December 17, 2024
ai_query
now supports the responseFormat
field for structured outputs. Use responseFormat
in your ai_query
requests to specify the response format you want the model you are querying to follow. See ai_query function.
Test clean rooms with collaborators within the same metastore
December 17, 2024
You can now test your clean room before full deployment by adding a collaborator from within the same metastore. See Create clean rooms.
Assign compute resources to groups (Public Preview)
December 17, 2024
The new Dedicated access mode (previously Single user) allows you to assign a dedicated all-purpose compute to a group or single user. See Assign compute resources to a group.
This Public Preview also gives your workspace access to the new simplified compute UI. See Use the simple form to manage compute.
A workspace admin must enable this preview. See Manage Azure Databricks Previews.
Delegate the ability to create a storage credential in Unity Catalog to a service principal
December 17, 2024
You can now grant service principals permissions to create a storage credential in a Unity Catalog metastore using the CREATE STORAGE CREDENTIAL
privilege. See CREATE STORAGE CREDENTIAL.
Version 2.2 of the Jobs API is released
December 16, 2024
The Jobs API version is updated from 2.1 to 2.2. Updates in version 2.2 of the Jobs API include default queueing of new or updated jobs, and enhancing paging of job and job run responses that include fields with large numbers of values. To learn more about the updates in this version, see Updating from Jobs API 2.1 to 2.2. To see the full Jobs API 2.2 documentation, see Jobs (latest). Although Databricks recommends using version 2.2 of the Jobs API, you can still access versions 2.1 and 2.0. See Jobs (2.1) and Jobs API 2.0.
Unity Catalog MANAGE
privilege (Public Preview)
December 14, 2024
You can now grant users the MANAGE
privilege on Unity Catalog securable objects. The MANAGE
privilege allows users to perform key actions on a Unity Catalog object, including:
- Managing privileges
- Dropping the object
- Renaming the object
- Transferring ownership
See MANAGE.
Meta Llama 3.3 is now available for provisioned throughput workloads
December 13, 2024
Meta Llama 3.3, a model architecture built and trained by Meta, is now available for serving on Foundation Model APIs provisioned throughput.
View streaming workload metrics for your job runs (Public Preview)
** December 12, 2024**
When you view job runs in the Databricks Jobs UI, you can now view metrics such as backlog seconds, backlog bytes, backlog records, and backlog files for sources supported by Spark Structured Streaming, including Apache Kafka, Amazon Kinesis, and Auto Loader. See View metrics for streaming tasks.
View streaming workload metrics for your Delta Live Tables pipeline updates (Public Preview)
** December 12, 2024**
When you view pipeline updates in the Delta Live Tables UI, you can now view metrics such as backlog seconds, backlog bytes, backlog records, and backlog files for each streaming flow in the pipeline. Streaming metrics are supported for Spark Structured Streaming sources, including Apache Kafka, Amazon Kinesis, and Auto Loader. See View streaming metrics.
Lakehouse Federation supports Oracle (Public Preview)
December 12, 2024
You can now run federated queries on data managed by Oracle. See Run federated queries on Oracle.
Databricks Runtime 16.1 (Beta)
December 11, 2024
Databricks Runtime 16.1 and Databricks Runtime 16.1 ML are now available as Beta releases.
See Databricks Runtime 16.1 and Databricks Runtime 16.1 for Machine Learning
Manage serverless outbound network connections with serverless egress control
December 11th, 2024
Serverless egress control lets you restrict outbound access to specified internet destinations. See What is serverless egress control?.
Network access events system table is now available (Public Preview)
December 11, 2024
Azure Databricks system tables now include a network access table. This table logs an event whenever internet access is denied from your account. To access the table, admins must have the access
system schema enabled. See Network access events system table reference.
Monitor and revoke personal access tokens in your account (Private Preview)
December 11, 2024
Account admins can now view a token report to monitor and revoke personal access tokens (PATs) in the account console. Databricks recommends you use OAuth access tokens instead of PATs for greater security and convenience. To join this preview, contact your Azure Databricks account team. See Monitor and revoke personal access tokens in the account.
Unity Catalog can federate to Hive metastores
December 11, 2024
You can now use Unity Catalog to access and govern data that is registered in a Hive metastore. This includes both externally-managed Hive metastores and legacy internal Databricks Hive Hive metastores.
See Hive metastore federation: enable Unity Catalog to govern tables registered in a Hive metastore.
Remove metastore-level storage to enforce catalog-level storage isolation
December 11, 2024
If you have metastore-level storage for managed tables and volumes (also known as the metastore storage root), but you want to enforce data storage isolation at the catalog or schema level, you can now remove that metastore-level storage without interrupting existing workloads. See Remove metastore-level storage.
Meta Llama 3.3 70B Instruct is now available on Model Serving
December 11, 2024
Mosaic AI Model Serving now supports Meta Llama 3.3 70B Instruct, a state-of-the-art large language model built and trained by Meta. Llama 3.3 70B Instruct is available as part of Foundation Model APIs pay-per-token. This availability also includes support on Function Calling.
Starting December 11, 2024, Meta-Llama-3.1-70B-Instruct replaces support for Meta-Llama-3-70B-Instruct in Foundation Model APIs pay-per-token endpoints.
bamboolib is now deprecated
December 10, 2024
bamboolib is now deprecated. Users can still access bamboolib to perform low-code data analysis within notebooks, but Databricks is no longer actively developing nor supporting this tool. For assistance with code generation, use the Databricks Assistant.
Streamline AI Agent Evaluation using synthetic evaluation sets
December 9, 2024
Evaluate your AI agent by generating a representative evaluation set from your documents. The synthetic generation API is tightly integrated with Agent Evaluation, allowing you to quickly evaluate and improve the quality of your agent’s responses without going through the costly process of human labeling. See Synthesize evaluation sets.
Improve Databricks-to-Databricks Delta Sharing table read performance with history sharing (Public Preview)
December 5, 2024
Improve performance for Databricks-to-Databricks table shares by enabling history sharing. See Improve table read performance with history sharing.
Maximum personal access token lifetime now 730 days (two years)
December 5, 2024
The default maximum lifetime for newly created Databricks-issued personal access tokens is now set to 730 days (two years). Previously, personal access tokens could be created with no expiration by default. With this update, users cannot generate new tokens with a lifetime exceeding 730 days, and tokens created without a specified lifetime are set to a 730-day duration. If you configured the maximum token lifetime for your workspace to less than 730 days, the configuration remains unchanged. See Monitor and revoke personal access tokens and Azure Databricks personal access token authentication.
Mosaic AI Model Training - serverless forecasting (Public Preview)
December 5, 2024
Mosaic AI Model Training - forecasting improves upon the existing AutoML forecasting experience with managed serverless compute, Unity Catalog support, access to deep learning algorithms, and an upgraded interface. See Forecasting (serverless) with AutoML.
Add budget policies to model serving endpoints
December 4, 2024
Budget policies are now supported on model serving endpoints. See Manage model serving endpoints.