Connect to and manage Azure Machine Learning in Microsoft Purview (Preview)
This article outlines how to register Azure Machine Learning and how to authenticate and interact with Azure Machine Learning in Microsoft Purview. For more information about Microsoft Purview, read the introductory article.
This integration between Azure Machine Learning and Microsoft Purview applies an auto push model that, once the Azure Machine Learning workspace has been registered in Microsoft Purview, the metadata from workspace is pushed to Microsoft Purview automatically on a daily basis. It isn't necessary to manually scan to bring metadata from the workspace into Microsoft Purview.
Important
This feature is currently in preview. The Supplemental Terms of Use for Microsoft Azure Previews include additional legal terms that apply to Azure features that are in beta, in preview, or otherwise not yet released into general availability.
Supported capabilities
Metadata Extraction | Full Scan | Incremental Scan | Scoped Scan | Classification | Labeling | Access Policy | Lineage | Data Sharing | Live view |
---|---|---|---|---|---|---|---|---|---|
Yes | Yes | Yes | No | No | No | No | Yes | No | No |
When scanning the Azure Machine Learning source, Microsoft Purview supports:
- Extracting technical metadata from Azure Machine Learning, including:
- Workspace
- Models
- Datasets
- Jobs
Note
- AML workspaces don't currently support pushing metadata through a private endpoint to Microsoft Purview.
- You must register assets in your AML workspace for them to appear in Microsoft Purview.
Prerequisites
You must have an Azure account with an active subscription. Create an account for free.
You must have an active Microsoft Purview account.
You need Data Source Administrator and Data Reader permissions to register a source and manage it in the Microsoft Purview governance portal. For more information about permissions, see Access control in Microsoft Purview.
An active Azure Machine Learning workspace
A user needs the Contributor role in the Azure Machine Learning workspace to enable auto push from Azure Machine Learning workspace.
Register
This section describes how to register an Azure Machine Learning workspace in Microsoft Purview by using the Microsoft Purview governance portal.
Go to your Microsoft Purview account.
Select Data Map on the left pane.
Select Register.
In Register sources, select Azure Machine Learning (Preview) > Continue.
On the Register sources (Azure Machine Learning) screen, do the following:
For Name, enter a friendly name that Microsoft Purview lists as the data source for the workspace.
For Azure subscription and Workspace name, select the subscription and workspace that you want to push from the dropdown. The Azure Machine Learning workspace URL is automatically populated.
Select a collection from the list.
Select Register to register the source.
Scan
After you register your Azure Machine Learning workspace, the metadata will be automatically pushed to Microsoft Purview on a daily basis.
Browse and discover
To access the browse experience for data assets from your Azure Machine Learning workspace, select Browse Assets.
Browse by collection
Browse by collection allows you to explore the different collections you're a data reader or curator for.
Browse by source type
On the browse by source types page, select Azure Machine Learning.
The top-level assets under your selected data type are listed. Pick one of the assets to further explore its contents. For example, after selecting Azure Machine Learning, you'll see a list of workspaces with assets in Unified Catalog.
Selecting one of the workspaces displays the child assets.
From the list, you can select on any of the asset items to view details. For example, selecting one of the Azure Machine Learning job assets displays the details of the job.
Lineage
To view lineage information, select an asset and then select the Lineage tab. From the lineage tab, you can see the asset's relationships when applicable. You can see what source data was used (if registered in Purview), the data asset created in Azure Machine Learning, any jobs, and finally the resulting machine learning model. In more advanced scenarios, you can see:
- If multiple data sources were used
- Multiple stages of training on multiple data assets
- If multiple models were created from the same data sources
For more information on lineage in general, see data lineage and lineage users guide.
Next steps
Now that you've registered your source, use the following guides to learn more about Microsoft Purview and your data: