Deploy and configure Unstructured clinical notes enrichment in healthcare data solutions (preview)

[This article is prerelease documentation and is subject to change.]

Unstructured clinical notes enrichment uses Azure AI Language's Text Analytics for health service to extract and add structure to unstructured clinical notes for analytics. You can deploy and configure the capability after deploying healthcare data solutions (preview) to your Fabric workspace and the Healthcare data foundations capability.

Unstructured clinical notes enrichment is an optional capability under healthcare data solutions in Microsoft Fabric (preview). You have the flexibility to decide whether or not to use it, depending on your specific needs or scenarios.

Prerequisites for deployment

  • To use Azure AI Language's Text Analytics for health service, you must accept the Responsible AI Terms and Conditions for deploying the language service in your environment. Follow the installation steps and guidance in Set up Azure Language service.

  • Unstructured clinical notes enrichment has a direct dependency on the Healthcare data foundations capability. Hence, make sure you successfully set up and execute the Healthcare data foundations pipelines first. For more information, see Deploy and configure Healthcare data foundations.

Deploy Unstructured clinical notes enrichment

To deploy Unstructured clinical notes enrichment to your workspace, follow these steps:

  1. Go to the healthcare data solutions home page on Fabric.

  2. Select the Unstructured clinical notes enrichment tile.

    A screenshot displaying the Unstructured clinical notes enrichment tile.

  3. On the capability page, select Deploy to workspace.

    A screenshot displaying how to deploy the capability to your workspace.

  4. The deployment can take a few minutes to complete. Don't close the tab or the browser while the deployment is in progress. In the meantime, you can work in another tab.

  5. After the deployment completes, you'll get a notification. Select the Manage capability button from the message bar to navigate to the capability management page. Here, you can view, configure, and manage the deployed notebook healthcare#_msft_silver_ta4h.

Configure the notebook

The healthcare#_msft_silver_ta4h notebook runs the NLPIngestionService module in the healthcare data solutions (preview) library and uses the Azure Text Analytics for health service. This service is a natural language processing (NLP) API to process and analyze unstructured text data. The results are stored in the mst_silver lakehouse.

Following are the key configuration parameters of this notebook:

  • NLP Config: Lets you customize the NLP settings to align with specific user requirements.

    • enable_text_analytics_logs: Toggle the value to True or False for activating or deactivating the API logs. The default value is set to False. To learn more about how to enable logging, see Enable logs.
    • nlp_source_table_name: Identifies the source table for the Text Analytics for health service to process.
    • nlp_document_limit: Sets the limit on the number of documents that the Text Analytics for health service can process. The default value is set to 10, with a maximum fallback of 1,000 documents. You can adjust this value as needed. However, keep the cost implications in mind, as explained in Pricing model.

We recommend that you schedule this notebook job to run every 4 hours. The initial run might not have data to consume due to concurrent and dependent jobs, which can cause latency. You can reduce this latency by adjusting the frequency of higher layer jobs.