Configure DQS to Use Reference Data
This topic describes how to configure Data Quality Services (DQS) to use reference data for cleansing your data. You can use reference data either from Azure Marketplace or from direct online third-party reference data providers.
Before You Begin
Prerequisites
To use reference data from Marketplace, you must have a valid Marketplace account key. For detailed information about creating a Marketplace account key, see Create Your Account (https://go.microsoft.com/fwlink/?LinkId=212936). You can also create a Marketplace account key from within Data Quality Client by clicking Configuration under Administration in the Data Quality Client home screen, and then clicking Create a DataMarket Account ID under the Reference Data tab.
Security
Permissions
You must have the dqs_administrator role on the DQS_MAIN database to configure reference data service settings in DQS.
Configure DQS to Use Reference Data from Marketplace
Start Data Quality Client. For information about doing so, see Run the Data Quality Client Application.
In the Data Quality Client home screen, under Administration, click Configuration.
In the Reference Data tab, under the Network Settings area, type appropriate values in the Proxy Server and Port boxes if you or your organization uses a proxy server to connect to the Internet.
Specify the Marketplace account key in the DataMarket Account ID box, and click the Validate DataMarket Account ID icon to validate the account key. A message indicates whether the specified Marketplace account key is valid.
You are now ready to use, in DQS, the Marketplace reference data services to which the specified Marketplace account key is subscribed.
Configure DQS to Use Reference Data from Direct Online Third-Party Reference Data Providers
Start Data Quality Client. For information about doing so, see Run the Data Quality Client Application.
In the Data Quality Client home screen, under Administration, click Configuration.
In the Reference Data tab, under the Network Settings area, type appropriate values in the Proxy Server and Port boxes if you or your organization uses a proxy server to connect to the Internet.
In the Direct Online 3rd Party Reference Data Service Settings area, click the Add new reference data service provider icon.
In the Create New Direct Online 3rd Party Reference Data Service Provider dialog box, specify the following details:
In the Name box, type a name for the new direct reference data service provider.
(Optional) In the Description box, type a description of the new direct reference data service provider.
In the Category box, type the category of the data provided by the new direct reference data service provider.
In the Schema box, specify the schema that defines the string of fields (column names) to be used from the direct reference data service provider. Field names must not contain spaces, and the fields must be separated by commas. For example:
FirstName, LastName, City, State
In the URI box, type the URI of the direct reference data service provider. Only secure URIs (addresses starting with "https://") are allowed in DQS.
In the Max Batch Size box, type the maximum number of records per batch that will be sent to the reference data service provider for cleansing. You can specify at most 100 records per batch for the cleansing activity.
In the Account ID box, type the account ID of the subscriber with the reference data service provider.
Click OK to save the settings and close the Create New Direct Online 3rd Party Reference Data Service Provider dialog box. The newly added direct online third-party reference data provider becomes available in the Direct Reference Data Service Providers Grid in DQS.
You are now ready to use the reference data services from the newly configured direct online third-party reference data service provider in DQS.
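Before typing the provider details into the dialog box, it can help to check them against the constraints stated in the steps above: field names without spaces and separated by commas, a URI starting with "https://", and a batch size of at most 100. The following Python sketch is one way to do that outside DQS. It is not part of DQS or Data Quality Client; the function names, the example URI, the lower bound of 1 for the batch size, and the decision to ignore whitespace around commas (as the example schema suggests) are all assumptions made for illustration.

```python
from urllib.parse import urlparse

MAX_BATCH_LIMIT = 100  # upper limit stated for the Max Batch Size box


def parse_schema(schema):
    """Split a comma-separated schema string into a list of field names.

    Assumption: whitespace around the commas is ignored, as the example
    "FirstName, LastName, City, State" suggests.
    """
    fields = [part.strip() for part in schema.split(",")]
    for field in fields:
        if not field:
            raise ValueError("schema contains an empty field name")
        if any(ch.isspace() for ch in field):
            raise ValueError(f"field name contains a space: {field!r}")
    return fields


def check_provider_settings(schema, uri, max_batch_size):
    """Return a list of problems found; an empty list means all checks passed."""
    problems = []

    # Schema: comma-separated field names, none containing a space.
    try:
        parse_schema(schema)
    except ValueError as exc:
        problems.append(str(exc))

    # URI: only secure (https) addresses are allowed.
    parsed = urlparse(uri)
    if parsed.scheme != "https" or not parsed.netloc:
        problems.append("URI must start with https://")

    # Batch size: at most 100. The lower bound of 1 is an assumption.
    if not 1 <= max_batch_size <= MAX_BATCH_LIMIT:
        problems.append(f"Max Batch Size must be between 1 and {MAX_BATCH_LIMIT}")

    return problems


if __name__ == "__main__":
    print(check_provider_settings(
        "FirstName, LastName, City, State",
        "https://provider.example.com/rds",  # hypothetical provider URI
        100,
    ))
```

An empty list means the values satisfy the constraints described in this topic. The dialog box in Data Quality Client may perform its own validation, and a provider may impose further requirements that this sketch does not cover.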
Follow Up: After Configuring DQS to Use Reference Data
You must now map the required knowledge base domains to the reference data available from the data providers you just configured. To do so, see Attach a Domain or Composite Domain to Reference Data.