Redigera

Dela via


WordPress.org Microsoft Graph connector (Preview)

With the Microsoft Graph connector for WordPress.org-built websites, your organization can index published posts and pages of your WordPress.org-built websites. After you configure the connector and index content from WordPress.org-built websites, end users can search for those published posts and pages in Microsoft Copilot and from any Microsoft Search client.

This article is for Microsoft 365 administrators or anyone who configures, runs, and monitors a WordPress.org Graph connector.

Note

The WordPress.org connector is in preview. If you wish to get early access to try it, sign up using this form.

Capabilities

  • Index published posts and pages of your WordPress.org-built website.
  • Set ingestion filters of published posts by categories.
  • Customize your crawl frequency. 
  • Create workflows using this connection and plugins from Microsoft Copilot Studio. 
  • Use Semantic search in Copilot to enable users to find relevant content.

Limitations

  • Doesn't index comments.
  • Doesn't crawl user identities and access permissions. All published pages or posts indexed using the WordPress connector are visible to all Microsoft 365 users in your tenant, from Microsoft Search or Copilot.

Prerequisites

  • You must be the search admin for your organization's Microsoft 365 tenant.
  • Install the Microsoft Graph connector agent: To access your WordPress.org-built website, you must install and configure the Microsoft Graph connector agent. Download the agent installer and follow the installation instructions to set it up. Once installed, ensure that the agent is configured correctly to connect your WordPress.org-built website with the graph connector.
  • WordPress.org-built website URL: To connect to your WordPress.org-built website data, you need your organization's WordPress.org-built website URL.
  • WordPress.org-built website Admin account: To connect to your WordPress.org-built website and allow Microsoft Graph Connector to update published posts and pages regularly, you need an admin user account of your WordPress.org-built website with the permission to create an application password. Application password is used to authenticate with a third-party service or application that connects to your WordPress.org-built website via REST API.

Get Started

1. Display name

A display name is used to identify each citation in Copilot, helping users easily recognize the associated file or item. Display name also signifies trusted content. Display name is also used as a content source filter. A default value is present for this field, but you can customize it to a name that users in your organization recognize.

2. WordPress.org-built website URL

A WordPress.org-built website URL is the unique web address assigned to each WordPress.org-built website, allowing you to access your specific WordPress.org-built website.

3. Graph Connector Agent

The graph connector agent acts as a bridge between your WordPress.org instance and the connector APIs, enabling secure and efficient data transfer. In this step, select the agent configuration you want to use for your connector. 

If you have not installed the Microsoft Graph connector agent already, you can download the agent installer and follow the installation instructions to set it up. Once installed, ensure that the agent is configured correctly to connect your on-premises WordPress.org instance with the Graph connector.

4. Authentication Type

We support the basic authentication method. To enable and configure basic authentication in WordPress.org, find more details here. 

5. Staged rollout to limited audience

Deploy this connection to a limited user base if you want to validate it in Copilot and other Search surfaces before expanding the rollout to a broader audience.

At this point, you are ready to create the connection for WordPress.org-built website. You can click on the ‘Create’ button to publish your connection and index published posts and pages from your WordPress.org-built website.

For other settings, like Access Permissions, Data inclusion rules, Schema, Crawl frequency etc., we set defaults based on what works best with WordPress.org-built website data. The default values settings are as follows.

Page Settings Default Values
Users Access Permissions All published pages or posts indexed using the WordPress.org connector are visible to all Microsoft 365 users in your tenant, from Microsoft Search or Copilot.
Content Index Content All published posts and pages are selected by default.
Content Manage Properties To check default properties and their schema, click here.
Sync Incremental Crawl Frequency: Every 15 mins
Sync Full crawl Frequency: Every day

If you want to edit any of these values, you need to choose the ‘Custom Setup’ option.

Custom Setup

Custom setup is for those admins who want to edit the default values for settings. Once you click on the ‘Custom Setup’ option, you should see three other tabs – Users, Content, and Sync.

Users

Access permissions

Currently only published pages and posts from your WordPress.org-built websites are indexed. All data indexed using the WordPress.org connector is visible to all Microsoft 365 users in your tenant, from Microsoft Search or Copilot.

Content

Filter the Indexed Content

You can specify conditions for indexed content. For example, you can choose to index posts or pages and choose to index posts associated to specific categories.

Use the preview results button to verify the sample values of the selected properties and filters.

Manage Properties

Here, you can add or remove available properties from your WordPress.org data source, assign a schema to the property (define whether a property is searchable, queryable, retrievable or refinable), change the semantic label and add an alias to the property. Properties that are selected by default are listed below.

Source Property Label Description Schema
Author Authors Name of all the people who participated/collaborated on the item in the data source. Search, Query, Retrieve
Categories  Query, Retrieve, Refine
Content  Search, Retrieve
Created Created date time Data and time that the item was created in the data source. Query, Retrieve
CreatedBy Created by Name of the person who created the item in the data source. Search, Query, Retrieve
Excerpt Search, Retrieve
id Query, Retrieve
Tags Query, Retrieve, Refine
Title Title The title of the item that you want to be shown in Copilot and other search experiences. Search, Retrieve
Type  Query, Retrieve, Refine
Updated Last modified date time Date and time the item was last modified in the data source. Query, Retrieve
UpdatedBy Last modified by  Name of the person who most recently edited the item in the data source Search, Query, Retrieve
Url url The target URL of the item in the data source. Retrieve

Sync

You can configure full and incremental crawls based on the scheduling options present here. By default, incremental crawl is set for every 15 minutes, and full crawl is set for every day. If needed, you can adjust these schedules to fit your data refresh needs.

Troubleshooting

After publishing your connection, you can review the status under the Data Sources tab in the admin center. To learn how to make updates and deletions, see Manage your connector.

If you have issues or want to provide feedback, contact Microsoft Graph | Support.