Configure historical versions in eDiscovery (preview)
The historical versions (preview) feature in eDiscovery (preview) lets eDiscovery managers in your organization search for and collect content from all versions of documents stored in SharePoint and OneDrive. Then you can add that content to a review set for analysis and review. This feature helps you find and review content from a specific version of a document that may be relevant to a case or investigation, even if the latest version of the same document doesn't contain the relevant information.
To support the historical versions capability in eDiscovery, SharePoint administrators must enable versioning for sites in their organization. Then, when users modify documents in SharePoint or OneDrive, implicit regular versions are created when document is saved (or autosaved). SharePoint versioning allows for tracking of the activity performed on SharePoint items (including documents, events, and tasks). This versioning capability leaves an audit trail that can provide evidence in legal investigations. These older versions of a document are available to the organization, who may be required to share such versions that have sensitive or relevant content during court discovery in a legal matter.
After an eDiscovery administrator turns on historical versions for the organization and then activates it for specific SharePoint sites, the SharePoint content push service crawls all major and minor versions of documents on the activated sites and then sends those versions for indexing. After the crawling and indexing process is complete, documents and their versions are available for eDiscovery search. As long as a specific version can be accessed (by version history), then that version is discoverable in an eDiscovery search.
Tip
Get started with Microsoft Security Copilot to explore new ways to work smarter and faster using the power of AI. Learn more about Microsoft Security Copilot in Microsoft Purview.
SharePoint site performance
Once historical versions is enabled for a site, the performance for the site will the same as it was before the site was enabled. The crawling and indexing processes that are performed on the site after it's enabled occur at a slower rate and be performed during off-peak hours. Enabling historical versions for a site kicks off a backfill process, which finds all the versions of documents on the site and then sends those versions to the index. Depending on the number of document versions for site, this backfill process might impact service health. This impact is mitigated in the following ways:
- We make the best effort to process these versions during off-peak hours.
- We process document versions in our lowest priority queues, which allows most service resources to be delegated to user changes.
Set up historical versions
To enable historical versions in eDiscovery, your organization has to turn it on and then activate specific sites so that all versions of documents stored on those sites are indexed for search. Before you set up eDiscovery for historical versions, you have to enable versioning support in SharePoint.
Step 1: Turn on versioning in SharePoint
The first step is to turn on versioning in SharePoint so that all versions of a document are preserved. For instructions, see Versioning in SharePoint.
Important
Adding SharePoint sites to historical versions isn't currently supported in in multi-geo environments. Only the primary domain of your organization is enabled for historical versions.
Step 2: Turn on historical versions
Important
After you turn on historical versions, you won't be able to turn it off during the public preview. You can turn it off after historical versions is released for general availability.
The next step is to turn on historical versions in eDiscovery. To turn on historical versions for your organization, a person must be a global administrator or an eDiscovery Administrator (a member of the eDiscovery Administrator subgroup in the eDiscovery Manager role group). After historical versions is turned on, it will apply to your entire organization.
Important
Microsoft recommends that you use roles with the fewest permissions. Minimizing the number of users with the Global Administrator role helps improve security for your organization. Learn more about Microsoft Purview roles and permissions.
In the Microsoft Purview portal, navigate to Settings> eDiscovery > *Historical versions (preview).
On the Settings page, select the Historical versions (preview) tab, and then switch the Historical versions tenant control toggle to on.
Note
A warning is displayed that says you won't be able to turn off historical versions. You won't be able to turn off historical versions until the feature is released for general availability.
Select Yes to turn on historical versions.
Step 3: Activate SharePoint sites
After you turn on historical versions for your organization, the last step is to activate SharePoint sites to support historical versions. When you activate a site (by adding it to a list of sites on the Historical versions tab), the site is recrawled and all versions of documents stored on that site are indexed for search.
Note
There is a limit of 100 site activations per organization during the public preview of historical versions. An activation is counted against this limit whenever you enable or disable a site for historical versions. If you enable multiple sites, each site is counted as a single activation. The total number of activations is displayed on the Historical versions tab.
On the Historical versions tab on the eDiscovery Settings page, select Enable to activate sites for historical versions. A flyout out page is displayed that contains a list of all SharePoint sites in your organization.
Select a site to activate, and then select Enable to activate it for historical versions. You can use the search box to search for a specific site.
Note
A warning is displayed that says document versions on the site are indexed and this indexing process will take some time before the versions are ready to be searched. The warning also states that you won't be able to disable historical versions for the selected site for 30 days.
Select Yes to activate the site for historical versions. The site is added to the list of activated sites. The counter showing the number of site activations is also updated.
Repeat the previous steps for each site that you want to activate for historical versions.
Based on the number of documents for a site and the average number of versions per document, we try to estimate the total number of files per site. Based on this, an estimate of how long it may take to index is as follows:
Number of versions / (Processing rate of 100,000 files per day ) + .5 days = Total number of days to process a site
The half-day is added as a buffer since indexing of versions on site is optimized to run during off-peak hours.
For example, if the total number of documents and all versions for a site is 150,000 then it takes at least two days to process the site for historical versions:
150,000 files/100,000 files per day + .5 days = 2 days
Reactivating and deactivating sites
When you deactivate a previously activated site, a clean-up process is triggered. This process takes time to complete. If that same site is activated again, the backfill process of reindexing the site has to be rerun. Both the clean-up and backfill processes are time and resource-intensive. We recommended that you carefully consider and plan which sites you want to activate for historical version so you can avoid repeatedly activating and deactivating historical versions for a site.