Set-SPEnterpriseSearchCrawlContentSource
设置 Search Service 应用程序的爬网内容源的属性。
语法
Set-SPEnterpriseSearchCrawlContentSource
[-Identity] <ContentSourcePipeBind>
[-AssignmentCollection <SPAssignmentCollection>]
[-BDCApplicationProxyGroup <SPServiceApplicationProxyGroupPipeBind>]
[-Confirm]
[-CrawlPriority <CrawlPriority>]
[-CrawlScheduleDaysOfMonth <Int32>]
[-CrawlScheduleMonthsOfYear <MonthsOfYear>]
[-CrawlScheduleRepeatDuration <Int32>]
[-CrawlScheduleRepeatInterval <Int32>]
[-CrawlScheduleStartDateTime <DateTime>]
[-CustomProtocol <String>]
[-EnableContinuousCrawls <Boolean>]
[-LOBSystemSet <String[]>]
[-MaxPageEnumerationDepth <Int32>]
[-MaxSiteEnumerationDepth <Int32>]
[-MonthlyCrawlSchedule]
[-Name <String>]
[-ScheduleType <ContentSourceCrawlScheduleType>]
[-SearchApplication <SearchServiceApplicationPipeBind>]
[-StartAddresses <String>]
[-Tag <String>]
[-WhatIf]
[<CommonParameters>]
Set-SPEnterpriseSearchCrawlContentSource
[-Identity] <ContentSourcePipeBind>
[-AssignmentCollection <SPAssignmentCollection>]
[-BDCApplicationProxyGroup <SPServiceApplicationProxyGroupPipeBind>]
[-Confirm]
[-CrawlPriority <CrawlPriority>]
[-CrawlScheduleDaysOfWeek <DaysOfWeek>]
[-CrawlScheduleRepeatDuration <Int32>]
[-CrawlScheduleRepeatInterval <Int32>]
[-CrawlScheduleRunEveryInterval <Int32>]
[-CrawlScheduleStartDateTime <DateTime>]
[-CustomProtocol <String>]
[-EnableContinuousCrawls <Boolean>]
[-LOBSystemSet <String[]>]
[-MaxPageEnumerationDepth <Int32>]
[-MaxSiteEnumerationDepth <Int32>]
[-Name <String>]
[-ScheduleType <ContentSourceCrawlScheduleType>]
[-SearchApplication <SearchServiceApplicationPipeBind>]
[-StartAddresses <String>]
[-Tag <String>]
[-WeeklyCrawlSchedule]
[-WhatIf]
[<CommonParameters>]
Set-SPEnterpriseSearchCrawlContentSource
[-Identity] <ContentSourcePipeBind>
[-AssignmentCollection <SPAssignmentCollection>]
[-BDCApplicationProxyGroup <SPServiceApplicationProxyGroupPipeBind>]
[-Confirm]
[-CrawlPriority <CrawlPriority>]
[-CrawlScheduleRepeatDuration <Int32>]
[-CrawlScheduleRepeatInterval <Int32>]
[-CrawlScheduleRunEveryInterval <Int32>]
[-CrawlScheduleStartDateTime <DateTime>]
[-CustomProtocol <String>]
[-DailyCrawlSchedule]
[-EnableContinuousCrawls <Boolean>]
[-LOBSystemSet <String[]>]
[-MaxPageEnumerationDepth <Int32>]
[-MaxSiteEnumerationDepth <Int32>]
[-Name <String>]
-ScheduleType <ContentSourceCrawlScheduleType>
[-SearchApplication <SearchServiceApplicationPipeBind>]
[-StartAddresses <String>]
[-Tag <String>]
[-WhatIf]
[<CommonParameters>]
Set-SPEnterpriseSearchCrawlContentSource
[-Identity] <ContentSourcePipeBind>
[-AssignmentCollection <SPAssignmentCollection>]
[-BDCApplicationProxyGroup <SPServiceApplicationProxyGroupPipeBind>]
[-Confirm]
[-CrawlPriority <CrawlPriority>]
[-CustomProtocol <String>]
[-EnableContinuousCrawls <Boolean>]
[-LOBSystemSet <String[]>]
[-MaxPageEnumerationDepth <Int32>]
[-MaxSiteEnumerationDepth <Int32>]
[-Name <String>]
[-RemoveCrawlSchedule]
[-ScheduleType <ContentSourceCrawlScheduleType>]
[-SearchApplication <SearchServiceApplicationPipeBind>]
[-StartAddresses <String>]
[-Tag <String>]
[-WhatIf]
[<CommonParameters>]
说明
此 cmdlet 包含多个参数集。 只能使用一个参数集中的参数,而不能结合使用不同参数集中的参数。 若要详细了解如何使用参数集,请参阅 Cmdlet 参数集。
在最初配置搜索功能时以及添加任何新内容源后,cmdlet Set-SPEnterpriseSearchCrawlContentSource
会更新爬网内容源的规则。
调用此 cmdlet 一次以设置内容源的增量爬网计划,并再次调用它来设置完全爬网计划。
可选 EnableContinuousCrawls 参数的值可以为 True 或 False。 值 True 表示可对此内容源中的项进行持续爬网。 这将导致搜索系统自动启动增量爬网以处理对相应的数据存储库中的项进行的最新更改。 这有助于保持此内容源中的项的索引最新。 Search Service 应用程序管理员仍可以根据需要配置完全爬网。
有关适用于 SharePoint 产品的 Windows PowerShell 的权限和最新信息,请参阅 SharePoint Server cmdlet。
示例
--------------------示例---------------------
$ssa = Get-SPEnterpriseSearchServiceApplication
$cs = Get-SPEnterpriseSearchCrawlContentSource -Identity 'Local SharePoint Sites' -SearchApplication $ssa
$cs | Set-SPEnterpriseSearchCrawlContentSource -ScheduleType Full -DailyCrawlSchedule -CrawlScheduleRunEveryInterval 30
$cs | Set-SPEnterpriseSearchCrawlContentSource -ScheduleType Incremental -DailyCrawlSchedule -CrawlScheduleRepeatInterval 60 -CrawlScheduleRepeatDuration 1440
本示例返回“本地 SharePoint 网站”内容源,并创建一个计划,以便每 30 天运行一次完全爬网,每小时运行一次增量爬网。
参数
-AssignmentCollection
管理对象以便正确进行处理。 使用 SPWeb 或 SPSite 等对象可能会耗用大量内存,而且在 Windows PowerShell 脚本中使用这些对象需要正确管理内存。 通过使用 SPAssignment 对象,可以将对象分配给变量,然后在不需要这些对象时对它们进行处理,以释放内存。 在使用 SPWeb、SPSite 或 SPSiteAdministration 对象时,如果不使用分配集合或 Global 参数,则会自动处理这些对象。
使用全局参数时,所有对象均包含在全局存储中。
如果未立即使用对象,或未通过使用 Stop-SPAssignment
命令来处理对象,则可能会发生内存不足的情况。
Type: | SPAssignmentCollection |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | True |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-BDCApplicationProxyGroup
指定用于 business 类型内容源的代理。 此代理组必须包含默认业务数据连接元数据存储代理。
Type: | SPServiceApplicationProxyGroupPipeBind |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-Confirm
执行命令前,看到确认提示。
有关详细信息,请键入以下命令:get-help about_commonparameters
Type: | SwitchParameter |
Aliases: | cf |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-CrawlPriority
指定此内容源的优先级。
键入的值必须是以下整数之一:1=普通,2=高。
Type: | CrawlPriority |
Aliases: | p |
Accepted values: | Normal, High |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-CrawlScheduleDaysOfMonth
指定在设置 MonthlyCrawlSchedule 参数时要进行爬网的日期。
Type: | Int32 |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-CrawlScheduleDaysOfWeek
指定在设置 WeeklyCrawlSchedule 参数时要进行爬网的日期。
Type: | DaysOfWeek |
Accepted values: | Sunday, Monday, Tuesday, Wednesday, Thursday, Friday, Weekdays, Saturday, Weekends, Everyday |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-CrawlScheduleMonthsOfYear
指定在设置 MonthlyCrawlSchedule 参数时要进行爬网的月份。
Type: | MonthsOfYear |
Aliases: | month |
Accepted values: | January, February, March, April, May, June, July, August, September, October, November, December, AllMonths |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-CrawlScheduleRepeatDuration
指定爬网计划的重复次数。
Type: | Int32 |
Aliases: | duration |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-CrawlScheduleRepeatInterval
指定每次重复爬网计划的间隔分钟数。
Type: | Int32 |
Aliases: | interval |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-CrawlScheduleRunEveryInterval
指定两次爬网之间的间隔。
如果设置了 DailyCrawlSchedule 参数,则指定爬网的间隔天数。
如果设置了 WeeklyCrawlSchedule 参数,则指定爬网的间隔周数。
Type: | Int32 |
Aliases: | every |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-CrawlScheduleStartDateTime
指定爬网的初始日期。 默认值为当前的午夜。
Type: | DateTime |
Aliases: | start |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-CustomProtocol
指定由自定义连接器处理的用于此内容源的自定义协议。
Type: | String |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-DailyCrawlSchedule
计划基于爬网的间隔天数。
Type: | SwitchParameter |
Aliases: | daily |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-EnableContinuousCrawls
指定 EnableContinuousCrawls 参数的值:True 或 False。
Type: | Boolean |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-Identity
指定要更新的爬网内容源。
键入的值必须为 12345678-90ab-cdef-1234-567890bcdefgh 形式的有效 GUID;ContentSource 对象的有效名称(如 ContentSource1);或有效 ContentSource 对象的实例。
Type: | ContentSourcePipeBind |
Position: | 0 |
Default value: | None |
Required: | True |
Accept pipeline input: | True |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-LOBSystemSet
为 business 类型内容源指定一个以逗号分隔的业务数据连接元数据存储系统名称和系统实例名称的列表。
Type: | String[] |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-MaxPageEnumerationDepth
为 web 或 custom 类型内容源指定爬网程序从开始地址爬网到内容项可执行的页面跃点数。
Type: | Int32 |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-MaxSiteEnumerationDepth
为 web 或 custom 类型内容源指定爬网程序从开始地址爬网到内容项可执行的网站跃点数。
Type: | Int32 |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-MonthlyCrawlSchedule
计划基于爬网的间隔月份数。
Type: | SwitchParameter |
Aliases: | monthly |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-Name
为内容源指定新的显示名称。
键入的值必须是有效的内容源名称;例如,ContentSource1。
Type: | String |
Aliases: | n |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-RemoveCrawlSchedule
删除指定的爬网。
Type: | SwitchParameter |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-ScheduleType
指定爬网计划的类型。
键入的值必须是以下项之一:Full 或 Incremental。
Type: | ContentSourceCrawlScheduleType |
Accepted values: | Full, Incremental, Full, Incremental, Full, Incremental, Full, Incremental |
Position: | Named |
Default value: | None |
Required: | True |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-SearchApplication
指定包含内容源的搜索应用程序。
键入的值必须是 12345678-90ab-cdef-1234-567890bcdefgh 形式的有效 GUID;有效的搜索应用程序名称(如 SearchApp1);或有效 SearchServiceApplication 对象的实例。
Type: | SearchServiceApplicationPipeBind |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-StartAddresses
指定对此内容源进行爬网的起始 URL 的列表(用逗号分隔)。
类型必须是有效的 URL,格式为 https://server_name.
Type: | String |
Aliases: | s |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-Tag
指定自定义内容源设置的修改页面的 URL。 指定 URL 的字符串最多可包含 1,024 个字符。
类型必须是有效的 URL,格式为 https://server_name.
Type: | String |
Aliases: | t |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-WeeklyCrawlSchedule
计划基于爬网的间隔周数。
Type: | SwitchParameter |
Aliases: | weekly |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
-WhatIf
显示一条描述命令作用的消息,而不执行命令。
有关详细信息,请键入以下命令:get-help about_commonparameters
Type: | SwitchParameter |
Aliases: | wi |
Position: | Named |
Default value: | None |
Required: | False |
Accept pipeline input: | False |
Accept wildcard characters: | False |
Applies to: | SharePoint Server 2010, SharePoint Server 2013, SharePoint Server 2016, SharePoint Server 2019 |
输入
Microsoft.Office.Server.Search.Cmdlet.ContentSourcePipeBind
Microsoft.SharePoint.PowerShell.SPAssignmentCollection
输出
System.Object