SharePoint 2013 Crawl Tuning Part 2: Content Processing Component

In part 1, we took a crawl performance baseline and came to the conclusion that our feeding bottleneck was the content processing component. So, we have expanded our SharePoint farm to two servers and we have added an additional Contend Processing Component on our new server. Our search topology now looks like this:

We'll now rerun the same full crawl test we performed in part 1 and collect counters from both servers.

Our transactions waiting is still increasing, so we still see that the bottleneck is downstream of the crawler. 

 

The Content Processing Component still looks like our bottleneck, but there is slightly less load now that is spread over two CPCs. Here is the Search Flow Statistics\Input Queue Full Time to compare against part 1.

 

Now, the most important metric: did we improve our crawl time? By adding this single component to our search topology, we were able to cut our time for a full crawl nearly in half. Going from 85 minutes to 45 minutes. This dramatic improvement in performance shows how valuable tuning your search infrastructure can be.

In future installments we'll take a look at what to do when other components are the bottleneck. See part 3

Comments

  • Anonymous
    January 01, 2003
    Wow, I love this - opened a new horizon for me. Thank you so much! I hope there are many more of these articles. Cheers Max
  • Anonymous
    January 30, 2014
    I wrote a basic overview of some of the counters needed to monitor and tune content feeding here . But
  • Anonymous
    February 01, 2014
    my week: SP2013 Sizing & Tuning for fast crawling systems, SharedDocs Integration, Audio & Video
  • Anonymous
    February 07, 2014
    The comment has been removed
  • Anonymous
    December 11, 2014
    Answered my question exactly and provided the requisite irrefutable proof to support my recommendation. Excellent work. Thank you for sharing.