SharePoint 2010: Only the start address URL of a PHP-based Web site content source is crawled by the FAST Search connector

When you crawl a PHP-based Web site, the SharePoint crawler (with a FAST back end) processes only the first page and does not follow any of the links on it. The "normal" SharePoint crawler (not connected to a FAST back end) crawls the same site without error.
The PHP file extension is not included in the ExtensionsToFilter list of the extended connector (FAST connector) property.
Add the PHP file extension for the FAST connector by using 'Set-SPEnterpriseSearchExtendedConnectorProperty'.

First get the current value for the extended connector (FAST connector) property using the following PowerShell command:

Get-SPEnterpriseSearchExtendedConnectorProperty -SearchApplication $searchApp -Identity ExtensionsToFilter
where $searchApp is the FAST connector Search Service Application (SSA).

The returned value is similar to ";ascx;asp;aspx;htm;html;jhtml;jsp;".

Then set the value for the extended connector (FAST connector) property, with php appended to the list, using the following PowerShell command:

Set-SPEnterpriseSearchExtendedConnectorProperty -SearchApplication $searchApp -Identity ExtensionsToFilter -Value ";ascx;asp;aspx;htm;html;jhtml;jsp;php;"
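
The two steps above can also be combined so that the new value is built from whatever list is currently configured instead of being typed by hand. The following is a minimal sketch, not an official procedure. The helper name Add-ExtensionToFilterList is hypothetical, and the commented usage lines assume that the SSA is named "FAST Content SSA" and that the object returned by Get-SPEnterpriseSearchExtendedConnectorProperty exposes the list through a Value property. Verify both in your environment before you run them.

```powershell
# Hypothetical helper (not a SharePoint cmdlet): returns the semicolon-delimited
# extension list with one extension appended, unless it is already present.
function Add-ExtensionToFilterList {
    param(
        [Parameter(Mandatory = $true)][AllowEmptyString()][string]$Current,
        [Parameter(Mandatory = $true)][string]$Extension
    )
    # Normalize the input: "php", ".php" and ".PHP" are all treated as "php".
    $ext = $Extension.Trim().TrimStart('.').ToLowerInvariant()
    # Split on ';' and drop the empty entries produced by the leading and trailing ';'.
    $items = @($Current.Split(';') | Where-Object { $_ -ne '' })
    if ($items -notcontains $ext) { $items += $ext }
    # Rebuild the list in the same ";a;b;c;" shape that the property uses.
    return ';' + ($items -join ';') + ';'
}

# Usage on a SharePoint server (assumptions: the SSA name and the .Value property):
# $searchApp = Get-SPEnterpriseSearchServiceApplication -Identity "FAST Content SSA"
# $current = (Get-SPEnterpriseSearchExtendedConnectorProperty -SearchApplication $searchApp -Identity ExtensionsToFilter).Value
# $new = Add-ExtensionToFilterList -Current $current -Extension 'php'
# Set-SPEnterpriseSearchExtendedConnectorProperty -SearchApplication $searchApp -Identity ExtensionsToFilter -Value $new
```

Because the helper leaves the list unchanged when the extension is already present, it should be safe to run more than once, and it preserves any extensions that were added to the property earlier.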

- Set-SPEnterpriseSearchExtendedConnectorProperty
- About Windows PowerShell cmdlets (FAST Search Server 2010 for SharePoint)
Note This is a "FAST PUBLISH" article created directly from within the Microsoft support organization. The information contained herein is provided as-is in response to emerging issues. As a result of the speed in making it available, the materials may include typographical errors and may be revised at any time without notice. See Terms of Use for other considerations.

Article ID: 2550268 - Last Review: 06/04/2011 19:51:00 - Revision: 5.0

Microsoft SharePoint Server 2010, Microsoft FAST Search Server 2010 for SharePoint
