SharePoint 2010: Only the Start address URL of a PHP based Web site content source is being crawled by FAST search connector
When trying to crawl a PHP based web site, the SharePoint crawler (with FAST backend) only processes the first page and will not follow any links thereafter. The "normal" SharePoint crawler (not connected to a FAST backend) crawls this site without error.
The PHP file extension is not included in the ExtensionsToFilter extended connector (FAST connector) property.
Add the file extension PHP for the FAST connector using 'Set-SPEnterpriseSearchExtendedConnectorProperty'.
First get the current value for the extended connector (FAST connector) property using the following PowerShell command:
Get-SPEnterpriseSearchExtendedConnectorProperty -SearchApplication $searchApp -Identity ExtensionsToFilter
where $searchApp is the FAST connector Search Service Application (SSA).
The value returned would be something like “;ascx;asp;aspx;htm;html;jhtml;jsp;”.
Then set the extended connector (FAST connector) property to the same list with php appended, using the following PowerShell command:
Set-SPEnterpriseSearchExtendedConnectorProperty -SearchApplication $searchApp -Identity ExtensionsToFilter -Value ";ascx;asp;aspx;htm;html;jhtml;jsp;php;"
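The two steps above can be combined into one short script, so that the existing list is read and extended rather than retyped by hand. This is a minimal sketch, not an official procedure. It assumes it runs in the SharePoint 2010 Management Shell, that "FAST Content SSA" is replaced with the name of your own FAST connector SSA (the name here is a placeholder), and that the object returned by Get-SPEnterpriseSearchExtendedConnectorProperty exposes the list through a Value property.

```powershell
# Assumption: "FAST Content SSA" is a placeholder; use the name of your FAST connector SSA.
$searchApp = Get-SPEnterpriseSearchServiceApplication -Identity "FAST Content SSA"

# Read the current semicolon-delimited list, for example ";ascx;asp;aspx;htm;html;jhtml;jsp;".
# Assumption: the returned object exposes the list as .Value.
# The [string] cast turns a missing value into an empty string instead of $null.
[string]$current = (Get-SPEnterpriseSearchExtendedConnectorProperty `
    -SearchApplication $searchApp -Identity ExtensionsToFilter).Value

# Append "php" only if it is not already in the list (-notlike is case-insensitive).
if ($current -notlike "*;php;*") {
    $new = $current.TrimEnd(';') + ";php;"
    Set-SPEnterpriseSearchExtendedConnectorProperty `
        -SearchApplication $searchApp -Identity ExtensionsToFilter -Value $new
}
```

Appending to the value that was read, instead of hard-coding the full list, avoids accidentally dropping extensions that were added to the property earlier.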
- Set-SPEnterpriseSearchExtendedConnectorProperty http://technet.microsoft.com/en-us/library/ff608013.aspx
- About Windows PowerShell cmdlets (FAST Search Server 2010 for SharePoint) http://technet.microsoft.com/en-us/library/ff393782.aspx
Article ID: 2550268 - Last Review: 06/04/2011 19:51:00 - Revision: 5.0
Microsoft SharePoint Server 2010, Microsoft FAST Search Server 2010 for SharePoint