Sites that require forms-based authentication or cookie-based authentication are not crawled in SharePoint Server 2007
Note This problem does not apply to content that is saved on a SharePoint Server 2007 site or on a Windows SharePoint Services 3.0 site.For these sites, you must configure the Web application default zone to use NTLM authentication to index the sites. For more information, visit the following Microsoft Web site:
For more information about the AddRule.exe command-line tool, see the "More Information" section.
How to obtain the hotfixThis problem is fixed in a Microsoft Office SharePoint Services hotfix package.
For more information about the SharePoint Server 2007 hotfix package, click the following article number to view the article in the Microsoft Knowledge Base:
|AddRule.exe||This command displays the following help text: |
Usage: AddRule.exe <xml file>
The structure of the input file is specified in the instructions provided with this hotfix.
|AddRule.exe input xml file||This command will add the crawl rule based on the XML file. These rules are added to the end of their current set of crawl rules. The administrator can later change the order by using the user interface.|
If the XML file is malformed, you may receive an error message.
- If there is no <Rules> tag, you receive the following error message: Syntax error: [rules] element not found as the only node at the root.
- If a required node is missing in the XML file, you receive the following error message: Syntax error: <missing node> element unexpected.
- If a node in the XML file is incorrectly duplicated, you receive the following error message: Syntax error: <node name> element already exists for the current rule
- If the type is not "FORM" or "COOKIE," you receive the following error message: Syntax error: unrecognized value for the <type> element
- If the login_type is not "POST," you receive the following error message: Syntax error: unrecognized value for the <login_type> elementNote If the administrator reruns this command by using another input file and then finds that the path is identical to an existing rule, the command will modify the rule.
Crawl rules object modelThe CrawlRuleAuthenticationType enumeration includes the following new values:
- FormsRuleAccess = 4
- CookieRuleAccess = 5
The forms-based authentication rule takes the following input parameters in the following order:
- type::CrawlRuleAuthenticationType: This will be FormsRuleAccess.
- AuthSubmissionMethod::String: This will be "POST."
- AuthSubmissionPath::String: This is the URL in which the parameters should be posted.
- authData::NameValueCollection: This is where the hidden name value pairs are stored.
- privateAuthData:: NameValueCollection: This is where the encrypted name value pairs such as user names and passwords are stored.
- errorPages::StringCollection: This will store the various error pages that would indicate to the crawler to refetch a cookie or to fail the URL with an "Access Denied" error message.
- type::CrawlRuleAuthenticationType: This will be CookieRuleAccess.
- cookies::StringCollection: This will store the cookies that the crawler should use.
- errorPages::StringCollection: This will store the various error pages that would indicate to the crawler to fail the URL with an "Access Denied" error message.
Custom security trimming for Enterprise Search resultsAfter the content that is stored on sites that require forms-based authentication or cookie-based authentication is crawled by the indexer, the content search results are returned for all users regardless of the users' credentials. If you have to security trim this content, refer to the "Custom Security Trimming for Enterprise Search Results Overview" topic. To view this topic, visit the following Microsoft Web site:
Article ID: 934577 - Last Review: 01/16/2015 02:02:00 - Revision: 3.0
- kbnosurvey kbarchive kbmoss2007postrtmfix kberrmsg kbexpertiseinter kbqfe kbbug kbfix KB934577