Article ID: 923317
Consider the following scenario:
Additionally, if you use a hyphen in the thesaurus file, an error message is logged in the event log. For example, the thesaurus file contains the following tag:
The following event is logged when you run a full-text search query that uses this thesaurus file:
Note The special characters that are listed here are only examples. Other special characters may also cause the problem.
To work around this problem, avoid using special characters in thesaurus file terms.
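The workaround can be checked before a thesaurus file is deployed. The following Python sketch flags any term that contains a character other than a letter, a digit, or white space. The function name find_special_characters and the sample terms are hypothetical, and the sketch assumes that the terms have already been extracted from the thesaurus file as plain strings. It does not reproduce the rules that the SQL Server word breaker actually uses.

```python
def find_special_characters(terms):
    """Return a dict that maps each suspect term to its special characters.

    Assumption: any character that is not a letter, a digit, or white space
    counts as "special". The real word breaker may use a different rule.
    """
    flagged = {}
    for term in terms:
        special = sorted({ch for ch in term
                          if not (ch.isalnum() or ch.isspace())})
        if special:
            flagged[term] = special
    return flagged


# Hypothetical terms, taken from the examples in this article.
terms = ["Windows", "Windows&you", "well-known celebrity"]
print(find_special_characters(terms))
# {'Windows&you': ['&'], 'well-known celebrity': ['-']}
```

Terms that the check reports can then be rewritten without the special character before the file is used.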
Microsoft has confirmed that this is a problem in the Microsoft products that are listed in the "Applies to" section.
When the word breaker encounters a word-breaking character in a term, the word breaker parses the character as a white space character. For example, if the word breaker encounters the term "look&out," the word breaker parses the term as the two words "look" and "out."
Note Word-breaking characters include the following:
When the word breaker encounters a hyphen (-) in a term, the word breaker correctly parses the term. However, the full-text thesaurus component treats the hyphen, together with the characters that the hyphen connects, as empty characters. For example, if the original term is "well-known celebrity," the term appears as "celebrity" in the thesaurus file.
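The hyphen behavior that is described above can be modeled in a few lines of Python. This is only an illustrative sketch: the function name drop_hyphenated_parts is hypothetical, and the sketch assumes that words in a term are separated by white space. It is not the code that the full-text thesaurus component runs.

```python
def drop_hyphenated_parts(term):
    """Model the described behavior: every word that contains a hyphen is
    treated as empty, so only the remaining words survive."""
    kept = [word for word in term.split() if "-" not in word]
    return " ".join(kept)


print(drop_hyphenated_parts("well-known celebrity"))  # celebrity
print(repr(drop_hyphenated_parts("well-known")))      # ''
```

In this model, a term that consists only of a hyphenated word is reduced to an empty string.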
The problem that is described in the "Symptoms" section may also cause a duplicate entry or an empty entry for a term in the thesaurus file. The full-text thesaurus component cannot load a thesaurus file if the file contains a duplicate entry or an empty entry.
Note In SQL Server 2005, if the full-text thesaurus component finds an error in the thesaurus file, the full-text thesaurus component does not load the thesaurus file.
For example, you define the following terms in the thesaurus file:
In this scenario, the word breaker parses the ampersand as a white space character. By default, the word "you" is listed in the noise file and is ignored. Therefore, the word breaker parses the term "Windows&you" as "Windows." Because the thesaurus file already contains the term "Windows," the thesaurus file now contains duplicate terms, and the full-text thesaurus component does not load the thesaurus file.
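The scenario above can be traced step by step with a short Python sketch. The names WORD_BREAKING_CHARS, NOISE_WORDS, parse_term, and find_duplicates are hypothetical. The sketch assumes that the ampersand is the only word-breaking character and that "you" is the only noise word. The real word breaker and noise file cover many more characters and words.

```python
# Assumptions for illustration only: one word-breaking character and one
# noise word, taken from the examples in this article.
WORD_BREAKING_CHARS = {"&"}
NOISE_WORDS = {"you"}


def parse_term(term):
    """Replace word-breaking characters with spaces, then drop noise words."""
    table = {ord(ch): " " for ch in WORD_BREAKING_CHARS}
    words = term.translate(table).split()
    return " ".join(w for w in words if w.lower() not in NOISE_WORDS)


def find_duplicates(terms):
    """Return pairs of original terms that parse to the same text."""
    seen = {}
    duplicates = []
    for term in terms:
        key = parse_term(term).lower()
        if key in seen:
            duplicates.append((seen[key], term))
        else:
            seen[key] = term
    return duplicates


print(parse_term("look&out"))                       # look out
print(parse_term("Windows&you"))                    # Windows
print(find_duplicates(["Windows", "Windows&you"]))  # [('Windows', 'Windows&you')]
```

A pair in the result indicates two terms that would collide after parsing, which is the condition that prevents the thesaurus file from loading in this scenario.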
Article ID: 923317 - Last Review: September 22, 2006 - Revision: 2.1