Documents are partially indexed by Exchange search when they embed other documents in Exchange Server 2013

Vzťahuje sa na: Exchange Server 2013 EnterpriseExchange Server 2013 Standard Edition

Symptoms


When you run the Get-FailedContentIndexDocuments cmdlet to search for items that can't be indexed by Exchange Search in a Microsoft Exchange Server 2013 environment, the messages that attach a document and the document embeds other documents are listed as a failed item. Additionally, when you use In-Place eDiscovery and Outlook Web App (OWA) to search for this kind of message, the related messages are unavailable. The extract content that's parsed from an unsearchable item looks something like this:
Description: The document parser encountered a processing error.
AdditionalInfo: 309003 Document 'exchange://localhost/Attachment/XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX/XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX/XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX/XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX/XXXXXXXXXXXX.X/UseCaseSpecificationMS Exchange_ILM_Year_Mon_Date_DRAFT.docx' was partially processed. The parser was not able to parse the whole document.

Cause


This issue occurs because the formats for embedding documents are parsed by IFilters. However, the .pptx, .docx, .text, .xml, .gif, .jpg, and .pdf formats aren't parsed by IFilters. Therefore, these documents are partially processed.

Resolution


To fix this issue, install Cumulative Update 10 for Exchange Server 2013.

Note This update provides support for embedded documents in the .pptx and .docx formats.

Status


Microsoft has confirmed that this is a problem in the Microsoft products that are listed in the "Applies to" section.