Symptoms
When you run the Get-FailedContentIndexDocuments cmdlet to search for items that can't be indexed by Exchange Search in a Microsoft Exchange Server 2013 environment, the messages that attach a document and the document embeds other documents are listed as a failed item. Additionally, when you use In-Place eDiscovery and Outlook Web App (OWA) to search for this kind of message, the related messages are unavailable. The extract content that's parsed from an unsearchable item looks something like this:
Description: The document parser encountered a processing error.
AdditionalInfo: 309003 Document 'exchange://localhost/Attachment/XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX/XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX/XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX/XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX/XXXXXXXXXXXX.X/UseCaseSpecificationMS Exchange_ILM_Year_Mon_Date_DRAFT.docx' was partially processed. The parser was not able to parse the whole document.Cause
This issue occurs because the formats for embedding documents are parsed by IFilters. However, the .pptx, .docx, .text, .xml, .gif, .jpg, and .pdf formats aren't parsed by IFilters. Therefore, these documents are partially processed.
Resolution
To fix this issue, install Cumulative Update 10 for Exchange Server 2013. Note This update provides support for embedded documents in the .pptx and .docx formats.
Status
Microsoft has confirmed that this is a problem in the Microsoft products that are listed in the "Applies to" section.