After you create a content source in the FAST Content Search Service Application (SSA) and start a full crawl, you may observe that the number of documents successfully crawled and submitted to Microsoft FAST Search Server 2010 for SharePoint is greater than the number of documents that FAST indexes and makes searchable.
This issue occurs because the FAST document processing pipeline may drop documents if the documents contain META tags such as "noindex."
The Windows PowerShell command Get-FASTSearchContentCollection shows all content collections and the number of documents in each collection. This number represents all successfully indexed documents from all content sources in the FAST Content SSA and from any other FAST Search-specific connectors.
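As a minimal sketch, the indexed count for each collection could be listed as follows and then compared with the crawl success count reported for the content source. This assumes the command is run in the FAST Search Server 2010 for SharePoint shell on a FAST Search server, and that the returned objects expose Name and DocumentCount properties. Verify the property names against the output in your own environment.

```powershell
# Sketch only: list each content collection with its indexed document count.
# Assumption: the output objects expose Name and DocumentCount properties.
Get-FASTSearchContentCollection | Format-List Name, DocumentCount
```

A DocumentCount value lower than the number of items that the crawl log reports as successfully crawled suggests that some documents were dropped during document processing.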
You can use the document processing tools to check which documents are dropped. To do this, follow these steps:
In the FAST Search management console, use the following commands to enable debug and trace output for the document processors:
psctrl debug on
psctrl doctrace on
Re-crawl the content, and then use the following command to see the detailed output for each processed document:
This shows successfully processed documents and any dropped documents, with detailed information.
Use the following commands to turn off debug and trace:
psctrl debug off
psctrl doctrace off